ITM 102 Chapter 6: Business Intelligence & Databases

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Which of the following is the correct sequence in the data hierarchy from the smallest to the largest?

  • Bit, Byte, Field, Record, File, Database (correct)
  • Database, File, Record, Field, Byte, Bit
  • Byte, Bit, Field, Record, File, Database
  • Bit, Field, Byte, Record, File, Database

In the context of file organization terms, what constitutes a 'Record'?

  • A group of related fields. (correct)
  • A collection of databases.
  • A group of related files.
  • A single character, word, or number

What is the primary purpose of an 'attribute' in the context of databases?

  • To identify a person, place, or thing.
  • To serve as the main storage unit for data.
  • To represent a collection of related files.
  • To describe a characteristic or quality of an entity. (correct)

Which of the following scenarios best illustrates 'data redundancy', a common problem in traditional file environments?

<p>A customer's address is stored in both the sales and billing departments' files. (A)</p> Signup and view all the answers

Which of the following is NOT a typical problem associated with traditional file environments?

<p>Data consistency. (B)</p> Signup and view all the answers

What is a key difference between a database and a traditional file system in terms of data management?

<p>A database centralizes data and controls redundancy, whereas a file system maintains data separately. (C)</p> Signup and view all the answers

What is a primary benefit of a Database Management System (DBMS)?

<p>DBMS separates the logical and physical views of data. (B)</p> Signup and view all the answers

In a relational database, what are the 'fields' of a table more commonly known as?

<p>Columns (D)</p> Signup and view all the answers

What is the role of a 'primary key' in a relational database table?

<p>To uniquely identify each record in the table. (D)</p> Signup and view all the answers

What is the purpose of a 'foreign key' in a relational database?

<p>To serve as a lookup field, linking to the primary key in another table. (C)</p> Signup and view all the answers

Which operation in a relational DBMS is responsible for creating a subset of data of all records that meet specific criteria?

<p>SELECT (C)</p> Signup and view all the answers

Which operation in a relational DBMS combines data from multiple tables to provide more information?

<p>JOIN (B)</p> Signup and view all the answers

What is the primary goal of normalization in database design?

<p>To organize data in a way that minimizes redundancy and awkward many-to-many relationships. (A)</p> Signup and view all the answers

What is 'referential integrity' in the context of relational databases?

<p>It ensures that relationships between tables remain consistent. (A)</p> Signup and view all the answers

In database design, what is the purpose of an Entity-Relationship Diagram (ERD)?

<p>To visually represent the relationships between entities in a database. (B)</p> Signup and view all the answers

Which of the following describes a key characteristic of NoSQL databases?

<p>They offer a more flexible data model compared to relational databases. (B)</p> Signup and view all the answers

What is a key advantage of cloud databases for start-ups and smaller businesses?

<p>They provide an affordable and easily scalable database solution. (D)</p> Signup and view all the answers

What is a defining characteristic of a blockchain database?

<p>Distributed ledgers in a peer-to-peer network. (B)</p> Signup and view all the answers

Which of the following is a key characteristic of 'big data'?

<p>Massive sets of unstructured and semi-structured data. (A)</p> Signup and view all the answers

Why is specialized infrastructure often required for managing big data?

<p>Because typical DBMS are not designed to handle the volume and complexity of big data. (B)</p> Signup and view all the answers

What is the primary function of a data warehouse in a business intelligence infrastructure?

<p>To store and analyze historical data from various sources for decision-making. (B)</p> Signup and view all the answers

How does a data mart differ from a data warehouse?

<p>A data mart is a subset of a data warehouse, focusing on a specific subject or line of business. (A)</p> Signup and view all the answers

What is the main purpose of Hadoop Distributed File System (HDFS)?

<p>To distribute data storage across multiple computers. (A)</p> Signup and view all the answers

What is the role of 'MapReduce' in the Hadoop ecosystem?

<p>To break data into clusters and manage them. (B)</p> Signup and view all the answers

What is the primary benefit of in-memory computing in business intelligence?

<p>Faster data retrieval and processing speeds. (D)</p> Signup and view all the answers

What is the primary goal of Online Analytical Processing (OLAP)?

<p>To support multidimensional data analysis for better decision-making. (D)</p> Signup and view all the answers

What does the multidimensional data model used in OLAP allow users to do?

<p>Analyze data from various dimensions such as product, region, and time. (A)</p> Signup and view all the answers

What is the primary goal of data mining?

<p>To discover hidden patterns and relationships in large datasets. (D)</p> Signup and view all the answers

Which of the following is NOT a type of information commonly obtainable from data mining?

<p>Data encryption (A)</p> Signup and view all the answers

What is the primary focus of text mining?

<p>Discovering and analyzing patterns in unstructured text data. (B)</p> Signup and view all the answers

What sets Web mining apart from other data analysis techniques?

<p>It discovers and analyzes useful patterns and information from the web. (C)</p> Signup and view all the answers

What advantage does using the web offer for database access?

<p>It is relatively inexpensive to web enable a database. (C)</p> Signup and view all the answers

What is the main goal of data governance?

<p>To manage data as an organizational resource through policies, with rules for sharing included. (A)</p> Signup and view all the answers

Which of the following aims does data quality assurance accomplish?

<p>Identifying and correcting faulty data before a database is implemented, and establishing proper procedure. (C)</p> Signup and view all the answers

What steps must a firm take before implementing a new database to ensure data quality assurance?

<p>Correct faulty data and determine procedure. (C)</p> Signup and view all the answers

What is the most important aspect of data quality assurance?

<p>Enhance customer satisfaction. (A)</p> Signup and view all the answers

What is the primary step in ensuring data quality?

<p>Data cleansing. (B)</p> Signup and view all the answers

Which of the following would benefit MOST from data governance?

<p>A company that wants to promote better organizational understandings. (D)</p> Signup and view all the answers

Flashcards

What is a database?

A group of related files

What is a file?

A group of records of the same type.

What is a record?

A group of related fields.

What is a field?

A group of characters as word(s) or number(s).

Signup and view all the flashcards

What is an entity?

The person, place, thing on which we store information.

Signup and view all the flashcards

What is an attribute?

Each characteristic, or quality, describing entity.

Signup and view all the flashcards

What is the problem with Traditional File Environment?

Files are maintained separately by different departments leading to data redundancy and inconsistency.

Signup and view all the flashcards

What is a Database?

Serves many applications by centralizing data and controlling redundant data.

Signup and view all the flashcards

What is DBMS?

Interfaces between applications and physical data files and separates logical/physical views of data.

Signup and view all the flashcards

What does Relational DBMS do?

Represents data as two-dimensional tables.

Signup and view all the flashcards

What are Rows (tuples)?

Records for different entities.

Signup and view all the flashcards

What are Fields (columns)?

Represents attribute for entity.

Signup and view all the flashcards

What is the Key field?

Field used to identify each record uniquely.

Signup and view all the flashcards

What is Primary key?

Field in a table used for key fields.

Signup and view all the flashcards

What is Foreign key?

Primary key used in the second table as a look-up field to identify records from the original table.

Signup and view all the flashcards

What is SELECT?

Creates a subset of data of all records that meet stated criteria

Signup and view all the flashcards

What is JOIN?

Combines relational tables to provide more information

Signup and view all the flashcards

What is PROJECT?

Creates a subset of columns in the table, containing only the information specified

Signup and view all the flashcards

What is Normalization?

Streamlining complex groupings of data to minimize redundant data elements and awkward many-to-many relationships

Signup and view all the flashcards

What is Referential Integrity?

Rules used by RDBMS to ensure relationships between tables remain consistent

Signup and view all the flashcards

What are NoSQL Databases useful for?

Non-relational databases are used for more flexible data models.

Signup and view all the flashcards

Who benefits from cloud databases?

Cloud databases offer appeal to start-ups and smaller businesses.

Signup and view all the flashcards

What is Big Data?

Massive sets of unstructured/semi-structured data from web traffic, social media, sensors, and so on.

Signup and view all the flashcards

What is Business Intelligence Infrastructure?

An array of tools for obtaining information from separate systems and big data.

Signup and view all the flashcards

Purpose of a Data Warehouse?

Stores current and historical data from many core operational transaction systems

Signup and view all the flashcards

What is a data mart?

Smaller, focused version of data warehouse

Signup and view all the flashcards

What is Hadoop?

Distributed parallel processing of big data across inexpensive computers

Signup and view all the flashcards

What is In-memory computing?

Used in big data analysis, uses computer main memory (RAM) for data storage

Signup and view all the flashcards

What are Analytical Tools for?

Consolidating, analyzing, and providing access to vast amounts of data to make better business decisions

Signup and view all the flashcards

What is Online Analytical Processing (OLAP)?

Supports multidimensional data analysis.

Signup and view all the flashcards

What is Data Mining?

Finds hidden patterns and relationships in datasets.

Signup and view all the flashcards

What is Text mining?

Extracts key elements from large unstructured text data sets

Signup and view all the flashcards

What is Web mining?

Discovery and analysis of useful patterns and information from web

Signup and view all the flashcards

What is Data Governance?

Policies and procedures to manage data as an organizational resource.

Signup and view all the flashcards

What is Data Quality Assurance?

Before a new database is in place, a firm must identify and correct faulty data.

Signup and view all the flashcards

Study Notes

  • ITM 102 Business Information Systems Winter 2025, Mahdi Abouei

Chapter 6: Foundations of Business Intelligence, Databases, and Information Management

Learning Outcomes

  • Problems of managing data resources in a traditional file environment
  • Major capabilities of database management systems (DBMS)
  • Why a relational DBMS is so powerful
  • Principal tools and technologies for accessing information from databases to improve business performance and decision-making
  • Why data governance and data quality assurance are essential for managing the firm’s data resources

Data Hierarchy

  • The data hierarchy consists of bits, bytes, fields, and components of data hierarchy

File Organization Terms and Concepts

  • Database: A group of related files
  • File: A group of records of the same type
  • Record: A group of related fields
  • Field: A group of characters as word(s) or number(s)
  • Entity: A person, place, or thing on which to store information
  • Attribute: Each characteristic, or quality, describing an entity

Problems with the Traditional File Environment

  • Data maintained separately by different departments
  • Data redundancy
  • Data inconsistency
  • Program-data dependence
  • Lack of flexibility
  • Poor security
  • Lack of data sharing and availability

Database Management Systems (DBMS)

  • Database serves many applications by centralizing data and controlling redundant data
  • Database management system (DBMS) interfaces between applications and physical data files
  • A DBMS separates logical and physical views of data, solving problems of traditional file environments
  • DBMS benefits include controlling redundancy, eliminating inconsistency, uncoupling programs and data, and enabling managing data and data security centrally

Relational DBMS

  • Represent data as two-dimensional tables
  • Each table contains data on an entity and attributes
  • Rows (tuples): records for different entities
  • Fields (columns): represents attributes for an entity
  • Key field: identifies each record uniquely
  • Primary key: field in a table used for key fields
  • Foreign key: Primary key used in the second table as a look-up field to identify records from the original table

Operations of a Relational DBMS

  • SELECT: creates a subset of data of all records that meet stated criteria
  • JOIN: combines relational tables to provide more information
  • PROJECT: creates a subset of columns in the table, containing only information specified

Designing Databases

  • Conceptual design vs. physical design is part of database design
  • Normalization: Streamlining complex groupings of data to minimize redundant data elements and awkward many-to-many relationships is part of database design
  • Referential integrity: Rules used by RDBMS to ensure relationships between tables remain consistent during database design
  • Entity-relationship diagram: visual representation of entities and their relationships in a database

Non-Relational Databases

  • Cloud Databases and Blockchain
  • Non-relational databases are often called "No SQL"
  • Non-relational databases have a more flexible data model
  • Data sets are stored across distributed machines
  • Easier to scale
  • Can handle large volumes of unstructured and structured data
  • Cloud databases appeal to start-ups and smaller businesses
  • Amazon Relational Database Service and Microsoft SQL Azure are clous databases
  • Private clouds are also cloud databases
  • Distributed Databases are stored in multiple physical locations
  • Google Spanner is a distributed database
  • Blockchain uses distributed ledgers in a peer-to-peer distributed database
  • Blockchain maintains a growing list of records and transactions shared by all
  • Encryption identifies participants and transactions in blockchain
  • Blockchain technology is used for financial transactions, supply chain, and medical records
  • The foundation of cryptocurrencies is blockchain

The Challenge of Big Data

  • Big data are massive sets of unstructured or semi-structured data from web traffic, social media, sensors, and so on
  • Big data is too large for typical DBMS
  • Requires new tools and technologies to manage and analyze
  • It can reveal more patterns, relationships, and anomalies

Business Intelligence Infrastructure

  • An array of tools for obtaining information from separate systems and big data
  • Data warehouse
  • Data mart
  • Hadoop
  • In-memory computing
  • Analytical platforms

Data Warehouse

  • Stores current and historical data from many core operational transaction systems
  • Provides analysis and reporting tools

Data Marts

  • Subset of data warehouse
  • Typically focuses on a single subject or line of business

Hadoop

  • Distributed parallel processing of big data across inexpensive computers
  • Hadoop Distributed File System: data storage
  • MapReduce: breaks data into clusters for work
  • Hbase: No SQL database
  • Used by Yahoo and NextBio

In-Memory Computing

  • Used in big data analysis
  • Use computer main memory (RAM) for data storage to avoid delays in retrieving data from disk storage
  • Processing can be reduced from hours/days to seconds
  • Requires optimized hardware

Analytic Platforms

  • High-speed platforms using both relocation and non-relational tools
  • Optimized for large datasets

Analytical Tools

  • Tools for consolidating, analyzing, and providing access to vast amounts of data to make better business decisions
  • Multidimensional data analysis (OLAP)
  • Data mining
  • Text mining
  • Web mining

Online Analytical Processing (OLAP)

  • Supports multidimensional data analysis
  • Allows viewing data using multiple dimensions
  • Each aspect of information (product, pricing, region, time) is a different dimension
  • Enables rapid, online answers to ad hoc queries

Data Mining

  • Finds hidden patterns and relationships in datasets
  • Customer buying patterns are an example
  • Associations
  • Sequences
  • Classification
  • Clustering
  • Forecasting

Text Mining and Web Mining

  • Text mining extracts key elements from large unstructured text data sets
  • Sentiment analysis software also helps with text mining
  • Web mining involves the discovery and analysis of useful patterns and information from the web
  • Types of web mining include web content mining, structure mining, and usage mining

Databases and the Web

  • The web can make some internal databases available to customers or partners
  • Advantages for using the web for database access include ease of use of browser software
  • Having a web interface requires few or no changes to database
  • Web interface is inexpensive to add to a database

Data Governance

  • Policies and procedures to manage data as an organizational resource
  • Establishes rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information
  • An organization's information policy that specifies that only selected members of a particular department can view certain information

Data Quality Assurance

  • Before a new database is in place, the firm must identify and correct faulty data
  • Establish and maintain better routines for editing data once database is in operation
  • Data quality audit
  • Data cleansing

Discussion Questions

  • What are the differentiating IT capabilities that are expected from a bank compared to a retailer and why does IT costs so much more for a bank compared to a retailer?
  • Where should CT Bank expect to spend more for in its IT using the IT infrastructure framework?
  • What is CRM, and why is it important for CT Bank and CT Retail?
  • Describe the Triangle reward system and why a customer would provide detailed information in return for rewards
  • Provide one example of a digital-enabled banking service from the CTB webpage and compare to a similar banking service at a major Canadian bank

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

C# Files and Data Hierarchy
20 questions
Data Hierarchy and File Systems
40 questions

Data Hierarchy and File Systems

IntuitiveRisingAction avatar
IntuitiveRisingAction
Data Fundamentals and Hierarchy Quiz
40 questions
Use Quizgecko on...
Browser
Browser