Questions and Answers
Which of the following is the correct sequence in the data hierarchy from the smallest to the largest?
- Bit, Byte, Field, Record, File, Database (correct)
- Database, File, Record, Field, Byte, Bit
- Byte, Bit, Field, Record, File, Database
- Bit, Field, Byte, Record, File, Database
In the context of file organization terms, what constitutes a 'Record'?
- A group of related fields. (correct)
- A collection of databases.
- A group of related files.
- A single character, word, or number
What is the primary purpose of an 'attribute' in the context of databases?
- To identify a person, place, or thing.
- To serve as the main storage unit for data.
- To represent a collection of related files.
- To describe a characteristic or quality of an entity. (correct)
Which of the following scenarios best illustrates 'data redundancy', a common problem in traditional file environments?
Which of the following is NOT a typical problem associated with traditional file environments?
What is a key difference between a database and a traditional file system in terms of data management?
What is a primary benefit of a Database Management System (DBMS)?
In a relational database, what are the 'fields' of a table more commonly known as?
What is the role of a 'primary key' in a relational database table?
What is the purpose of a 'foreign key' in a relational database?
Which operation in a relational DBMS is responsible for creating a subset of data of all records that meet specific criteria?
Which operation in a relational DBMS combines data from multiple tables to provide more information?
What is the primary goal of normalization in database design?
What is 'referential integrity' in the context of relational databases?
In database design, what is the purpose of an Entity-Relationship Diagram (ERD)?
Which of the following describes a key characteristic of NoSQL databases?
What is a key advantage of cloud databases for start-ups and smaller businesses?
What is a defining characteristic of a blockchain database?
Which of the following is a key characteristic of 'big data'?
Why is specialized infrastructure often required for managing big data?
What is the primary function of a data warehouse in a business intelligence infrastructure?
How does a data mart differ from a data warehouse?
What is the main purpose of Hadoop Distributed File System (HDFS)?
What is the role of 'MapReduce' in the Hadoop ecosystem?
What is the primary benefit of in-memory computing in business intelligence?
What is the primary goal of Online Analytical Processing (OLAP)?
What does the multidimensional data model used in OLAP allow users to do?
What is the primary goal of data mining?
Which of the following is NOT a type of information commonly obtainable from data mining?
What is the primary focus of text mining?
What sets Web mining apart from other data analysis techniques?
What advantage does using the web offer for database access?
What is the main goal of data governance?
Which of the following aims does data quality assurance accomplish?
What steps must a firm take before implementing a new database to ensure data quality assurance?
What is the most important aspect of data quality assurance?
What is the primary step in ensuring data quality?
Which of the following would benefit MOST from data governance?
Flashcards
What is a database?
A group of related files
What is a file?
A group of records of the same type.
What is a record?
A group of related fields.
What is a field?
What is an entity?
What is an attribute?
What is the problem with Traditional File Environment?
What is a Database?
What is DBMS?
What does Relational DBMS do?
What are Rows (tuples)?
What are Fields (columns)?
What is the Key field?
What is Primary key?
What is Foreign key?
What is SELECT?
What is JOIN?
What is PROJECT?
What is Normalization?
What is Referential Integrity?
What are NoSQL Databases useful for?
Who benefits from cloud databases?
What is Big Data?
What is Business Intelligence Infrastructure?
Purpose of a Data Warehouse?
What is a data mart?
What is Hadoop?
What is In-memory computing?
What are Analytical Tools for?
What is Online Analytical Processing (OLAP)?
What is Data Mining?
What is Text mining?
What is Web mining?
What is Data Governance?
What is Data Quality Assurance?
Study Notes
- ITM 102 Business Information Systems Winter 2025, Mahdi Abouei
Chapter 6: Foundations of Business Intelligence, Databases, and Information Management
Learning Outcomes
- Problems of managing data resources in a traditional file environment
- Major capabilities of database management systems (DBMS)
- Why a relational DBMS is so powerful
- Principal tools and technologies for accessing information from databases to improve business performance and decision-making
- Why data governance and data quality assurance are essential for managing the firm’s data resources
Data Hierarchy
- The data hierarchy, from smallest to largest, consists of bits, bytes, fields, records, files, and databases
File Organization Terms and Concepts
- Database: A group of related files
- File: A group of records of the same type
- Record: A group of related fields
- Field: A group of characters as word(s) or number(s)
- Entity: A person, place, or thing on which to store information
- Attribute: Each characteristic, or quality, describing an entity
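The terms above can be sketched in a few lines of Python. This is only an illustration, assuming a hypothetical student entity; the class name, field names, and sample values are not from the course material.

```python
from dataclasses import dataclass


# Entity: a student. Each attribute of the entity becomes a field.
@dataclass
class StudentRecord:      # record: a group of related fields
    student_id: int       # field
    name: str             # field
    course: str           # field


# File: a group of records of the same type.
student_file = [
    StudentRecord(1, "Ana", "ITM 102"),
    StudentRecord(2, "Ben", "ITM 102"),
]

# Database: a group of related files (the second file is left empty here).
database = {"students": student_file, "courses": []}

# A field is stored as bytes, and each byte is made of 8 bits.
name_bytes = student_file[0].name.encode("ascii")
name_bits = len(name_bytes) * 8
print(len(name_bytes), name_bits)  # 3 24
```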
Problems with the Traditional File Environment
- Data maintained separately by different departments
- Data redundancy
- Data inconsistency
- Program-data dependence
- Lack of flexibility
- Poor security
- Lack of data sharing and availability
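Data redundancy and data inconsistency are easiest to see with a small example. The sketch below assumes two hypothetical departmental files that each hold their own copy of the same customer; all names and values are made up for illustration.

```python
# Two departments keep separate copies of the same customer (redundancy).
sales_file = {"C001": {"name": "Dana Lee", "address": "12 King St"}}
# Billing never received the address change, so the copies disagree.
billing_file = {"C001": {"name": "Dana Lee", "address": "98 Queen St"}}


def find_inconsistencies(file_a, file_b):
    """Return sorted (customer_id, field) pairs whose values differ."""
    problems = []
    for cust_id in file_a.keys() & file_b.keys():
        shared_fields = file_a[cust_id].keys() & file_b[cust_id].keys()
        for field in shared_fields:
            if file_a[cust_id][field] != file_b[cust_id][field]:
                problems.append((cust_id, field))
    return sorted(problems)


conflicts = find_inconsistencies(sales_file, billing_file)
print(conflicts)  # [('C001', 'address')]
```

A centralized database avoids this situation by storing the address once and letting every application read the same copy.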
Database Management Systems (DBMS)
- Database serves many applications by centralizing data and controlling redundant data
- Database management system (DBMS) interfaces between applications and physical data files
- A DBMS separates logical and physical views of data, solving problems of traditional file environments
- DBMS benefits include controlling redundancy, eliminating inconsistency, uncoupling programs and data, and enabling managing data and data security centrally
Relational DBMS
- Represent data as two-dimensional tables
- Each table contains data on an entity and its attributes
- Rows (tuples): records for different entities
- Fields (columns): represent attributes for an entity
- Key field: identifies each record uniquely
- Primary key: the field in a table that serves as its key field
- Foreign key: a primary key from one table that appears in a second table as a look-up field to identify records from the original table
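A minimal sketch of primary and foreign keys, using Python's built-in sqlite3 module with an in-memory database. The supplier and part tables, their column names, and the sample rows are assumptions chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE supplier (
    supplier_number INTEGER PRIMARY KEY,   -- primary key: unique per row
    supplier_name   TEXT NOT NULL
);
CREATE TABLE part (
    part_number     INTEGER PRIMARY KEY,
    part_name       TEXT NOT NULL,
    -- foreign key: the supplier's primary key, used here as a look-up field
    supplier_number INTEGER REFERENCES supplier(supplier_number)
);
INSERT INTO supplier VALUES (8259, 'CBM Inc.');
INSERT INTO part VALUES (137, 'Door latch', 8259);
""")

# Follow the foreign key from a part back to its supplier.
row = conn.execute(
    "SELECT supplier_name FROM supplier WHERE supplier_number = "
    "(SELECT supplier_number FROM part WHERE part_number = ?)",
    (137,),
).fetchone()
print(row[0])  # CBM Inc.

# A primary key must be unique, so a second supplier 8259 is rejected.
try:
    conn.execute("INSERT INTO supplier VALUES (8259, 'Duplicate Co.')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True
```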
Operations of a Relational DBMS
- SELECT: creates a subset of data of all records that meet stated criteria
- JOIN: combines relational tables to provide more information
- PROJECT: creates a subset of columns in the table, containing only information specified
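The three operations can be tried out with Python's built-in sqlite3 module. The tables and rows below are hypothetical. Note that in SQL the relational "select" operation corresponds to a WHERE clause, while "project" corresponds to the column list after the SELECT keyword.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE supplier (supplier_number INTEGER PRIMARY KEY, supplier_name TEXT);
CREATE TABLE part (part_number INTEGER PRIMARY KEY, part_name TEXT,
                   supplier_number INTEGER);
INSERT INTO supplier VALUES (8259, 'CBM Inc.'), (8261, 'B. R. Molds');
INSERT INTO part VALUES (137, 'Door latch', 8259),
                        (150, 'Door seal', 8261),
                        (152, 'Compressor', 8259);
""")

# SELECT: the subset of rows that meet the stated criteria.
selected = conn.execute(
    "SELECT * FROM part WHERE part_number IN (137, 150) ORDER BY part_number"
).fetchall()

# JOIN: combine the two tables through the shared supplier_number.
joined = conn.execute(
    "SELECT p.part_number, p.part_name, s.supplier_name "
    "FROM part AS p JOIN supplier AS s "
    "ON p.supplier_number = s.supplier_number ORDER BY p.part_number"
).fetchall()

# PROJECT: keep only the columns that were asked for.
projected = conn.execute(
    "SELECT part_number, part_name FROM part ORDER BY part_number"
).fetchall()
```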
Designing Databases
- Database design involves both a conceptual design and a physical design
- Normalization: streamlining complex groupings of data to minimize redundant data elements and awkward many-to-many relationships
- Referential integrity: rules used by an RDBMS to ensure that relationships between tables remain consistent
- Entity-relationship diagram: visual representation of entities and their relationships in a database
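Referential integrity can be demonstrated with Python's built-in sqlite3 module. This sketch assumes hypothetical supplier and part tables and a SQLite build with foreign-key support, which must be switched on for each connection with a PRAGMA.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.executescript("""
CREATE TABLE supplier (supplier_number INTEGER PRIMARY KEY, supplier_name TEXT);
CREATE TABLE part (
    part_number     INTEGER PRIMARY KEY,
    part_name       TEXT,
    supplier_number INTEGER NOT NULL REFERENCES supplier(supplier_number)
);
INSERT INTO supplier VALUES (8259, 'CBM Inc.');
INSERT INTO part VALUES (137, 'Door latch', 8259);
""")


def violates_integrity(sql, params=()):
    """Run one statement; report whether the DBMS rejected it."""
    try:
        conn.execute(sql, params)
        return False
    except sqlite3.IntegrityError:
        return True


# A part may not point at a supplier that does not exist.
orphan_part = violates_integrity(
    "INSERT INTO part VALUES (?, ?, ?)", (150, "Door seal", 9999)
)
# A supplier may not be deleted while parts still refer to it.
dangling_delete = violates_integrity(
    "DELETE FROM supplier WHERE supplier_number = ?", (8259,)
)
print(orphan_part, dangling_delete)  # True True
```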
Non-Relational Databases
- Cloud Databases and Blockchain
- Non-relational databases are often called "NoSQL" databases
- Non-relational databases have a more flexible data model
- Data sets are stored across distributed machines
- Easier to scale
- Can handle large volumes of unstructured and structured data
- Cloud databases appeal to start-ups and smaller businesses
- Amazon Relational Database Service and Microsoft SQL Azure are cloud databases
- Cloud databases can also be hosted in private clouds
- Distributed Databases are stored in multiple physical locations
- Google Spanner is a distributed database
- Blockchain uses distributed ledgers in a peer-to-peer distributed database
- Blockchain maintains a growing list of records and transactions shared by all
- Encryption is used to identify participants and transactions in a blockchain
- Blockchain technology is used for financial transactions, supply chain, and medical records
- The foundation of cryptocurrencies is blockchain
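The "growing list of records" idea behind blockchain can be sketched with a simple hash chain. This is a heavily simplified illustration: it has no peer-to-peer network, no consensus mechanism, and no participant identities. The function names and sample transactions are hypothetical.

```python
import hashlib
import json

GENESIS_HASH = "0" * 64  # placeholder "previous hash" for the first block


def block_hash(index, data, prev_hash):
    """Hash a block's contents together with the previous block's hash."""
    payload = json.dumps(
        {"index": index, "data": data, "prev_hash": prev_hash}, sort_keys=True
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def add_block(chain, data):
    """Append a record that is linked to the record before it."""
    prev_hash = chain[-1]["hash"] if chain else GENESIS_HASH
    index = len(chain)
    chain.append({
        "index": index,
        "data": data,
        "prev_hash": prev_hash,
        "hash": block_hash(index, data, prev_hash),
    })


def is_valid(chain):
    """Check every link and every stored hash in the chain."""
    for i, block in enumerate(chain):
        expected_prev = chain[i - 1]["hash"] if i > 0 else GENESIS_HASH
        if block["prev_hash"] != expected_prev:
            return False
        recomputed = block_hash(block["index"], block["data"], block["prev_hash"])
        if block["hash"] != recomputed:
            return False
    return True


ledger = []
add_block(ledger, "Alice pays Bob 5")
add_block(ledger, "Bob pays Carol 2")
valid_before = is_valid(ledger)          # True

ledger[0]["data"] = "Alice pays Bob 500"  # tamper with an old record
valid_after = is_valid(ledger)           # False: the stored hash no longer matches
```

Because each block's hash covers the previous block's hash, changing an old record breaks the chain from that point on, which is what makes tampering detectable.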
The Challenge of Big Data
- Big data are massive sets of unstructured or semi-structured data from web traffic, social media, sensors, and so on
- Volumes are too large for a typical DBMS to handle
- Requires new tools and technologies to manage and analyze
- It can reveal more patterns, relationships, and anomalies
Business Intelligence Infrastructure
- An array of tools for obtaining information from separate systems and big data
- Data warehouse
- Data mart
- Hadoop
- In-memory computing
- Analytical platforms
Data Warehouse
- Stores current and historical data from many core operational transaction systems
- Provides analysis and reporting tools
Data Marts
- Subset of data warehouse
- Typically focuses on a single subject or line of business
Hadoop
- Distributed parallel processing of big data across inexpensive computers
- Hadoop Distributed File System: data storage
- MapReduce: breaks the processing of large data sets into smaller tasks that run in parallel across the nodes of a cluster
- HBase: NoSQL database
- Used by Yahoo and NextBio
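The MapReduce idea can be sketched on a single machine with a word count. This is not Hadoop's API; the function names and sample chunks are assumptions. In a real cluster each chunk would be mapped on a different node.

```python
from collections import defaultdict


def map_phase(chunk):
    """Map: emit a (word, 1) pair for every word in one chunk of data."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(mapped_pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in mapped_pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: combine the values for each key into one result."""
    return {key: sum(values) for key, values in groups.items()}


# Each string stands in for a block of data stored on a different node.
chunks = ["big data big", "data tools", "big tools"]
mapped = [pair for chunk in chunks for pair in map_phase(chunk)]
counts = reduce_phase(shuffle(mapped))
print(counts)  # {'big': 3, 'data': 2, 'tools': 2}
```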
In-Memory Computing
- Used in big data analysis
- Uses the computer's main memory (RAM) for data storage to avoid delays in retrieving data from disk storage
- Processing can be reduced from hours/days to seconds
- Requires optimized hardware
Analytic Platforms
- High-speed platforms using both relational and non-relational tools
- Optimized for large datasets
Analytical Tools
- Tools for consolidating, analyzing, and providing access to vast amounts of data to make better business decisions
- Multidimensional data analysis (OLAP)
- Data mining
- Text mining
- Web mining
Online Analytical Processing (OLAP)
- Supports multidimensional data analysis
- Allows viewing data using multiple dimensions
- Each aspect of information (product, pricing, region, time) is a different dimension
- Enables rapid, online answers to ad hoc queries
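Viewing the same data along different dimensions can be sketched as grouping and summing a small fact table. The sales figures, dimension names, and the `roll_up` helper are hypothetical; real OLAP tools precompute and index such aggregates.

```python
from collections import defaultdict

# Each row is one fact, described by three dimensions and one measure.
sales = [
    {"product": "Nuts", "region": "East", "quarter": "Q1", "units": 50},
    {"product": "Nuts", "region": "West", "quarter": "Q1", "units": 60},
    {"product": "Bolts", "region": "East", "quarter": "Q1", "units": 70},
    {"product": "Bolts", "region": "East", "quarter": "Q2", "units": 80},
]


def roll_up(facts, dimensions, measure="units"):
    """Total the measure for every combination of the chosen dimensions."""
    totals = defaultdict(int)
    for fact in facts:
        key = tuple(fact[dim] for dim in dimensions)
        totals[key] += fact[measure]
    return dict(totals)


# The same facts answer different ad hoc questions.
by_product = roll_up(sales, ["product"])
by_region_quarter = roll_up(sales, ["region", "quarter"])
print(by_product)  # {('Nuts',): 110, ('Bolts',): 150}
```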
Data Mining
- Finds hidden patterns and relationships in datasets
- Customer buying patterns are an example
- Associations
- Sequences
- Classification
- Clustering
- Forecasting
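As an illustration of associations, the sketch below counts which items are bought together in a handful of hypothetical shopping baskets. The `support` and `confidence` measures are standard association-rule terms introduced here for the example; they do not appear in the notes.

```python
from collections import Counter
from itertools import combinations

# Hypothetical purchase transactions.
baskets = [
    {"chips", "cola"},
    {"chips", "cola", "salsa"},
    {"chips", "salsa"},
    {"cola", "bread"},
]

# Count how often each pair of items appears in the same basket.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1


def support(item_a, item_b):
    """Share of all baskets that contain both items."""
    return pair_counts[tuple(sorted((item_a, item_b)))] / len(baskets)


def confidence(antecedent, consequent):
    """Of the baskets with the antecedent, the share that also has the consequent."""
    with_antecedent = [b for b in baskets if antecedent in b]
    both = [b for b in with_antecedent if consequent in b]
    return len(both) / len(with_antecedent)


print(support("chips", "cola"))      # 0.5
print(confidence("salsa", "chips"))  # 1.0
```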
Text Mining and Web Mining
- Text mining extracts key elements from large unstructured text data sets
- Sentiment analysis software is one application of text mining
- Web mining involves the discovery and analysis of useful patterns and information from the web
- Types of web mining include web content mining, structure mining, and usage mining
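A very rough sketch of sentiment analysis on unstructured text, assuming a tiny hand-made word list. Real sentiment analysis software uses far richer language models; the lexicon and sample comments here are hypothetical.

```python
# Hypothetical sentiment lexicon.
POSITIVE = {"great", "love", "fast"}
NEGATIVE = {"slow", "broken", "hate"}


def sentiment_score(text):
    """Positive words minus negative words; above zero leans favourable."""
    words = [word.strip(".,!?").lower() for word in text.split()]
    positives = sum(word in POSITIVE for word in words)
    negatives = sum(word in NEGATIVE for word in words)
    return positives - negatives


comments = [
    "I love the fast delivery!",
    "The app is slow and broken.",
]
scores = [sentiment_score(comment) for comment in comments]
print(scores)  # [2, -2]
```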
Databases and the Web
- The web can make some internal databases available to customers or partners
- Advantages of using the web for database access include the ease of use of browser software
- Having a web interface requires few or no changes to database
- Web interface is inexpensive to add to a database
Data Governance
- Policies and procedures to manage data as an organizational resource
- Establishes rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information
- Example: an information policy may specify that only selected members of a particular department can view certain information
Data Quality Assurance
- Before a new database is in place, the firm must identify and correct faulty data
- Establish and maintain better routines for editing data once database is in operation
- Data quality audit
- Data cleansing
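Data cleansing can be sketched as a routine that standardizes values and drops faulty or duplicate records. The record layout, the rules (email as the duplicate key, a bare "@" check), and the sample data are assumptions for illustration only.

```python
def cleanse(records):
    """Standardize names and emails; skip faulty or duplicate customers."""
    seen_emails = set()
    clean = []
    for record in records:
        email = record["email"].strip().lower()
        name = " ".join(record["name"].split()).title()
        if "@" not in email:
            continue  # faulty data: not a usable email address
        if email in seen_emails:
            continue  # redundant data: customer already kept
        seen_emails.add(email)
        clean.append({"name": name, "email": email})
    return clean


raw = [
    {"name": "  dana  lee", "email": "Dana@Example.com "},
    {"name": "Dana Lee", "email": "dana@example.com"},
    {"name": "Sam Roy", "email": "not-an-email"},
]
cleaned = cleanse(raw)
print(cleaned)  # [{'name': 'Dana Lee', 'email': 'dana@example.com'}]
```

A data quality audit would run similar checks but report the problems instead of silently dropping the records.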
Discussion Questions
- What are the differentiating IT capabilities that are expected from a bank compared to a retailer, and why does IT cost so much more for a bank than for a retailer?
- Using the IT infrastructure framework, where should CT Bank expect to spend more on its IT?
- What is CRM, and why is it important for CT Bank and CT Retail?
- Describe the Triangle reward system and why a customer would provide detailed information in return for rewards
- Provide one example of a digital-enabled banking service from the CTB webpage and compare to a similar banking service at a major Canadian bank