CS131-8 Data Warehousing Overview

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is a characteristic of a data warehouse?

  • It includes metadata (correct)
  • It is used for client-side scripting
  • It is used for real-time applications
  • It is used for operational data storage

A data mart is a physical subset of a data warehouse.

True (A)

What is an operational data mart?

A small-scale data mart used by a single department or functional area when they need to analyze operational data

Metadata is ____________________ about data.

<p>data</p> Signup and view all the answers

Match the following terms with their definitions:

<p>Data Mart = A logical and physical subset of a data warehouse Operational Data Store (ODS) = A type of database used as an interim area for a data warehouse Enterprise Data Warehouse (EDW) = A technology that provides a vehicle for pushing data from source systems into a data warehouse Metadata = Data about data</p> Signup and view all the answers

A data warehouse is designed for real-time applications.

<p>False (B)</p> Signup and view all the answers

Why is data consolidation necessary in decision support?

<p>Because it enables summarization and aggregation of data from heterogeneous sources (C)</p> Signup and view all the answers

Data quality is improved when different sources use consistent data representations, codes, and formats.

<p>False (B)</p> Signup and view all the answers

What is the primary reason for using a separate system for decision support?

<p>To limit visibility to sensitive data and ensure confidentiality and privacy</p> Signup and view all the answers

The DateDim table has a primary key of _______________________.

<p>Date</p> Signup and view all the answers

What is the purpose of the StoreDim table?

<p>To store store information (A)</p> Signup and view all the answers

Match the dimension tables with their corresponding primary keys:

<p>DateDim = Date ProductDim = SKU StoreDim = StoreNumber</p> Signup and view all the answers

The SalesFact table has a primary key of Date.

<p>False (B)</p> Signup and view all the answers

The BrandDim table is not shown in Figure 11, but it would have a primary key of _______________________.

<p>BrandID</p> Signup and view all the answers

What is the primary reason business executives need strategic information?

<p>To stay competitive and improve the bottom line (C)</p> Signup and view all the answers

Companies started building and using computer systems in the 1950s.

<p>False (B)</p> Signup and view all the answers

What do we mean by strategic information?

<p>Combined essential information needed to make decisions in the formulation and execution of business strategies and objectives.</p> Signup and view all the answers

As businesses grew more complex, corporations spread ___________ and competition became fiercer.

<p>globally</p> Signup and view all the answers

Match the following roles with their need for strategic information:

<p>Executives = Need information to make competitive decisions Managers = Need information to make strategic decisions IT Staff = Need information to support business processes</p> Signup and view all the answers

What is the primary purpose of a data warehouse?

<p>To provide an integrated and total view of the enterprise (D)</p> Signup and view all the answers

Data warehouses are designed to support operational systems.

<p>False (B)</p> Signup and view all the answers

What is the main characteristic of a data warehouse in terms of its data content?

<p>Standardized format</p> Signup and view all the answers

A data warehouse is a physical repository where relational data (current and historical) are specially organized to provide enterprise-wide, cleansed data in a ______________ format.

<p>standardized</p> Signup and view all the answers

Match the characteristics of a data warehouse with their descriptions:

<p>Subject oriented = Data are organized by detailed subject Integrated = Data warehouses must place data from different sources into a consistent format Time variant = It contains historical and current data Non-volatile = Data cannot be changed or updated after it has been entered into the data warehouse</p> Signup and view all the answers

What is the benefit of having a data warehouse in terms of data access?

<p>It makes decision-support transactions possible without hindering operational systems (D)</p> Signup and view all the answers

A data warehouse is a volatile repository of data.

<p>False (B)</p> Signup and view all the answers

What is the main driving force behind the need for informational systems?

<p>Fiercer competition and business complexity</p> Signup and view all the answers

What is the main goal of a transactional system?

<p>To capture data quickly (C)</p> Signup and view all the answers

Operational databases typically maintain historical data for decision support.

<p>False (B)</p> Signup and view all the answers

What is the main reason why a separate system, such as a data warehouse, is needed for analytics?

<p>Transactional models are not good for analytics</p> Signup and view all the answers

Complex decision-support queries would degrade performance for ______________________ transactions.

<p>operational</p> Signup and view all the answers

Match the following systems with their primary goals:

<p>Transactional System = To capture data quickly and efficiently Operational Database = To support transaction processing Data Warehouse = To support analytics and decision-making</p> Signup and view all the answers

What is the main advantage of using a data warehouse for analytics?

<p>Optimized data organization for multidimensional views and queries (C)</p> Signup and view all the answers

A data warehouse is designed to support transaction processing.

<p>False (B)</p> Signup and view all the answers

What is the main difference between a transactional system and a data warehouse?

<p>Purpose: transaction processing vs. analytics and decision-making</p> Signup and view all the answers

Flashcards are hidden until you start studying

Study Notes

Objectives

  • Understand the need for strategic information in an enterprise
  • Recognize the information crisis at every enterprise
  • Distinguish between operational and informational systems
  • Learn why past attempts to provide strategic information failed
  • Understand why data warehousing is the viable solution
  • Understand business intelligence for an enterprise

A New Paradigm

  • Companies started using computer systems in the 1960s and became dependent on them
  • As businesses grew more complex, executives became desperate for information to stay competitive
  • Competitive advantage requires strategic information

The Need for Strategic Information

  • Who needs strategic information: executives and managers responsible for keeping the enterprise competitive
  • What is strategic information: combined essential information needed to make decisions in the formulation and execution of business strategies and objectives

The Need for Informational Systems

  • Companies need to design and build informational systems that serve different purposes, have different scopes, data content, data usage patterns, and data access types
  • Operational systems are not designed to provide strategic information

Data Warehouse Defined

  • A physical repository where relational data (current and historical) are specially organized to provide enterprise-wide, cleansed data in a standardized format
  • Provides an integrated and total view of the enterprise
  • Makes the enterprise's current and historical information easily available for strategic decision making
  • Makes decision-support transactions possible without hindering operational systems
  • Renders the organization's information consistent
  • Presents a flexible and interactive source of strategic information

Characteristics of Data Warehouse

  • Subject-oriented: data are organized by detailed subject containing only information relevant for decision support
  • Integrated: data warehouses must place data from different sources into a consistent format
  • Time-variant: contains historical and current data
  • Non-volatile: data cannot be changed or updated after it has been entered into the data warehouse
  • Web-based: designed for web-based applications
  • Relational/multidimensional: its structure is either relational or multidimensional
  • Uses client-server: to be easy to access
  • Real-time: a characteristic of new data warehouses
  • Includes metadata: data about data (about how data are organized and used)

Definitions and Concepts

  • Data mart: a logical and physical subset of a data warehouse, representing data from a single business process
  • Operational data stores (ODS): a type of database often used as an interim area for a data warehouse
  • Oper marts: an operational data mart, typically used by a single department or functional area
  • Enterprise data warehouse (EDW): a technology that provides a vehicle for pushing data from source systems into a data warehouse for decision support
  • Metadata: data about data

The Solution

  • Use a separate system – a data warehouse designed for analytics
  • Transactional models are not good for analytics
  • Goal of transactional system is to capture data quickly, not to minimize the time or complexity of retrieving large amounts of data for analysis
  • Why use a separate system:
    • Performance: operational databases are designed and tuned for known transactions and workloads
    • Function: decision support requires historical data, data consolidation, and data quality
    • Security: need to limit visibility to sensitive data, and de-identify personal data

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

More Like This

Data Warehousing and Data Mining: Strategic Information
10 questions
Data Warehousing and Mining Overview
32 questions
Data Mining Concepts
16 questions

Data Mining Concepts

ProfuseUranium1766 avatar
ProfuseUranium1766
Use Quizgecko on...
Browser
Browser