Data Warehousing Architectures Overview

What is a key requirement for understanding the strategic business purpose of the data warehouse (DW)?

Familiarity with business decision-making processes

Which area should effective security in a data warehouse primarily focus on?

Establishing effective corporate and security policies and procedures

What is a crucial aspect of implementing logical security procedures in a data warehouse?

User authentication, access controls, and encryption

What is a data warehouse according to Inmon?

A collection of integrated, subject-oriented databases designed to support DSS functions

What is a data warehouse according to Kimball?

A copy of transaction data specifically structured for query and analysis

What does it mean for a data warehouse to be subject oriented?

Organized around major subjects, such as sales progress, containing only information relevant for decision support

What is the main purpose of a data warehouse?

To provide a solid platform of consolidated, historical data for analysis

What are the 4 main characteristics of data warehousing?

Subject-oriented, integrated, time-variant, non-volatile

What is the primary focus of OLTP?

Day-to-day operations and repetitive, structured access

What type of database is used for OLTP?

Relational database

What type of queries does the data warehouse (OLAP) support?

Complex queries and access using star schema

What are data marts?

Smaller subsets of data warehouses for specific business units or departments

What is an enterprise data warehouse (EDW) used for?

Decision support across the enterprise

What are the three parts that data warehousing architectures are divided into?

Data warehouse, data acquisition software, and client software

What does the ETL process stand for in the context of data warehousing?

Extraction, transformation, and loading

Which approach focuses on enterprise-wide view and data reusability in data warehousing?

Inmon Model (EDW)

What is one of the factors affecting architecture selection in data warehousing?

Information interdependence

What can real-time data warehousing eliminate the need for in operational analysis?

Nightly batch processes

What is the primary purpose of the ETL process in a data warehouse?

To extract, transform, and load data into the data warehouse

Which tool can be used for the ETL process in a data warehouse?

IBM InfoSphere DataStage

What is a key responsibility of a data warehouse administrator?

Familiarity with high-performance hardware, software, and networking technologies

What does real-time data warehousing allow for?

Operational decision support, with data updated as it becomes available

Which model emphasizes quick end-user capability in its approach to data warehousing?

Kimball Model (Data mart)

What does the Inmon Model focus on in its approach to data warehousing?

Enterprise-wide view and data reusability

What is a crucial aspect of establishing effective corporate and security policies and procedures for security in a data warehouse?

Communicating the policy from the top to everyone in the organization

What does effective security in a data warehouse primarily focus on?

Implementing corporate and security policies

What is a key requirement for understanding how the data warehouse will be used for strategic business purposes?

Familiarity with business decision-making processes

What is the primary purpose of a data warehouse?

To facilitate reporting and analysis of an organization's electronically stored data

What does it mean for a data warehouse to be subject oriented?

Organized around major subjects and contains information relevant for decision support

What is the Inmon Model's focus in its approach to data warehousing?

Integration of subject-oriented databases to support DSS functions

What are the 4 main characteristics of data warehousing?

Subject oriented, nonvolatile, integrated, relevant to some moment in time

What is the primary focus of the Kimball Model in data warehousing?

Quick end-user capability

What are the main components of data warehousing architectures?

Data warehouse, data acquisition software, client software

What is the purpose of real-time data warehousing?

To meet the need for operational decision support

What are the key responsibilities of a data warehouse administrator?

Familiarity with high-performance hardware, software, and networking technologies

What does an enterprise data warehouse (EDW) provide data for?

Various types of decision support systems

What is the primary focus of OLAP?

Multidimensional data analysis

What is the role of metadata in a data warehouse?

Describe the contents and use of the data

What does the data warehousing process involve?

Importing data from internal and external sources, cleansing and organizing the data, loading it into the enterprise data warehouse, creating data marts if desired, and performing analyses as needed.

What is the main purpose of an operational database (OLTP)?

Support day-to-day operations with repetitive structured access

What type of queries does the operational database (OLTP) support?

Short, simple transactions

What do OLAP databases use for complex queries and access?

Star schema

What are the two types of data marts?

Dependent and independent

What is the goal of OLAP?

Multidimensional data analysis

What is a crucial focus for effective security in a data warehouse?

Ensuring top-down communication and implementation of security policies

What is a key responsibility for a data warehouse administrator?

Ensuring logical security procedures are implemented effectively

What is the primary purpose of establishing an internal control review process for security and privacy in a data warehouse?

To identify and rectify security vulnerabilities

What is a crucial characteristic of effective corporate and security policies for a data warehouse?

Starting at the top and being communicated to everyone in the organization

What is a crucial factor in understanding how the data warehouse will be used for strategic business purposes?

Familiarity with business decision-making processes

Study Notes

Data Warehousing Overview

  • Data warehousing architectures are divided into three parts: the data warehouse, data acquisition software, and client software.
  • Factors affecting architecture selection include information interdependence, urgency of need, resource constraints, strategic view, and compatibility.
  • Data integration involves data access, federation, and change capture processes to move data into a data warehouse.
  • The ETL process includes extraction, transformation, and loading of data into the data warehouse, contributing to data quality.
  • Companies can purchase ETL tools like IBM InfoSphere DataStage, Microsoft SSIS, or Oracle Data Integrator for the ETL process.
  • Data warehousing projects are complex and can influence multiple departments and business strategy.
  • Two main data warehouse development approaches are the Inmon Model (EDW) and the Kimball Model (Data mart).
  • Inmon's approach focuses on enterprise-wide view and data reusability, while Kimball's approach emphasizes quick end-user capability.
  • Both methods can produce enterprise data warehouses and data marts, with differences in scope, development time, cost, difficulty, sources, size, hardware, and users.
  • Real-time data warehousing has emerged to meet the need for operational decision support, allowing data to be updated as it becomes available.
  • Real-time data collection can eliminate the need for nightly batch processes in operational analysis.
  • Data warehouse administration requires a data warehouse administrator who is familiar with high-performance hardware, software, and networking technologies.
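
The extraction, transformation, and loading steps described in the notes above can be sketched in a few lines of Python. This is a minimal illustration, not how DataStage, SSIS, or Oracle Data Integrator work internally. The two source systems, their field names, the sample rows, and the in-memory list standing in for the warehouse are all hypothetical.

```python
from datetime import date

# Hypothetical source rows; field names and values are invented for illustration.
CRM_ROWS = [{"cust_name": " Alice ", "amt": "120.50", "day": "2024-01-05"}]
WEB_ROWS = [{"customer": "BOB", "amount_usd": 80, "order_date": "2024-01-06"}]

# Maps each source's field names onto one warehouse naming convention.
FIELD_MAP = {
    "crm": {"cust_name": "customer", "amt": "amount", "day": "order_date"},
    "web": {"customer": "customer", "amount_usd": "amount", "order_date": "order_date"},
}


def extract():
    """Extraction: pull raw rows from each source, tagged with their origin."""
    for row in CRM_ROWS:
        yield "crm", row
    for row in WEB_ROWS:
        yield "web", row


def transform(source, row):
    """Transformation: resolve naming conflicts, then clean and type the values."""
    renamed = {FIELD_MAP[source][key]: value for key, value in row.items()}
    return {
        "customer": str(renamed["customer"]).strip().title(),
        "amount": float(renamed["amount"]),
        "order_date": date.fromisoformat(renamed["order_date"]),
        "source": source,
    }


def load(rows, warehouse):
    """Loading: append the cleaned rows to the warehouse store."""
    warehouse.extend(rows)
    return len(rows)


def run_etl(warehouse):
    cleaned = [transform(source, row) for source, row in extract()]
    return load(cleaned, warehouse)


warehouse = []  # stand-in for a warehouse table
loaded = run_etl(warehouse)
```

Keeping the field mapping in one table (`FIELD_MAP`) means a new source can be added without touching the cleaning logic, which is one simple way to handle the naming conflicts that integration has to resolve.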

Data Warehousing and OLTP vs OLAP

  • Data warehousing involves integrating data from various sources into a consistent format, resolving naming conflicts and discrepancies, and applying data-cleaning techniques to keep the data consistent.
  • The 4 main characteristics of data warehousing are being subject-oriented, integrated, time-variant (maintaining historical data), and non-volatile (once loaded, data is not changed or overwritten by users).
  • Data warehouses run on DBMSs such as Oracle, SQL Server, and DB2, and they store large amounts of data for long periods, with data that cannot be overwritten by users.
  • OLTP (online transaction processing) focuses on day-to-day operations and is designed for repetitive, structured access, while OLAP (online analytical processing) supports decision making through ad hoc, unstructured access.
  • The operational database (OLTP) is relational and designed for short, simple transactions, while the data warehouse (OLAP) uses a star schema for complex queries and access.
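  • Data marts are smaller subsets of data warehouses that serve a specific business unit or department. A dependent data mart is created directly from the data warehouse; an independent data mart is a small warehouse designed for a strategic business unit or department whose source is not an enterprise data warehouse.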
  • An enterprise data warehouse (EDW) is a large-scale data warehouse used for decision support across the enterprise, providing data for various types of decision support systems.
  • Metadata in a data warehouse are data about data: they describe the contents, characteristics, and use of the stored data.
  • The data warehousing process involves importing data from internal and external sources, cleansing and organizing the data, loading it into the enterprise data warehouse, creating data marts if desired, and performing analyses as needed.
  • The major components of a data warehousing process include data sources, data extraction, data loading, the data warehouse/comprehensive database, and middleware tools for access to the data warehouse from various front-end applications.
  • The goal of OLAP is multidimensional data analysis, providing fast and flexible data summarization, analysis, and reporting capabilities, and the ability to view trends over time.
  • The operational database (OLTP) is designed for day-to-day operations, while the data warehouse (OLAP) is designed for decision support and reporting on subjects.
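
The star schema and multidimensional analysis mentioned in the notes above can be illustrated with Python's built-in `sqlite3` module. This is a small sketch under stated assumptions: the table names (`fact_sales`, `dim_product`, `dim_date`), the columns, the sample rows, and the helper `revenue_by` are all hypothetical, and a production warehouse would run on a full DBMS rather than an in-memory SQLite database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# One central fact table surrounded by dimension tables: a minimal star schema.
conn.executescript("""
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER, quarter INTEGER);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id INTEGER REFERENCES dim_date(date_id),
    revenue REAL
);
INSERT INTO dim_product VALUES (1, 'Books'), (2, 'Games');
INSERT INTO dim_date VALUES (10, 2023, 4), (11, 2024, 1);
INSERT INTO fact_sales VALUES
    (1, 10, 100.0), (1, 11, 150.0), (2, 10, 40.0), (2, 11, 60.0);
""")

ALLOWED_DIMENSIONS = {"category", "year", "quarter"}


def revenue_by(*dimensions):
    """Summarize revenue along the requested dimension columns."""
    if not dimensions or not set(dimensions) <= ALLOWED_DIMENSIONS:
        raise ValueError(f"dimensions must be chosen from {sorted(ALLOWED_DIMENSIONS)}")
    cols = ", ".join(dimensions)  # safe: names are checked against the allow-list
    sql = f"""
        SELECT {cols}, SUM(f.revenue)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        JOIN dim_date AS d ON d.date_id = f.date_id
        GROUP BY {cols}
        ORDER BY {cols}
    """
    return conn.execute(sql).fetchall()


by_year = revenue_by("year")                       # trend over time
by_category_year = revenue_by("category", "year")  # finer-grained view
```

Calling `revenue_by` with different dimension combinations against the same fact table is the kind of flexible summarization the notes attribute to OLAP, whereas an OLTP workload would typically insert or update one sales row at a time.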

Learn about the different architectures for data warehousing, including the three-tier architecture which involves the data warehouse, data acquisition software, and client software. Explore the process of extracting, consolidating, and loading data from legacy systems and external sources.
