OLAP: Online Analytical Processing

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

______ (Online Analytical Processing) is a powerful technology behind many Business Intelligence (BI) applications.

OLAP

______ is for the data analytic purpose, hence it enables us to analyze information from multiple database systems at the same time.

OLAP

OLAP is a computing method that allows users to easily extract required data and ______ data in order to analyze it from different points of view.

query

OLAP is basically based on the huge data that is called ______; it collects the required data from the data warehouse.

<p>data warehouse</p> Signup and view all the answers

The extracted data is cleaned and transformed before being loaded into an ______ server.

<p>OLAP</p> Signup and view all the answers

An ______ is a data structure that allows fast analysis of data according to the multiple dimensions that define a business problem.

<p>OLAP Cube</p> Signup and view all the answers

______ – Also known as drill-up or consolidation, use to summarize operation data along with the dimension.

<p>Roll-up</p> Signup and view all the answers

______ – To perform the analysis in deeper among the dimensions of data.

<p>Drill-down</p> Signup and view all the answers

______ – To perform the analysis take one level of information for display, such as "sales in 2019."

<p>Slice</p> Signup and view all the answers

______ – To perform the analysis, select data from multiple dimensions to analyze, such as "sales of Laptop in Region 4 in 2019."

<p>Dice</p> Signup and view all the answers

______ – To perform the analysis that can gain a new view of data by rotating the data axes of the cube.

<p>Pivot</p> Signup and view all the answers

______ OLAP uses a relational database management system to keep and control the data.

<p>Relational</p> Signup and view all the answers

______ OLAP utilizes a multi-dimensional Database (MDDB) for storing and analyzing information.

<p>Multidimensional</p> Signup and view all the answers

______ OLAP is a blend of MOLAP and ROLAP, offering the qualities of both techniques.

<p>Hybrid</p> Signup and view all the answers

______ OLAP is a single-tier, desktop-based OLAP technology

<p>Desktop</p> Signup and view all the answers

______ OLAP is an OLAP system accessible via the web browser.

<p>Web</p> Signup and view all the answers

______ OLAP helps users to access and analyze OLAP data using their mobile devices

<p>Mobile</p> Signup and view all the answers

______ OLAP is created to facilitate management of both spatial and non-spatial data in a Geographic Information system (GIS).

<p>Spatial</p> Signup and view all the answers

______, Micro Strategy, Palo OLAP Server, Apache Kylin and icCube are examples of OLAP Tools.

<p>IBM Cognos</p> Signup and view all the answers

Pentaho BI, ______, Oracle Business Intelligence Enterprise Edition(OBIEE), JsHypercube and Jedox, are examples of OLAP Tools.

<p>Mondrian</p> Signup and view all the answers

______ of data processing is a key advantage of using OLAP systems.

<p>High speed</p> Signup and view all the answers

OLAP systems allow businesses to work with both ______ and ______ data for comprehensive analysis.

<p>aggregated, detailed</p> Signup and view all the answers

A disadvantage of OLAP systems can be the ______ associated with implementation and maintenance.

<p>high cost</p> Signup and view all the answers

______ stands for Extract, Transform and Load.

<p>ETL</p> Signup and view all the answers

ETL is a process in data warehousing used to ______ data from the database or source systems.

<p>extract</p> Signup and view all the answers

During ______, data is cleaned, transformed, and made consistent for loading into the data warehouse.

<p>transformation</p> Signup and view all the answers

Following extraction and transformation, data is placed into the data warehouse during the ______ process.

<p>load</p> Signup and view all the answers

In the ______ extraction method, data from sources is loaded into the data warehouses that show either data warehouse is being populated the first time or no strategy has been made for data extraction.

<p>full</p> Signup and view all the answers

______ extraction is also known as delta, where only the data being changed is extracted and update data warehouses.

<p>Partial</p> Signup and view all the answers

One kind of data loading is the ______ Load, where data is populated to all of the Data Warehouse tables.

<p>Initial</p> Signup and view all the answers

______ Load applies ongoing changes only when needed periodically.

<p>Incremental</p> Signup and view all the answers

The ______ loading method involves erasing the contents of tables and reloading with fresh data.

<p>full refresh</p> Signup and view all the answers

______, Informatica Data Validation, QuerySurge, ICEDQ, Datagaps ETL Validator, QualiDI are examples of ETL Tools.

<p>RightData</p> Signup and view all the answers

SSISTester, TestBench, GTL QAceGen, ______, DbFit, AnyDbTest, 99 Percentage ETL Testing are examples of ETL Tools.

<p>Zuzena Automated Testing Service</p> Signup and view all the answers

Applications and ______ are identified and extracted in ETL process.

<p>flat files</p> Signup and view all the answers

Flashcards

What is OLAP?

Online Analytical Processing is a technology behind Business Intelligence applications, enabling data discovery, reporting, complex calculations, and predictive scenarios.

OLAP Defined

A computing method allowing users to extract and query data to analyze it from different perspectives. It relies on a data warehouse to collect required data.

What is an OLAP Cube?

A data structure that enables fast analysis of data based on multiple dimensions defining a business problem.

What is Roll-up in OLAP?

Summarizes operation data together with a specific dimension to provide a high level view.

Signup and view all the flashcards

What is Drill-down in OLAP?

Performs analysis in deeper dimensions of data, such as from 'time period' to 'years' and 'months'.

Signup and view all the flashcards

What is Slice in OLAP?

Performs analysis by taking one level of information for display, like 'sales in 2019'.

Signup and view all the flashcards

What is Dice in OLAP?

Performs analysis by selecting data from multiple dimensions to analyze, like 'sales of Laptop in Region 4 in 2019'.

Signup and view all the flashcards

What is Pivot in OLAP?

Performs analysis to gain a new data view by rotating the data axes of the OLAP cube.

Signup and view all the flashcards

What is Relational OLAP (ROLAP)?

Utilizes a relational database management system to keep and control the data.

Signup and view all the flashcards

What is Multidimensional OLAP (MOLAP)?

Utilizes a multi-dimensional Database (MDDB) for storing and analyzing information.

Signup and view all the flashcards

What is Hybrid OLAP (HOLAP)?

A blend of MOLAP and ROLAP, offering qualities of both techniques, storing data summaries in binary files or pre-calculated cubes.

Signup and view all the flashcards

What is Desktop OLAP (DOLAP)?

A single-tier, desktop-based OLAP technology.

Signup and view all the flashcards

What is Web OLAP (WOLAP)?

An OLAP system accessible via a web browser, consisting of client, middleware, and a database server.

Signup and view all the flashcards

What is Mobile OLAP?

Helps users to access and analyze OLAP data using their mobile devices.

Signup and view all the flashcards

What is Spatial OLAP (SOLAP)?

Facilitates management of both spatial and non-spatial data in a Geographic Information System (GIS).

Signup and view all the flashcards

What is ETL?

A process in data warehousing used to extract data from source systems, transform it, and load it into a data warehouse.

Signup and view all the flashcards

What is Extraction in ETL?

Identifies and extracts all preferred data from various source systems.

Signup and view all the flashcards

What is Full Extraction?

Loads all source data into the data warehouse, whether for the first time or when extraction strategy hasn't been defined.

Signup and view all the flashcards

What is Partial Extraction (with update notification)?

Only extracts and updates data that has been changed.

Signup and view all the flashcards

Partial Extraction (without update notification)

Refers to extracting specific required data from sources according to loading needs in the data warehouses.

Signup and view all the flashcards

What is Transformation in ETL?

Corrects data, removes incorrect data, and fixes data errors before loading.

Signup and view all the flashcards

What is Initial Load?

Populating all the Data Warehouse tables.

Signup and view all the flashcards

What is Incremental Load?

Applying ongoing changes as needed periodically.

Signup and view all the flashcards

What is Full Refresh?

Erasing the contents of one or more tables and reloading with fresh data.

Signup and view all the flashcards

Study Notes

Intended Learning Outcomes

  • Define OLAP
  • Determine the types of OLAP
  • Understand OLAP operations
  • List OLAP Tools
  • Define ETL and understand its process in data warehouse

Online Analytical Processing

  • OLAP is a powerful technology behind many Business Intelligence (BI) applications
  • OLAP discovers data, provides report viewing capabilities and complex analytical calculations
  • OLAP supports predictive "what if" scenarios, budget planning, and forecast planning
  • OLAP is for data analytics and enables analysis of information from multiple database systems simultaneously
  • OLAP is a computing method that enables users to easily extract required data and query data for analysis from different perspectives
  • OLAP is based on the large data in a data warehouse, collecting required data and performing business analysis for decisions to improve profit, sales, brand, and marketing

OLAP Architecture

  • Three-tier data warehouse architecture including data sources, data storage, and front-end tools.
  • Data flows from source databases through extraction, transformation, and loading into a central data warehouse.

How OLAP Works

  • A data warehouse extracts information from multiple data sources and formats, such as text files, Excel sheets, and multimedia files
  • The extracted data is cleaned, transformed and then loaded into an OLAP server (or OLAP cube)
  • The information is pre-calculated in advance for further analysis in the OLAP server

OLAP Cube

  • An OLAP Cube is a data structure that allows fast data analysis based on multiple dimensions which define a business problem.
  • A multidimensional cube for reporting sales could be composed of 7 dimensions: Salesperson, Sales Amount, Region, Product, Region, Month, Year.

Basic Analytical Operations of OLAP

  • Roll-up (drill-up/consolidation) summarizes operation data along a dimension
  • Drill-down performs analysis in deeper dimensions of data. Drilling down from "time period" to "years", "months", and "days" plots sales growth
  • Slice performs the analysis and takes one level of information for display, like "sales in 2019"
  • Dice performs analysis by selecting data from multiple dimensions
  • Pivot performs analysis to gain a new view of data by rotating the cube's data axes

Types of OLAP System

  • Relational OLAP (ROLAP) uses a relational database management system to keep and control the data and the servers exist between the database and the user
  • Multidimensional OLAP (MOLAP) utilizes a multi-dimensional Database (MDDB) for storing and analyzing information
  • Hybrid OLAP (HOLAP) is a blend of MOLAP and ROLAP and offers the qualities of both techniques
  • Desktop OLAP (DOLAP) is a single-tier, desktop-based OLAP technology
  • Web OLAP (WOLAP) is accessible via a web browser and its architecture consists of a client, middleware, and a database server
  • Mobile OLAP helps users access and analyze OLAP data using their mobile devices
  • Spatial OLAP (SOLAP) manages spatial and non-spatial data in a Geographic Information System (GIS)

OLAP Tools

  • Notable OLAP tools include IBM Cognos, Micro Strategy, Palo OLAP Server, Apache Kylin, Pentaho BI, Mondrian, Oracle Business Intelligence Enterprise Edition (OBIEE), JsHypercube and Jedox

Advantages of OLAP

  • High speed of data processing
  • Aggregated and detailed data
  • Multidimensional data representation
  • Using familiar business expressions
  • "What-if" scenarios
  • Flat learning curve

Disadvantages of OLAP

  • High cost
  • OLAP is relational
  • Computation capability
  • Some potential risk

Extract, Transform, and Load (ETL)

  • ETL stands for Extract, Transform, and Load
  • ETL is used in data warehousing to extract data from databases or source systems and after transforming, places the data into a data warehouse
  • ETL is a combination of three database functions

ETL Process

  • Step 1: Extraction - Preferred data are identified and extracted from various source systems like databases, applications, and flat files
  • Data extraction can be done by running jobs during non-business hours
  • Step 2: Transformation - Extracted data that cannot be directly loaded into the target system is transformed
  • Based on business rules, transformations are done before loading the data, correcting it, removing incorrect data, and fixing errors
  • Step 3: Loading - All gathered information is loaded into the target Data Warehouse tables

Data Extraction Strategies

  • Full Extraction: Whole data gets loaded from sources into data warehouses
  • Partial Extraction (with update notification): Known as delta, where only the data being changed is extracted and updates data ware houses
  • Partial Extraction (without update notification): Specific required data is extracted from sources according to load in data warehouses instead of extracting whole data

Types of Loading

  • Initial Load populates all the Data Warehouse tables
  • Incremental Load applies ongoing changes as needed periodically
  • Full Refresh erases the contents of one or more tables and reloads with fresh data

ETL Tools

  • ETL tools include RightData, Informatica Data Validation, QuerySurge, ICEDQ, Datagaps ETL Validator, QualiDI, Talend Open Studio for Data Integration, Codoid's ETL Testing Services, and Data Centric Testing
  • Other ETL tools are SSISTester, TestBench, GTL QAceGen, Zuzena Automated Testing Service, DbFit, AnyDbTest, and 99 Percentage ETL Testing

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Database Technologies Overview
18 questions
lecture 9
29 questions

lecture 9

CelebratoryNewton4365 avatar
CelebratoryNewton4365
Use Quizgecko on...
Browser
Browser