Podcast
Questions and Answers
______ (Online Analytical Processing) is a powerful technology behind many Business Intelligence (BI) applications.
______ (Online Analytical Processing) is a powerful technology behind many Business Intelligence (BI) applications.
OLAP
______ is for the data analytic purpose, hence it enables us to analyze information from multiple database systems at the same time.
______ is for the data analytic purpose, hence it enables us to analyze information from multiple database systems at the same time.
OLAP
OLAP is a computing method that allows users to easily extract required data and ______ data in order to analyze it from different points of view.
OLAP is a computing method that allows users to easily extract required data and ______ data in order to analyze it from different points of view.
query
OLAP is basically based on the huge data that is called ______; it collects the required data from the data warehouse.
OLAP is basically based on the huge data that is called ______; it collects the required data from the data warehouse.
The extracted data is cleaned and transformed before being loaded into an ______ server.
The extracted data is cleaned and transformed before being loaded into an ______ server.
An ______ is a data structure that allows fast analysis of data according to the multiple dimensions that define a business problem.
An ______ is a data structure that allows fast analysis of data according to the multiple dimensions that define a business problem.
______ – Also known as drill-up or consolidation, use to summarize operation data along with the dimension.
______ – Also known as drill-up or consolidation, use to summarize operation data along with the dimension.
______ – To perform the analysis in deeper among the dimensions of data.
______ – To perform the analysis in deeper among the dimensions of data.
______ – To perform the analysis take one level of information for display, such as "sales in 2019."
______ – To perform the analysis take one level of information for display, such as "sales in 2019."
______ – To perform the analysis, select data from multiple dimensions to analyze, such as "sales of Laptop in Region 4 in 2019."
______ – To perform the analysis, select data from multiple dimensions to analyze, such as "sales of Laptop in Region 4 in 2019."
______ – To perform the analysis that can gain a new view of data by rotating the data axes of the cube.
______ – To perform the analysis that can gain a new view of data by rotating the data axes of the cube.
______ OLAP uses a relational database management system to keep and control the data.
______ OLAP uses a relational database management system to keep and control the data.
______ OLAP utilizes a multi-dimensional Database (MDDB) for storing and analyzing information.
______ OLAP utilizes a multi-dimensional Database (MDDB) for storing and analyzing information.
______ OLAP is a blend of MOLAP and ROLAP, offering the qualities of both techniques.
______ OLAP is a blend of MOLAP and ROLAP, offering the qualities of both techniques.
______ OLAP is a single-tier, desktop-based OLAP technology
______ OLAP is a single-tier, desktop-based OLAP technology
______ OLAP is an OLAP system accessible via the web browser.
______ OLAP is an OLAP system accessible via the web browser.
______ OLAP helps users to access and analyze OLAP data using their mobile devices
______ OLAP helps users to access and analyze OLAP data using their mobile devices
______ OLAP is created to facilitate management of both spatial and non-spatial data in a Geographic Information system (GIS).
______ OLAP is created to facilitate management of both spatial and non-spatial data in a Geographic Information system (GIS).
______, Micro Strategy, Palo OLAP Server, Apache Kylin and icCube are examples of OLAP Tools.
______, Micro Strategy, Palo OLAP Server, Apache Kylin and icCube are examples of OLAP Tools.
Pentaho BI, ______, Oracle Business Intelligence Enterprise Edition(OBIEE), JsHypercube and Jedox, are examples of OLAP Tools.
Pentaho BI, ______, Oracle Business Intelligence Enterprise Edition(OBIEE), JsHypercube and Jedox, are examples of OLAP Tools.
______ of data processing is a key advantage of using OLAP systems.
______ of data processing is a key advantage of using OLAP systems.
OLAP systems allow businesses to work with both ______ and ______ data for comprehensive analysis.
OLAP systems allow businesses to work with both ______ and ______ data for comprehensive analysis.
A disadvantage of OLAP systems can be the ______ associated with implementation and maintenance.
A disadvantage of OLAP systems can be the ______ associated with implementation and maintenance.
______ stands for Extract, Transform and Load.
______ stands for Extract, Transform and Load.
ETL is a process in data warehousing used to ______ data from the database or source systems.
ETL is a process in data warehousing used to ______ data from the database or source systems.
During ______, data is cleaned, transformed, and made consistent for loading into the data warehouse.
During ______, data is cleaned, transformed, and made consistent for loading into the data warehouse.
Following extraction and transformation, data is placed into the data warehouse during the ______ process.
Following extraction and transformation, data is placed into the data warehouse during the ______ process.
In the ______ extraction method, data from sources is loaded into the data warehouses that show either data warehouse is being populated the first time or no strategy has been made for data extraction.
In the ______ extraction method, data from sources is loaded into the data warehouses that show either data warehouse is being populated the first time or no strategy has been made for data extraction.
______ extraction is also known as delta, where only the data being changed is extracted and update data warehouses.
______ extraction is also known as delta, where only the data being changed is extracted and update data warehouses.
One kind of data loading is the ______ Load, where data is populated to all of the Data Warehouse tables.
One kind of data loading is the ______ Load, where data is populated to all of the Data Warehouse tables.
______ Load applies ongoing changes only when needed periodically.
______ Load applies ongoing changes only when needed periodically.
The ______ loading method involves erasing the contents of tables and reloading with fresh data.
The ______ loading method involves erasing the contents of tables and reloading with fresh data.
______, Informatica Data Validation, QuerySurge, ICEDQ, Datagaps ETL Validator, QualiDI are examples of ETL Tools.
______, Informatica Data Validation, QuerySurge, ICEDQ, Datagaps ETL Validator, QualiDI are examples of ETL Tools.
SSISTester, TestBench, GTL QAceGen, ______, DbFit, AnyDbTest, 99 Percentage ETL Testing are examples of ETL Tools.
SSISTester, TestBench, GTL QAceGen, ______, DbFit, AnyDbTest, 99 Percentage ETL Testing are examples of ETL Tools.
Applications and ______ are identified and extracted in ETL process.
Applications and ______ are identified and extracted in ETL process.
Flashcards
What is OLAP?
What is OLAP?
Online Analytical Processing is a technology behind Business Intelligence applications, enabling data discovery, reporting, complex calculations, and predictive scenarios.
OLAP Defined
OLAP Defined
A computing method allowing users to extract and query data to analyze it from different perspectives. It relies on a data warehouse to collect required data.
What is an OLAP Cube?
What is an OLAP Cube?
A data structure that enables fast analysis of data based on multiple dimensions defining a business problem.
What is Roll-up in OLAP?
What is Roll-up in OLAP?
Signup and view all the flashcards
What is Drill-down in OLAP?
What is Drill-down in OLAP?
Signup and view all the flashcards
What is Slice in OLAP?
What is Slice in OLAP?
Signup and view all the flashcards
What is Dice in OLAP?
What is Dice in OLAP?
Signup and view all the flashcards
What is Pivot in OLAP?
What is Pivot in OLAP?
Signup and view all the flashcards
What is Relational OLAP (ROLAP)?
What is Relational OLAP (ROLAP)?
Signup and view all the flashcards
What is Multidimensional OLAP (MOLAP)?
What is Multidimensional OLAP (MOLAP)?
Signup and view all the flashcards
What is Hybrid OLAP (HOLAP)?
What is Hybrid OLAP (HOLAP)?
Signup and view all the flashcards
What is Desktop OLAP (DOLAP)?
What is Desktop OLAP (DOLAP)?
Signup and view all the flashcards
What is Web OLAP (WOLAP)?
What is Web OLAP (WOLAP)?
Signup and view all the flashcards
What is Mobile OLAP?
What is Mobile OLAP?
Signup and view all the flashcards
What is Spatial OLAP (SOLAP)?
What is Spatial OLAP (SOLAP)?
Signup and view all the flashcards
What is ETL?
What is ETL?
Signup and view all the flashcards
What is Extraction in ETL?
What is Extraction in ETL?
Signup and view all the flashcards
What is Full Extraction?
What is Full Extraction?
Signup and view all the flashcards
What is Partial Extraction (with update notification)?
What is Partial Extraction (with update notification)?
Signup and view all the flashcards
Partial Extraction (without update notification)
Partial Extraction (without update notification)
Signup and view all the flashcards
What is Transformation in ETL?
What is Transformation in ETL?
Signup and view all the flashcards
What is Initial Load?
What is Initial Load?
Signup and view all the flashcards
What is Incremental Load?
What is Incremental Load?
Signup and view all the flashcards
What is Full Refresh?
What is Full Refresh?
Signup and view all the flashcards
Study Notes
Intended Learning Outcomes
- Define OLAP
- Determine the types of OLAP
- Understand OLAP operations
- List OLAP Tools
- Define ETL and understand its process in data warehouse
Online Analytical Processing
- OLAP is a powerful technology behind many Business Intelligence (BI) applications
- OLAP discovers data, provides report viewing capabilities and complex analytical calculations
- OLAP supports predictive "what if" scenarios, budget planning, and forecast planning
- OLAP is for data analytics and enables analysis of information from multiple database systems simultaneously
- OLAP is a computing method that enables users to easily extract required data and query data for analysis from different perspectives
- OLAP is based on the large data in a data warehouse, collecting required data and performing business analysis for decisions to improve profit, sales, brand, and marketing
OLAP Architecture
- Three-tier data warehouse architecture including data sources, data storage, and front-end tools.
- Data flows from source databases through extraction, transformation, and loading into a central data warehouse.
How OLAP Works
- A data warehouse extracts information from multiple data sources and formats, such as text files, Excel sheets, and multimedia files
- The extracted data is cleaned, transformed and then loaded into an OLAP server (or OLAP cube)
- The information is pre-calculated in advance for further analysis in the OLAP server
OLAP Cube
- An OLAP Cube is a data structure that allows fast data analysis based on multiple dimensions which define a business problem.
- A multidimensional cube for reporting sales could be composed of 7 dimensions: Salesperson, Sales Amount, Region, Product, Region, Month, Year.
Basic Analytical Operations of OLAP
- Roll-up (drill-up/consolidation) summarizes operation data along a dimension
- Drill-down performs analysis in deeper dimensions of data. Drilling down from "time period" to "years", "months", and "days" plots sales growth
- Slice performs the analysis and takes one level of information for display, like "sales in 2019"
- Dice performs analysis by selecting data from multiple dimensions
- Pivot performs analysis to gain a new view of data by rotating the cube's data axes
Types of OLAP System
- Relational OLAP (ROLAP) uses a relational database management system to keep and control the data and the servers exist between the database and the user
- Multidimensional OLAP (MOLAP) utilizes a multi-dimensional Database (MDDB) for storing and analyzing information
- Hybrid OLAP (HOLAP) is a blend of MOLAP and ROLAP and offers the qualities of both techniques
- Desktop OLAP (DOLAP) is a single-tier, desktop-based OLAP technology
- Web OLAP (WOLAP) is accessible via a web browser and its architecture consists of a client, middleware, and a database server
- Mobile OLAP helps users access and analyze OLAP data using their mobile devices
- Spatial OLAP (SOLAP) manages spatial and non-spatial data in a Geographic Information System (GIS)
OLAP Tools
- Notable OLAP tools include IBM Cognos, Micro Strategy, Palo OLAP Server, Apache Kylin, Pentaho BI, Mondrian, Oracle Business Intelligence Enterprise Edition (OBIEE), JsHypercube and Jedox
Advantages of OLAP
- High speed of data processing
- Aggregated and detailed data
- Multidimensional data representation
- Using familiar business expressions
- "What-if" scenarios
- Flat learning curve
Disadvantages of OLAP
- High cost
- OLAP is relational
- Computation capability
- Some potential risk
Extract, Transform, and Load (ETL)
- ETL stands for Extract, Transform, and Load
- ETL is used in data warehousing to extract data from databases or source systems and after transforming, places the data into a data warehouse
- ETL is a combination of three database functions
ETL Process
- Step 1: Extraction - Preferred data are identified and extracted from various source systems like databases, applications, and flat files
- Data extraction can be done by running jobs during non-business hours
- Step 2: Transformation - Extracted data that cannot be directly loaded into the target system is transformed
- Based on business rules, transformations are done before loading the data, correcting it, removing incorrect data, and fixing errors
- Step 3: Loading - All gathered information is loaded into the target Data Warehouse tables
Data Extraction Strategies
- Full Extraction: Whole data gets loaded from sources into data warehouses
- Partial Extraction (with update notification): Known as delta, where only the data being changed is extracted and updates data ware houses
- Partial Extraction (without update notification): Specific required data is extracted from sources according to load in data warehouses instead of extracting whole data
Types of Loading
- Initial Load populates all the Data Warehouse tables
- Incremental Load applies ongoing changes as needed periodically
- Full Refresh erases the contents of one or more tables and reloads with fresh data
ETL Tools
- ETL tools include RightData, Informatica Data Validation, QuerySurge, ICEDQ, Datagaps ETL Validator, QualiDI, Talend Open Studio for Data Integration, Codoid's ETL Testing Services, and Data Centric Testing
- Other ETL tools are SSISTester, TestBench, GTL QAceGen, Zuzena Automated Testing Service, DbFit, AnyDbTest, and 99 Percentage ETL Testing
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.