Lesson 3: Design Layers in Data Processing Architecture (Big Data Analytics) PDF

Document Details

SuaveAgate4242

Uploaded by SuaveAgate4242

2019

Raj Kamal and Preeti Saxena

Tags

big data analytics data architecture data processing computer science

Summary

This presentation outlines the design layers in big data processing, covering data sources, ingestion, storage, processing, and consumption.

Full Transcript

Lesson 3 Design Layers in Data Processing Architecture “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 1 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India...

Lesson 3 Design Layers in Data Processing Architecture “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 1 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Big Data Architecture “Big Data architecture is the logical and/or physical layout/structure of how Big Data will be stored, accessed and managed within a Big Data or IT environment” Techopedia “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 2 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Big Data Architecture Logically defines how Big Data solution will work, the core components (hardware, database, software, storage) used, flow of information, security and more “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 3 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Figure 1.2 Design of logical layers in a data processing architecture “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 4 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Lowest Layer L1 Considers amount of data needed at ingestion layer 2 (L2) and either Push from L1 or pull by L2 as per the mechanism for the usages Source data-types: Database, files, web or service Source formats, i.e., semi-structured, unstructured or structured. “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 5 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Data Ingestion and Acquisition Layer L2 Considers Ingestion and ETL processes either in real time, which means store and use the data as generated, or in batches Batch processing is using discrete datasets at scheduled or periodic intervals of time. “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 6 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Data Storage Layer L3 Data storage type (historical or incremental), format, compression, incoming data frequency, querying patterns and consumption requirements for L4 or L5 Data storage using Hadoop distributed file system or NoSQL data stores—HBase, Cassandra, MongoDB “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 7 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Data Processing Layer L4 Data processing software such as MapReduce, Hive, Pig, Spark, Spark Mahout, Spark Streaming Processing in scheduled batches or real time or hybrid Processing as per synchronous or asynchronous processing requirements at L5. “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 8 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Data Consumption Layer L5 Data integration Datasets usages for reporting and visualization, Analytics (real time, near real time, scheduled batches), BPs, BIs, knowledge discovery Export of datasets to cloud, web or other systems “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 9 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India Summary We learnt Five Design Layers L1: Identification of Internal and External Sources of Data for ingestion and acquisition L2 Ingestion and Acquisition Layer “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 10 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India … Summary We learnt: L3 Data Storage in Required formats for processing at L4 L4 Data Processing Layer L5 data consumption (usage) layer “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 11 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India End of Lesson 3 on Design Layers in Data Processing Architecture “Big Data Analytics “, Ch.01 L03: Introduction To... Big Data Analytics 2019 12 Raj Kamal and Preeti Saxena, © McGraw-Hill Higher Edu. India

Use Quizgecko on...
Browser
Browser