Big Data Presentation PDF

Summary

This presentation provides an overview of Big Data, explaining its definition, its characteristics (volume, variety, speed), and the challenges of handling large datasets. It also discusses who generates big data and how the data model has changed, and concludes with references to relevant websites and tutorials.

Full Transcript

Big Data Contents Definition What’s Driving Big Data ? Characteristics Challenges Big Data 3V’s Big Data Ecosystem Who’s Generating Big Data? Advantages & Disadvantages The Model Has Changed Big Data Defin...

Big Data Contents Definition What’s Driving Big Data ? Characteristics Challenges Big Data 3V’s Big Data Ecosystem Who’s Generating Big Data? Advantages & Disadvantages The Model Has Changed Big Data Definition  No single standard definition… “Big Data” is data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it… Characteristics of Big Data: 1-Scale (Volume)  Data Volume  44x increase from 2009 2020  From 0.8 zettabytes to 35zb  Data volume is increasing exponentially Characteristics of Big Data: Complexity (Varity)  Various formats, types, and structures  Text, numerical, images, audio, video, sequences, time series, social media data, multi-dim arrays, etc…  Static data vs. streaming data  A single application can be generating/collecting many types of data To extract knowledge all these types of data need to linked together Characteristics of Big Data: Speed (Velocity)  Data is begin generated fast and need to be processed fast  Online Data Analytics  Late decisions missing opportunities Examples  E-Promotions: Based on your current location, your purchase history, what you like send promotions right now for store next to you  Healthcare monitoring: sensors monitoring your activities and body any abnormal measurements require immediate reaction Big Data: 3V’s Some Make it 4V’s Who’s Generating Big Data ? 
Social media Scientific Mobile and networks instruments devices Sensor (all of us are (collecting all sorts of (tracking all technology and generating data) objects all the networks data) time) (measuring all The progress and innovation is no longer hindered by the ability to collect data kinds of data) But, by the ability to manage, analyze, summarize, visualize, and discover knowledge from the collected data in a timely manner and in a scalable fashion The Model Has Changed…  The Model of Generating/Consuming Data has Changed Old Model: Few companies are generating data, all others are consuming data New Model: all of us are generating data, and all of us are consuming data What’s driving Big Data ? - Optimizations and predictive analytics - Complex statistical analysis - All types of data, and many sources - Very large datasets - More of a real-time - Ad-hoc querying and reporting - Data mining techniques - Structured data, typical sources - Small to mid-size datasets Challenges in Handling Big Data  The Bottleneck is in technology  New architecture, algorithms, techniques are needed  Also in technical skills  Experts in using the new technology and dealing with big data References  https://en.wikipedia.org/wiki/Big_data  https://www.oracle.com/big-data/index.html  http://www.sas.com/en_us/insights/big-data/what-is -big-data.html  https://www.tutorialspoint.com/big_data_tutorials.h tml Thank You
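
The volume figures quoted in the transcript (0.8 zettabytes in 2009, 35 zettabytes in 2020, a "44x increase") can be checked with a short calculation. The sketch below is only illustrative: the function names are hypothetical, and the per-year rate assumes growth compounds at a constant rate over the period, which the slides do not state.

```python
# Sanity check of the volume figures quoted in the slides.
# Assumption (not from the slides): growth compounds at a constant yearly rate.

START_ZB = 0.8          # data volume in 2009, in zettabytes (from the slides)
END_ZB = 35.0           # data volume in 2020, in zettabytes (from the slides)
YEARS = 2020 - 2009     # length of the period in years


def growth_factor(start_zb: float, end_zb: float) -> float:
    """Return how many times larger the end volume is than the start volume."""
    return end_zb / start_zb


def annual_growth_rate(start_zb: float, end_zb: float, years: int) -> float:
    """Return the constant yearly growth rate implied by the two volumes."""
    return growth_factor(start_zb, end_zb) ** (1.0 / years) - 1.0


if __name__ == "__main__":
    factor = growth_factor(START_ZB, END_ZB)
    rate = annual_growth_rate(START_ZB, END_ZB, YEARS)
    print(f"Growth factor: {factor:.2f}x")
    print(f"Implied yearly growth: {rate:.1%}")
```

The computed factor rounds to the 44x quoted on the volume slide, and the implied yearly rate gives a rough sense of what "increasing exponentially" means under the constant-rate assumption.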
