10 Questions
What is one of R's strengths?
The ease with which well-designed publication-quality plots can be produced
What is the primary purpose of the R programming language?
For research in statistical methodology
What is the license under which R is available?
Free Software Foundation's GNU General Public License
What is the benefit of using a cloud computing provider for big data processing?
To have access to a console that can take in specialized commands and parameters
What type of systems often generate big data?
Large, network-based systems
What is the purpose of machine learning in cloud computing providers?
To standardize data in non-standard formats
What is a feature of R's graphics capabilities?
It allows the user to retain full control over design choices
What is the advantage of using R for statistical computing?
It is open source and provides an easy route to participation in statistical methodology research
What is a common feature of cloud computing providers' user interfaces?
A console that can take in specialized commands and parameters
What is one of the products that are usually part of a cloud computing package?
Database management systems
Study Notes
Understanding Big Data
- Big data involves collecting and processing large amounts of data from multiple sources, often in terabytes or petabytes.
- Three main actions are required to make big data work: integration, management, and analysis.
Integration
- Integration involves receiving, processing, and transforming raw data into a usable format for business users and analysts.
Management
- Big data requires significant storage, often using cloud solutions to take advantage of unlimited compute and scalability.
- Data must be stored in a format that can be processed and made available in real-time.
Analysis
- Analysis involves exploring the data and communicating insights across the business in a way that everyone can understand.
- Data visualization tools are used to create charts, graphs, and dashboards to facilitate understanding.
Applications of Big Data
- Big retail stores use big data to track customer spending habits and shopping behavior.
- This information is used to provide personalized product recommendations to customers.
Tableau
- Tableau is a data visualization tool that can connect to multiple data sources and create interactive visualizations without requiring coding knowledge.
- Features of Tableau include powerful data discovery and exploration, support for multiple data sources, and centralized data source management.
R Language
- R is a language and environment for statistical computing and graphics.
- R provides a wide range of statistical and graphical techniques and is highly extensible.
- R is available as free software under the GNU General Public License.
Cloud Computing and Big Data
- Cloud computing providers often use a "software as a service" model to allow customers to easily process data.
- Big data is often generated by large, network-based systems and may require artificial intelligence and machine learning to standardize data in non-standard formats.
Learn how to make big data work by integrating, managing, and processing raw data from various sources to make it usable for analysis
Make Your Own Quizzes and Flashcards
Convert your notes into interactive study material.
Get started for free