Big Data Management

WellBalancedOrbit avatar
WellBalancedOrbit
·
·
Download

Start Quiz

Study Flashcards

10 Questions

What is one of R's strengths?

The ease with which well-designed publication-quality plots can be produced

What is the primary purpose of the R programming language?

For research in statistical methodology

What is the license under which R is available?

Free Software Foundation's GNU General Public License

What is the benefit of using a cloud computing provider for big data processing?

To have access to a console that can take in specialized commands and parameters

What type of systems often generate big data?

Large, network-based systems

What is the purpose of machine learning in cloud computing providers?

To standardize data in non-standard formats

What is a feature of R's graphics capabilities?

It allows the user to retain full control over design choices

What is the advantage of using R for statistical computing?

It is open source and provides an easy route to participation in statistical methodology research

What is a common feature of cloud computing providers' user interfaces?

A console that can take in specialized commands and parameters

What is one of the products that are usually part of a cloud computing package?

Database management systems

Study Notes

Understanding Big Data

  • Big data involves collecting and processing large amounts of data from multiple sources, often in terabytes or petabytes.
  • Three main actions are required to make big data work: integration, management, and analysis.

Integration

  • Integration involves receiving, processing, and transforming raw data into a usable format for business users and analysts.

Management

  • Big data requires significant storage, often using cloud solutions to take advantage of unlimited compute and scalability.
  • Data must be stored in a format that can be processed and made available in real-time.

Analysis

  • Analysis involves exploring the data and communicating insights across the business in a way that everyone can understand.
  • Data visualization tools are used to create charts, graphs, and dashboards to facilitate understanding.

Applications of Big Data

  • Big retail stores use big data to track customer spending habits and shopping behavior.
  • This information is used to provide personalized product recommendations to customers.

Tableau

  • Tableau is a data visualization tool that can connect to multiple data sources and create interactive visualizations without requiring coding knowledge.
  • Features of Tableau include powerful data discovery and exploration, support for multiple data sources, and centralized data source management.

R Language

  • R is a language and environment for statistical computing and graphics.
  • R provides a wide range of statistical and graphical techniques and is highly extensible.
  • R is available as free software under the GNU General Public License.

Cloud Computing and Big Data

  • Cloud computing providers often use a "software as a service" model to allow customers to easily process data.
  • Big data is often generated by large, network-based systems and may require artificial intelligence and machine learning to standardize data in non-standard formats.

Learn how to make big data work by integrating, managing, and processing raw data from various sources to make it usable for analysis

Make Your Own Quizzes and Flashcards

Convert your notes into interactive study material.

Get started for free

More Quizzes Like This

Use Quizgecko on...
Browser
Browser