Intro to Data Science Lecture 1: Data Collection

Questions and Answers

What are some examples of sources where lots of data is being collected and warehoused?

Web data, e-commerce, financial transactions, bank/credit transactions, online trading and purchasing, social networks

What is data, according to the definition provided?

An elementary description of a reality or fact

What is the key difference between data and information?

Information is obtained by analysis and interpretation of a collection of data
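
As a minimal sketch of this distinction, the Python snippet below treats a few purchase records as data and derives information from them by aggregation. The records and the `spend_per_customer` helper are hypothetical, made up for illustration, and are not part of the lecture.

```python
from collections import defaultdict

# Hypothetical raw data: each tuple is one elementary fact (customer, amount).
purchases = [("ana", 20.0), ("ben", 5.0), ("ana", 30.0), ("ben", 15.0)]


def spend_per_customer(records):
    """Aggregate raw purchase records into total spend per customer."""
    totals = defaultdict(float)
    for customer, amount in records:
        totals[customer] += amount
    return dict(totals)


# Information: an interpretation obtained by analyzing the collection of data.
totals = spend_per_customer(purchases)
top_customer = max(totals, key=totals.get)
```

No single record says who the biggest spender is; that only emerges once the records are analyzed together.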

What is knowledge, according to Webster's Dictionary?

The fact or condition of knowing something with familiarity gained through experience or association

What is decision-making, and how does it relate to data analytic thinking?

Decision-making is the process of making choices by identifying a decision, gathering information, and assessing alternatives. Data analytic thinking bases those decisions on analysis of data.

What is the primary focus of data analytic thinking, and what does it enable?

Making decisions based on analysis of data, which enables informed decision-making

What is the primary objective of replacing intuition with data-driven analytical decisions?

To make more accurate and informed decisions

What is a common application of data analytics in medical treatment, similar to predictive maintenance?

Classifying patients with high risk and taking preventative action

What is the main challenge in forecasting demand for bicycle rentals, leading to under-stocking or over-stocking?

Manual forecasting is difficult because of the complexity of the data
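
To make the forecasting problem concrete, here is a naive moving-average baseline in Python. It is only a sketch under stated assumptions: the daily rental counts are hypothetical, `moving_average_forecast` is an illustrative helper, and the lecture does not say which forecasting method it uses.

```python
def moving_average_forecast(history, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    if window <= 0 or len(history) < window:
        raise ValueError("need a positive window and at least `window` observations")
    recent = history[-window:]
    return sum(recent) / window


# Hypothetical daily rental counts for one station.
daily_rentals = [120, 135, 150, 160, 155, 170]
forecast = moving_average_forecast(daily_rentals, window=3)
```

A baseline like this ignores weather, weekdays, and holidays, which is one reason real demand data is too complex to forecast by hand.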

What is the volume of data processed by Google on a daily basis, according to the data from 2008?

20 PB

What is the primary challenge in storing and analyzing the astronomical data generated by the European VLBI telescopes?

The sheer volume of data generated at a rate of 1 GB per second
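
To get a feel for that rate, the short Python calculation below converts a sustained data rate into a daily volume. It takes the figure in the answer at face value and assumes decimal units (1 TB = 1000 GB); `daily_volume_tb` is a hypothetical helper name.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def daily_volume_tb(rate_gb_per_s):
    """Convert a sustained rate in GB/s to TB/day (decimal units, 1 TB = 1000 GB)."""
    return rate_gb_per_s * SECONDS_PER_DAY / 1000


# At 1 GB per second, one day of observation yields about 86.4 TB.
per_day_tb = daily_volume_tb(1)
```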

What is the main challenge in managing the data generated by AT&T's daily telephone calls?

The storage and real-time analysis of the data

What is Big Data, and how is it defined?

Big Data is any data that is expensive to manage and hard to extract value from.

What are the 3 Vs of Big Data?

The 3 Vs of Big Data are Volume, Velocity, and Variety.

What is the challenge of Big Data in terms of size?

The challenge is that the size of Big Data makes it hard to store, manage, and analyze.

What are some of the ways to extract value from Big Data?

Some ways to extract value from Big Data include aggregation and statistics; indexing, searching, and querying; reporting; BI; data warehousing; and OLAP.
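
Two of these techniques, aggregation/statistics and indexing/searching, can be sketched on a toy scale in Python. The three-document corpus and the `search` helper are hypothetical; real systems apply the same ideas with distributed storage and dedicated engines.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical mini-corpus: document id -> text.
docs = {
    1: "big data needs storage",
    2: "data science extracts insight",
    3: "storage is cheap",
}

# Aggregation and statistics: average document length in words.
avg_words = mean(len(text.split()) for text in docs.values())

# Indexing: an inverted index mapping each word to the documents containing it.
index = defaultdict(set)
for doc_id, text in docs.items():
    for word in text.split():
        index[word].add(doc_id)


def search(word):
    """Return the sorted ids of documents that contain `word`."""
    return sorted(index.get(word, set()))
```

With the index built, `search("data")` looks up a word directly instead of rescanning every document, which is the point of indexing at scale.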

What is the significance of the term 'big' in Big Data?

The term 'big' in Big Data is relative and depends on the context, with 10 terabytes being big for an OLTP system but small for a web search engine.

What are some of the challenges of working with Big Data?

Some of the challenges of working with Big Data include managing and extracting value from the data, as well as integrating and analyzing the diverse data formats and sources.

What is data science defined as, and what type of data does it extract knowledge and insights from?

Data science is defined as a scientific field that uses scientific methods to extract knowledge and insights from structured and unstructured data.

What are the three main business drivers for data science?

The three main business drivers for data science are: desire to optimize business operations, desire to identify business risk, and desire to predict new business opportunities.

What is the focus of data mining, and how does it relate to predictive analytics?

Data mining is the process of discovering useful patterns and trends in large data sets, and it is closely related to predictive analytics, which extracts information from large datasets to make predictions and estimates about future outcomes.

What are the key areas of concentration in data science?

The key areas of concentration in data science include computer science, mathematics and applied mathematics, statistics, solid programming skills, data mining, database storage and management, and machine learning and discovery.

What are some real-life examples of data science in action?

Companies use data science to learn customers' secrets, shopping patterns, and preferences, and can even predict if a woman is pregnant, even if she doesn't want them to know.

What is the relationship between data science and statistical modeling?

Data science uses statistical modeling and probability to analyze data and make predictions.
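
One of the simplest statistical models is a straight line fitted by ordinary least squares. The Python sketch below is an illustration only: the spend and sales figures are hypothetical, `fit_line` is an illustrative helper, and the lecture does not prescribe a particular model.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two or more points and equal-length inputs")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: advertising spend vs. units sold.
spend = [1, 2, 3, 4]
units = [3, 5, 7, 9]
slope, intercept = fit_line(spend, units)

# Use the fitted model to predict units sold at a spend of 5.
predicted = slope * 5 + intercept
```

The fitted slope and intercept summarize the observed data, and plugging in a new spend value turns that summary into a prediction.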
