Questions and Answers
What are the steps involved in solving problems with data according to the lecture?
Collect and understand the data, clean and format it, then use it to create a solution.
Where does internal data come from, as mentioned in the lecture?
Business-centric data in organizational databases that record day-to-day operations, and scientific or experimental data.
What are the sources of existing external data mentioned in the lecture?
Public government databases, stock market data, and Yelp reviews.
What are the cautionary notes mentioned about using online data, as per the lecture?
What are the two methods mentioned for obtaining online data in the lecture?
From what sources can internal data be obtained, as mentioned in the lecture?
What caution is mentioned about using online data, as per the lecture?
Flashcards
Internal data source
Data that originates within an organization, from its business operations or from its scientific and experimental work.
External data source
Data collected from sources outside the organization, such as public databases or stock markets.
Online data caution
Not all accessible online data is suitable for use; it depends on the quality and the source.
Web scraping
Data problem-solving steps
Data collection
Data cleaning
Data formatting
Online data sources (APIs)
Data Analysis/ML Solution
Study Notes
Data Science Overview
- Data science involves solving problems with data, which can be related to scientific, social, or business issues.
- The process of data science includes:
  - Collecting and understanding data
  - Cleaning and formatting data
  - Using data to create a solution through data analysis and/or machine learning
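
The three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sales records, the column names, and the function names `collect`, `clean`, and `analyze` are all hypothetical and do not come from the lecture. The "solution" here is a simple per-store average rather than a machine learning model.

```python
import csv
import io
from statistics import mean

# Hypothetical raw data: daily sales records with messy formatting
# (stray whitespace and one missing value).
RAW_CSV = """store,date,sales
north,2024-01-01, 120
north,2024-01-02,
south,2024-01-01,95
south,2024-01-02,105
"""


def collect(text):
    """Step 1: collect the data (here, read CSV rows from a string)."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Step 2: clean and format (drop rows with no sales figure, convert to numbers)."""
    cleaned = []
    for row in rows:
        value = (row["sales"] or "").strip()
        if not value:
            continue  # skip records with a missing sales figure
        cleaned.append({"store": row["store"].strip(), "sales": float(value)})
    return cleaned


def analyze(rows):
    """Step 3: use the data (here, average sales per store)."""
    by_store = {}
    for row in rows:
        by_store.setdefault(row["store"], []).append(row["sales"])
    return {store: mean(values) for store, values in by_store.items()}


rows = collect(RAW_CSV)
result = analyze(clean(rows))
print(result)
```

Keeping each step in its own function makes it easy to inspect the data between stages, which is where most understanding and cleaning decisions get made.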
Data Sources
Internal Sources
- Data from organizational databases recording day-to-day operations
- Scientific or experimental data
Existing External Sources
- Data available for free or a fee
- Examples include:
  - Public government databases
  - Stock market data
  - Yelp reviews
- Typically, this data is somewhat pre-processed
Collecting Your Own Data
- Beyond the scope of this course
Online Data
- Typically raw data from APIs (e.g. Google Maps API, Facebook API, Twitter API)
- Web scraping:
  - Extracting data from what is displayed on a page or what is contained in the HTML file
- Caution: not all accessible data is suitable to use
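
To make the difference between the two routes concrete, the sketch below reads the same information once from an API-style JSON response and once by scraping HTML. Everything in it is an assumption for illustration: the JSON payload, the HTML page, the class names `business` and `stars`, and the helper names are hypothetical, and no real API or website is contacted. It uses only Python's standard `json` and `html.parser` modules.

```python
import json
from html.parser import HTMLParser

# Hypothetical API response body: structured JSON, ready to parse.
API_RESPONSE = (
    '{"reviews": [{"business": "Cafe A", "stars": 4},'
    ' {"business": "Diner B", "stars": 5}]}'
)

# Hypothetical HTML page displaying the same reviews.
PAGE_HTML = """
<html><body>
  <div class="review"><span class="business">Cafe A</span><span class="stars">4</span></div>
  <div class="review"><span class="business">Diner B</span><span class="stars">5</span></div>
</body></html>
"""


def reviews_from_api(body):
    """API route: the data already arrives as named fields."""
    return [(r["business"], r["stars"]) for r in json.loads(body)["reviews"]]


class ReviewScraper(HTMLParser):
    """Scraping route: pull text out of the <span> tags we care about."""

    def __init__(self):
        super().__init__()
        self.current_field = None  # class of the span being read, if any
        self.fields = []           # (field, text) pairs in page order

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "span" and css_class in ("business", "stars"):
            self.current_field = css_class

    def handle_data(self, data):
        if self.current_field:
            self.fields.append((self.current_field, data.strip()))
            self.current_field = None

    def handle_endtag(self, tag):
        if tag == "span":
            self.current_field = None


def reviews_from_html(html):
    scraper = ReviewScraper()
    scraper.feed(html)
    names = [text for field, text in scraper.fields if field == "business"]
    stars = [int(text) for field, text in scraper.fields if field == "stars"]
    return list(zip(names, stars))


api_reviews = reviews_from_api(API_RESPONSE)
scraped_reviews = reviews_from_html(PAGE_HTML)
print(api_reviews)
print(scraped_reviews)
```

The scraping route depends on the page's markup staying the same, so it tends to be more fragile than the API route. In line with the caution above, check a site's terms of use before collecting its data either way.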