Descriptive Analytics Explained

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

What does descriptive analytics refer to?

Descriptive analytics refers to the interpretation of historical data to better understand changes that occur in a business.

What are commonly reported financial metrics a product of?

Descriptive analytics

How does descriptive analytics work?

Descriptive analytics takes a full range of raw data and parses it to draw conclusions that managers, investors, and other stakeholders may find useful and understandable.

What is required to obtain an informed view of a company's sales performance?

<p>The larger context including targeted growth.</p> Signup and view all the answers

Descriptive analytics is rarely industry-specific.

<p>False (B)</p> Signup and view all the answers

What is descriptive analytics used for?

<p>Descriptive analytics is used to gain valuable insight into how companies are performing by comparing its performance and position in the marketplace with its competitors.</p> Signup and view all the answers

What are the two primary methods by which data is collected for descriptive analytics?

<p>Data aggregation and data mining.</p> Signup and view all the answers

What is a form of descriptive analytics created by taking three data points (net income, dividends, and total capital)?

<p>Return on invested capital (ROIC).</p> Signup and view all the answers

What is the first step involved in implementing descriptive analytics into a business strategy?

<p>Identifying which metrics to analyze.</p> Signup and view all the answers

What is the main benefit of employing descriptive analytics in the corporate workflow?

<p>It simply disseminates information and provides all major stakeholders with a way to understand complex ideas.</p> Signup and view all the answers

Descriptive analytics can be used to determine how future market forces may affect a business.

<p>False (B)</p> Signup and view all the answers

What can stakeholders choose which may result bias?

<p>Favorable metrics to analyze and ignore others.</p> Signup and view all the answers

What is a 'pro' of descriptive analytics?

<p>Breaks down information, so it is easy to understand.</p> Signup and view all the answers

There will always be a need for descriptive analytics, as it provides important information in an easy-to-grasp format.

<p>True (A)</p> Signup and view all the answers

How does predictive analytics make predictions?

<p>Through the use of statistics and modeling.</p> Signup and view all the answers

What does prescriptive analytics allow companies to do?

<p>Use technology to analyze important data to determine what they need to do to achieve specific results.</p> Signup and view all the answers

What does diagnostic analytics involve?

<p>The use of data to understand the relationship between variables and why certain trends exist.</p> Signup and view all the answers

What question does descriptive analytics answer?

<p>&quot;What happened?&quot;</p> Signup and view all the answers

What question does predictive analytics answer?

<p>&quot;What will happen?&quot;</p> Signup and view all the answers

What can companies analyze using descriptive analytics?

<p>Various metrics (financial and non-financial) during a specific reporting period.</p> Signup and view all the answers

Which of the following are commonly used algorithms for descriptive analytics?

<p>All of the above (F)</p> Signup and view all the answers

What is the first step in the descriptive analytics process?

<p>Data collection.</p> Signup and view all the answers

What must data collection be followed by to ensure accurate and reliable analysis?

<p>Thorough data cleansing and preparation.</p> Signup and view all the answers

What does calculating summary measures such as averages, totals, percentages, or ratios involve?

<p>Summarizing data to provide key insights.</p> Signup and view all the answers

What does descriptive analytics include to understand how variables or metrics have changed over time?

<p>Analyzing historical trends.</p> Signup and view all the answers

How must the insights and findings derived from the descriptive analytics process be communicated?

<p>Effectively, typically through reports or visual dashboards.</p> Signup and view all the answers

Examples of Descriptive Analytics include Sales performance analysis, Customer segmentation and Website analytics.

<p>True (A)</p> Signup and view all the answers

What is Data?

<p>Data is a collection of measurements and facts and a tool that helps an individual or a group of individuals reach a sound conclusion by providing them with some information.</p> Signup and view all the answers

What is Data Collection?

<p>Data collection is the process of collecting and evaluating information or data from multiple sources to find answers to research problems, answer questions, evaluate outcomes, and forecast trends and probabilities.</p> Signup and view all the answers

What question is important to answer before an analyst begins collecting data?

<p>All of the above (D)</p> Signup and view all the answers

Data can be classified into two types, what are they?

<p>Primary Data and Secondary Data.</p> Signup and view all the answers

_____: An investigator is a person who conducts the statistical enquiry.

<p>Investigator</p> Signup and view all the answers

In order to collect information for statistical enquiry, an investigator needs the help of some people. These people are known as _____.

<p>enumerators</p> Signup and view all the answers

A respondent is a person from whom the _____ is collected.

<p>statistical information required for the enquiry</p> Signup and view all the answers

______: It is a method of collecting information from individuals.

<p>Survey</p> Signup and view all the answers

The investigator asks questions either directly from the source or from its _____ links.

<p>indirect</p> Signup and view all the answers

In _____, the investigator makes direct contact with the person from whom he/she wants to obtain information.

<p>direct personal investigation</p> Signup and view all the answers

In _____, the investigator does not make direct contact with the person from whom he/she needs information.

<p>Indirect Oral Investigation</p> Signup and view all the answers

The observation method can influence subjects' behavior.

<p>True (A)</p> Signup and view all the answers

What is the mailing method?

<p>This method involves mailing the questionnaires to the informants for the collection of data. (B)</p> Signup and view all the answers

What is 'advantage' of questionnaires?

<p>Can reach a large audience quickly and cost-effectively. (A)</p> Signup and view all the answers

What studying user interactions with a product in a natural setting relevant to?

<p>Suitable Use Case (B)</p> Signup and view all the answers

What the 'cause' and effect realtionships with high precision relates to?

<p>Experiments: Advantage (A)</p> Signup and view all the answers

How many participants engages on a focus group?

<p>A group of 6-12 participants engages in a guided discussion led by a moderator who asks open-ended questions to elicit opinions, attitudes, and perceptions.</p> Signup and view all the answers

What does secondary data save?

<p>Time and resources</p> Signup and view all the answers

Which of the follow are government publications?

Signup and view all the answers

Name two primary methods by which data is collected for descriptive analytics.

<p>Data aggregation and data mining.</p> Signup and view all the answers

What question does descriptive analytics try to answer?

<p>&quot;What happened?&quot;</p> Signup and view all the answers

What question does predictive analytics attempt to answer?

<p>&quot;What will happen?&quot;</p> Signup and view all the answers

Which of the following algorithms is NOT commonly used for descriptive analytics?

<p>Genetic Algorithms (B)</p> Signup and view all the answers

Name the first step in the descriptive analytics process.

<p>Data collection.</p> Signup and view all the answers

What is the second step in the descriptive analytics process?

<p>Cleaning and preparation.</p> Signup and view all the answers

Descriptive analytics is a one-time process.

<p>False (B)</p> Signup and view all the answers

According to a February 2023 report by Global Market Estimates, which of these is a prominent player in the data analytics market?

<p>IBM (A)</p> Signup and view all the answers

Asia Pacific is expected to hold the leading data analytics market share from 2023 to 2028.

<p>False (B)</p> Signup and view all the answers

What is the term for a collection of measurements and facts that helps individuals reach a sound conclusion by providing them with information?

<p>Data</p> Signup and view all the answers

What are the two general classifications of data?

<p>Qualitative and quantitative (B)</p> Signup and view all the answers

Match the following terms with their descriptions:

<p>Data = A tool that provides information to help an investigator understand a problem Investigator = A person who conducts the statistical enquiry Enumerator = Someone who helps an investigator to collect information for statistical enquiry Respondent = A person from whom the statistical information required for the enquiry is collected</p> Signup and view all the answers

Name one of the main advantages of primary data.

<p>Provides current, relevant, and specific information tailored to the researcher's needs, offering a high level of accuracy and control over data quality.</p> Signup and view all the answers

Fill in the blank: _______ involves collecting data personally from the source of origin.

<p>Direct Personal Investigation (A)</p> Signup and view all the answers

Which data collection method can reach a large audience quickly and cost-effectively, but may yield biased or inaccurate responses?

<p>Questionnaires (B)</p> Signup and view all the answers

What is a disadvantage of using focus groups for data collection?

<p>Results can be influenced by dominant participants or groupthink, and the findings are not easily generalizable due to the small, non-representative sample size.</p> Signup and view all the answers

What is the advantage of using secondary data?

<p>It is readily available and often free or less expensive to obtain compared to primary data. It saves time and resources since the data collection phase has already been completed.</p> Signup and view all the answers

When collecting secondary data, you do not need to adjust the data in order to use it for your current study.

<p>False (B)</p> Signup and view all the answers

What does the acronym DDDM stand for?

<p>Data-driven decision-making.</p> Signup and view all the answers

How can retailers use customer data extensively?

<p>To build targeted marketing campaigns and enhance recommendation engines.</p> Signup and view all the answers

What kind of algorithms are used by financial institutions to detect and prevent fraud?

<p>Machine learning (ML) algorithms.</p> Signup and view all the answers

What is the term for using geographic information system (GIS) technology to optimize the site selection strategy?

<p>Precision site selection strategy.</p> Signup and view all the answers

What is the first step in the data-driven decision-making process?

<p>Define objectives.</p> Signup and view all the answers

What can poor-quality data lead to?

<p>Inaccurate analyses and misguided decisions, undermining the value of data-driven strategies.</p> Signup and view all the answers

Name the five data analysis types used in data-driven decision-making.

<p>Descriptive, diagnostic, predictive, prescriptive, and exploratory analysis.</p> Signup and view all the answers

What is the goal of descriptive analysis?

<p>To describe and summarize historical data through data aggregation and mining, providing insights into past performance.</p> Signup and view all the answers

What is the goal of diagnostic analysis?

<p>To determine why certain events occurred, involving data discovery, mining and identifying correlations to uncover the root causes of trends or incidents.</p> Signup and view all the answers

What is the goal of predictive analysis?

<p>To forecast future trends or outcomes based on historical data.</p> Signup and view all the answers

What is prescriptive analysis?

<p>Recommending actions based on data. This type combines predictive analytics with optimization algorithms to suggest the best course of action.</p> Signup and view all the answers

What is the goal of exploratory analysis?

<p>To discover patterns, relationships or anomalies in data without specific hypotheses.</p> Signup and view all the answers

What is Inferential Analysis?

<p>Uses a data sample to make inferences about a population.</p> Signup and view all the answers

What is Qualitative Analysis?

<p>Focuses on non-numeric data to understand concepts, opinions or experiences.</p> Signup and view all the answers

What is Data Cleaning?

<p>The process of identifying and correcting errors and inconsistencies in raw data sets to improve data quality.</p> Signup and view all the answers

High-quality or “clean” data is NOT crucial for effectively adopting artificial intelligence (AI) and automation tools.

<p>False (B)</p> Signup and view all the answers

What are the four steps in data cleaning?

<p>Standardization, Addressing outliers, Deduplication, Addressing missing values.</p> Signup and view all the answers

____ are data points that deviate significantly from others in a data set, caused by errors, rare events or true anomalies.

<p>Outliers</p> Signup and view all the answers

What is Normalization?

<p>A scaling method that reduces duplication in which the numbers are scaled and moved between 0 and 1.</p> Signup and view all the answers

____________ is crucial because it enables reliable data transmission across various systems.

<p>Standardization</p> Signup and view all the answers

Provide a simple example of aggregated data.

<p>The sum of your business's total sales in the past three months.</p> Signup and view all the answers

In time data aggregation, what does the 'granularity' refer to?

<p>The timeframe for collecting data points from one or multiple sources to perform aggregation. It can range from minutes to a month.</p> Signup and view all the answers

What is not a benefit of data summarization?

<p>Increased cost (B)</p> Signup and view all the answers

____ helps to exclude unnecessary or irrelevant data, while ____ combines similar data points to reveal patterns or trends.

<p>Filtering; Aggregation (B)</p> Signup and view all the answers

What is a key benefit of data visualization?

<p>Visual displays of information communicate complex data relationships and data-driven insights in a way that is easy to understand.</p> Signup and view all the answers

What are visual discovery and every day data viz closely aligned with?

<p>Data teams.</p> Signup and view all the answers

Name some common visualization techniques.

<p>Tables, Pie charts and stacked bar charts, histograms, Scatter plots.</p> Signup and view all the answers

Why is it important to set the context with data visualizations?

<p>To ground the audience around why this particular data point is important.</p> Signup and view all the answers

Why is it important to know your audience in data visualization?

<p>Think about who your visualization is designed for and then make sure your data visualization fits their needs.</p> Signup and view all the answers

Exploratory data analysis is used by data scientists to:

<p>all of the above (D)</p> Signup and view all the answers

In the 1970s who developed Exploratory Data Analysis?

<p>American mathematician John Tukey.</p> Signup and view all the answers

What is the 1st step in EDA - Exploratory Data Analysis for Hypothesis Development?

<p>Generating Hypotheses (C)</p> Signup and view all the answers

What are the statistical functions used to identify outliers?

<p>Standard deviation and z-scores (B)</p> Signup and view all the answers

Match terms of the EDA method for analyzing customer churn:

<p>Statistical Functions = Summary statistics Graphing Distribution = Histograms and box plots Examine Relationships = Scatter plot Calculate Relationship Values = Correlation coefficients</p> Signup and view all the answers

Name one EDA language.

<p>Python or R.</p> Signup and view all the answers

Which of the following data analysis software applications is considered simple and versatile, making it suitable for those starting in data science?

<p>Excel (D)</p> Signup and view all the answers

What is data ethics?

<p>The branch of ethics that evaluates data practices with respect to principles of fairness, accountability, and respect for privacy.</p> Signup and view all the answers

Flashcards

What is Descriptive Analytics?

Interpreting historical data to understand business changes, comparing reporting periods within a company or across the industry.

How Descriptive Analytics Works

Analyzing raw data to draw useful, understandable conclusions for managers and stakeholders, and comparing performance to previous periods or competitors.

What does Descriptive Analytics Tell You?

Descriptive analytics helps businesses understand their performance, compare themselves to competitors, identify financial trends, and set individual goals.

How is Descriptive Analytics Used?

Corporate management can use it to identify improvement areas, motivate teams, and monitor performance.

Signup and view all the flashcards

Primary Data Collection Methods

Data aggregation and data mining

Signup and view all the flashcards

Steps in descriptive analytics

Identifying metrics, locating data sources, compiling data, and analyzing data.

Signup and view all the flashcards

Advantages of Descriptive Analytics

Breaks down complex information, allows comparison to competitors.

Signup and view all the flashcards

Disadvantages of Descriptive Analytics

Limited insight into the future, stakeholders may choose metrics with bias.

Signup and view all the flashcards

Related analytic types

Predictive, prescriptive, and diagnostic analytics.

Signup and view all the flashcards

How can Descriptive Analytics benefit companies?

Employing descriptive analytics helps companies identify inefficiencies and make necessary changes.

Signup and view all the flashcards

Algorithms Used

Clustering, association rules, time series, text mining, decision trees, GIS, regression

Signup and view all the flashcards

Descriptive Analytics Process

Collect data, clean and prepare, segment, summarize KPIs, analyze trends, report and visualization.

Signup and view all the flashcards

Examples of Descriptive Analytics

Analyzing sales, customer segments, website data, operational efficiency and finances.

Signup and view all the flashcards

What is Data?

Measurements and facts that help reach sound conclusions by providing information. Can be primary or secondary.

Signup and view all the flashcards

Data Collection

Collecting and evaluating information from multiple sources to answer research questions.

Signup and view all the flashcards

Key Questions to Ask Before Data Collection

The goal or purpose, types of data, collection and processes methods

Signup and view all the flashcards

Types of Data

Descriptions (qualitative) and numbers (quantitative).

Signup and view all the flashcards

Data Types

Primary Data, Secondary Data

Signup and view all the flashcards

Methods of Collecting Primary Data

Direct personal investigation, indirect oral and local sources/correspondents, questionnaires and schedules.

Signup and view all the flashcards

Interviews

Collect data through one-on-one conversations.

Signup and view all the flashcards

Questionnaires

Mailing methods or enumerators methods

Signup and view all the flashcards

Observations

Watching behavior, events, or condtions in natural setting

Signup and view all the flashcards

Experiments

Manipulating variables in a controlled environment to determine the effect.

Signup and view all the flashcards

Focus Group

Gathering a small group to discuss a topic, guided by a moderator.

Signup and view all the flashcards

Collecting Secondary Data

Government, semi-government and trade-association publications; journals and papers plus international publications.

Signup and view all the flashcards

What is Data-Driven Decision-Making (DDDM)

Analyze, gather and interpret various points of data used to inform future business decisions.

Signup and view all the flashcards

Data driven decisions in practice

Sales-performance, Customer-Segmentation, and website analytics.

Signup and view all the flashcards

What is Data Cleaning?

Identifying and correcting errors to ensure data is accurate, complete, consistent and usable.

Signup and view all the flashcards

Data Cleaning Techniques

Standardization, Addressing outliers and missing values plus deduplication and validation.

Signup and view all the flashcards

Automating Data Cleaning

Using tools and software to streamline error detection, correction and standardization.

Signup and view all the flashcards

Machine Learning for Data Cleaning

Leveraging ML models to identify outliers/anomalies or predict missing values.

Signup and view all the flashcards

What is Data Normalization?

Altering the values of numerical columns in dataset with standard scale.

Signup and view all the flashcards

What is Data Standardization?

A statistical method for rescaling that meets standard distributions.

Signup and view all the flashcards

When to use Data Normalization?

Processes Raw, Uneven Data Points

Signup and view all the flashcards

When to use Data Standardization?

Works for Points of Data are Gaussian distribution

Signup and view all the flashcards

What is Data Aggregation?

Collect raw data from different sources in central repository and present in summarized

Signup and view all the flashcards

Benefits of Data Aggregation

Optimize database, enable range of data, high level view of key insights.

Signup and view all the flashcards

Steps in Data Aggregation Process?

Collect data, process, summarize

Signup and view all the flashcards

Time Aggregation

Summarizing data from a single source over a specified time.

Signup and view all the flashcards

Spatial Aggregation

Collecting data from various resources across different locations during a specific period.

Signup and view all the flashcards

Study Notes

  • Descriptive analytics interprets historical data to understand business changes.
  • It describes using historical data to compare reporting periods within a company or across the industry.
  • Common financial metrics include year-over-year pricing changes, month-over-month sales growth, and total revenue per subscriber.
  • Descriptive analytics processes raw data to draw understandable conclusions for managers, investors, and stakeholders; it provides a picture of past performance.
  • The analytics compare an organization's performance with others in the same industry.
  • Performance metrics flag strengths and weaknesses to inform management strategies.
  • Context is needed to understand reports; a $1 million sales report requires knowing if that is a decline or increase.
  • Larger context, including targeted growth, is required for informed views of sales performance.
  • Descriptive analytics is a core part of business intelligence and has industry-specific and broadly accepted measures.
  • Companies use these analytics to gain valuable insights into their performance and position in the market, and to determine financial trends.
  • It helps companies understand their operational efficiency and identify areas for improvement, including motivating different teams to implement changes.
  • Data aggregation and data mining are two primary methods of data collection used for descriptive analytics.

How to Implement Descriptive Analytics

  • Identify metrics to analyze and the time frame, then find all the data from internal and external sources, including databases.
  • The identified data is compiled and formatted and datasets and figures are analyzed with different tools.
  • All data is presented to stakeholders with visual aids like charts and videos for insight into the company's direction.
  • Return on invested capital is a form of descriptive analytics, created by taking net income, dividends, and total capital, then turning it into an understandable percentage.

Advantages

  • Descriptive analytics disseminates information and helps stakeholders understand complex ideas through visuals.
  • Stakeholders see how a company compares to others by production costs, revenue streams, and product offerings.
  • Companies see areas for improvement in their own business plans/models.

Disadvantages

  • These analytics do not help understand what to expect in the future, to account for market forces, or to assess variables that may affect them in the future.
  • Stakeholders may choose favorable metrics, creating bias that affects the perception of profitability, ignoring areas that require change.

Competing Analytics Methods

  • Newer fields emphasize predictive, prescriptive, and diagnostic analytics to model outcomes and suggest actions that maximize positive outcomes while minimizing negatives.
  • Predictive attempts to make predictions through statistics and modeling, using current and past data to determine the likelihood of similar future outcomes.
  • Employing predictive can help companies identify and address inefficiencies, and find better ways to utilize resources.
  • Prescriptive analytics allows companies to analyze data and find what needs to be done to achieve specific results, while also considering past and current performance.
  • Stakeholders using prescriptive can better make decisions across any timeline and determine investment in R&D, product offerings, or whether to enter a new market.
  • Diagnostic analytics determines why a trend exists, manually or with computer software, and figures out the root cause of events to make changes.

Applications of Descriptive Analytics

  • Companies can draw comparisons with other reporting periods to identify inefficiencies in their operations and make changes for the future.
  • It answers the "What happened?" question, while Predictive answers "What will happen?" by using historical data to figure out how to improve.
  • Predictive helps companies understand how changes will impact future performance, and these types work together.
  • Companies measure audience engagement through social media or analyze financial metrics.
  • Measuring social media engagement reveals data on which campaigns or product launches lead to traffic to their sites and referrals.

Algorithms

  • Clustering groups similar data points and identifies patterns.
  • Association rule mining unveils links between variables and items, which is useful in market basket analysis and recommendation systems.
  • Time series gauges patterns, trends, and seasonality in time-dependent data.
  • Text Mining & NLP analyses sentiment, topics from unstructured text data (customer reviews, social media).
  • Decision trees make hierarchical structures representing decision rules to classify data and highlight key features.
  • Geographic information systems are used to analyze spatial data by mapping patterns and trends to specific locations.
  • Regression models relationships between dependent and independent variables.
  • Data mining identifies unusual patterns.
  • The use of specific algorithms is determined by the type of data, the analysis objectives, and the industry and application context.

Descriptive Analytics Process Steps

  • Data is collected from sources like databases and spreadsheets.
  • Identifying and resolving missing values, inconsistencies, duplicates, and outliers.
  • Analysts explore the data using summary statistics, data visualization, and data analysis to identify patterns.
  • Segmentation divides datasets into subsets based on demographics, geography, and time for focused analysis.
  • Summary measures show averages, percentages. Key performance indicators evaluate business process, product or service.
  • Historical trends are analyzed to understand how variables/metrics have changed over time for patterns and seasonality.
  • Findings are communicated through reports or dashboards with summary statistics, visuals, and descriptions.
  • Descriptive analytics requires continuous data monitoring and updates to capture patterns and trends; monitors sales data, and updates analyses.

Examples

  • Analyzing prior sales data can identify top-selling products, impacts of pricing strategies, sales channels, and regions.
  • By assessing sales data, one can look at a drop in category sales, and investigate causes (changing preferences or increased competition), or by examining sales in the online sector.
  • Segmentation improves marketing personalization and retention.
  • Descriptive analytics can segment customers by demographics, purchase behavior, and engagement to tailor marketing strategies.
  • Data leads to insights, optimized website operation, and higher conversion rates and user experience.
  • Examination of manufacturing production data can lead to optimizing resources, staff, or processes.
  • Financial insights are gained into revenue, expenses, and profitability.

Data Collection: Terms and Methods

  • Data is a tool that helps people reach a sound conclusion; it understands socio-economic problems.
  • Data collection gathers information from multiple sources to find answers to research problems, evaluate outcomes, etc
  • Analysts must ask self what the purpose is; what kind of data, plus what methods and procedures to collect with.
  • There is qualitative (descriptions) and quantitative data (numbers).
  • Data helps investigators understand a problem by providing required information as primary or secondary data.
  • Investigators conduct the statistical inquiry and enumerators gather and provide statistical information for data collection.
  • Respondents are people from whom data information is then collected.
  • Surveys collect information from individuals to describe usefulness, quality, price, and kindness asking about a product/service.

Methods of Collecting Primary Data

  • Direct personal investigation obtains data personally from the source.
  • Indirect oral investigation collects data orally from someone other than/indirectly connected to the person with the information.
  • Information from local sources appoints correspondents who collect data across various areas.
  • Information is collected via questionnaires, mailing, and enumerator's methods.

Primary vs. Secondary Data

  • Primary data is collected directly from first-hand sources for a specific research purpose though methods including surveys, interviews, experiments, observations, and focus groups.
  • Provides current and specific information with accuracy and control; Observer bias can influence the results; The investigator may influence subjects' behavior.
  • Customer satisfaction surveys and market research suits online, paper, or face-to-face questionnaires that collect data with a questionnaire.
  • Observations record behaviors/events.
  • Studying "product user" interactions is good for assessing classroom dynamics, plus wildlife behavior monitoring.
  • Experiments analyze what happened, and the cause and effect can be artificial, and limit the ability to generalize the findings.
  • The investigator analyzes the efficiency of drugs or marketing campaigns. Gathers with a "moderator" to gather feedback, thought, emotion, and/or insight.

Secondary Data, Sources, and Selection

  • Secondary data is collected, processed, and published and saves time/resources.
  • Can be collected through different published and unpublished sources such as government publications of statistical data, or published data related to health and education plus, newspapers and magazines providing statistical data.
  • Research institutions publish activities/findings, but secondary data requires adjusting to suit the objective and lacks the origin quality and is lower in cost to collect.
  • Unpublished sources with data is in the form of research work or records maintained.

Data-Driven Decision Making

  • DDDM uses data and analysis to make business decisions, using customer feedback and trends, collect processes enabling businesses to find success.
  • Generates real-time insights to optimize performance and test new strategies and provides a solid foundation, which reduces uncertainty.
  • Results come from customer engagement and satisfaction, and better strategic planning, using extensive data.
  • It is used to personalize experiences, product suggestions or pricing strategies and reduce customer churn, the platform is driven using algorithms.
  • Financial institutions use machine learning algorithms to identify/prevent fraud, while utility companies use them to predict energy consumption.
  • Data insights formulate realistic/strategic plans.
  • E-commerce retailers identify untapped customer segments and develop services to identify markets.
  • By analyzing sales data, organizations discover specific products that had a spike before an event.

Decision Making

  • Data-driven decisions minimize bias and increase objectivity.
  • Implement debiasing techniques, and raise awareness of biases to increase transparency.
  • Implement and measure the impact once you: define objectives, identify and collect data, explore data by cleaning its data and see results, and analyze with methodologies or patterns.
  • Key findings are reviewed in the context to form actionable insights and will drive business success.
  • Implement and evaluate resources allocation.

Challenges of Data-Driven Decision Making

  • Organizations avoid data with quality.
  • Data illiteracy can lead to misinterpretations and sub-optimal decisions, so provide training while over reliance on historical data can be problematic,
  • Confirmation bias arises as decision-makers may interpret data selectively.
  • Data types support preconcieved notions while Neglecting data security poses a risk.
  • Descriptive analysis summarizes the history of past performance (sales).
  • Diagnostic analysis determines why events occurred, mining and identifying correlations to uncover the root causes (drop in sales, customer complaints,).
  • Predictive analysis is used to predict sales and customer relation management.
  • Prescriptive will suggest the best course of action, from which is derived supply chain optimization.
  • Exploratory analysis helps identify markets, and cluster and dimensions data.

Data Preparation and Cleaning

  • Data cleaning, or data scrubbing, identifies and corrects errors and inconsistencies in raw data.
  • Processes address duplicates, missing values, syntax errors, and structural errors, as well as securing it.
  • Organizations with clean data make reliable decisions and respond to changes.
  • Cleaning is essential for data science and converts format for analysis.
  • Underpins the success of AI and ensures machine learning algorithms, leading to robust predictions.

Benefits of Cleaning Data Analysis

  • Informed decision-making aligns good quality data with business goals.
  • Improves productivity, cost efficiency, data compliance, enhanced model performance, and data consistency.
  • Data assessment reviews a data set to identify quality issues to standardization.
  • A common discrepancy is the date format (MM-DD-YYYY vs DD-MM-YYYY).
  • Outliers' extreme values distort analysis.
  • Data deduplication is a streamlining process that reduces redundant data when the data entry is repeated twice.
  • Missing values are values not present and professionals might replace missing data, otherwise known as data imputation.
  • The final review verifies that the data is clean and often uses manual inspection.

Guiding Principles

  • "Garbage in, garbage out" is when Data Analysts will get unreliable results.
  • Data cleaning enhances efficiency and reduction while well-cleaned data streams processing.
  • Feature engineering transforms into a more suitable format.
  • Scaling is crucial when different features have vastly different sales and may include min-max scaling to transform based on normal distributions.
  • Encoding categorical data is done through one-hot that converts 'red', 'blue', 'green', can be converted into three features, that are all colors.
  • Advanced cleaning includes automation and ML data.
  • Machine learning can be leveraged to refine data that automates cleaning and predicts modeling.
  • Data is prepared through altering the numerical columns in the dataset to a standard scale through data normalization.
  • Normalization is used to arrange the data and is a scaling method to reduce duplication.
  • Used to remove characteristics, normalization and standardization occur.

Data Aggregation Steps

  • Data aggregation streamlines performance and provides a high-level view; data is collected in a central repository.
  • Requires data loading, processing and summarization.
  • Data summarization is a process that Reduces large data sets, making them more concise, while retaining essential information.
  • Benefits come in the form of improved analytics, and decision-making, time savings, and increased accuracy.
  • Common techniques include aggregation and data sampling, by Selecting a subset of the points from a data set.
  • Clustering points to determine what the right summarization technique to make decision in time.
  • Use filtering techniques for data summarization to create a concise summary, the technique is applied to focus only on a category or region.
  • Sampling, data Visualization, dimensionality reduction and text Summarization.

Additional Methods of Summarizing

  • Mean, median, and more allow a data summarization.
  • Range is measured in the difference between the highest and lowest values.
  • Visuals are key - bar chart, histogram and box plots.
  • Use frequency tables for data Summarization by organizing categorical data, the table then displays number of occurrences.
  • When creating keep categories accounted for.
  • Tables and pie charts simplify categorical.
  • Moving averages and trend analysis smoothens the time.
  • Data analysis may give Business insights across scientific research.

Data Visualization

  • Utilizes graphics (charts, plots, infographics, animations) for easy complex data understanding.
  • May convey organizational structure. Used to generate ideas, illustrate and explain processes, aid in discovery, and serve everyday needs.
  • Early use was in navigation while dashboards are used to report performance metrics.
  • Visuals include: Tables, Pie and bar charts, Line charts, and Scatter plots.
  • Heat maps as a graphical representation that helps with behavioral data.
  • Provide general background, identify the audience, choose an effective visual; keep it simple.

Exploratory Data Analysis or EDA

  • Used by data scientists and helps best manipulate data.
  • EDA can reveal data beyond formal testing, that may help the validity of the statistical techniques or create new hypotheses.
  • One can Generate Hypotheses, Validate Assumptions or Identifying Data.
  • Important factors include: Center, Dispersion, Distribution, and Visualisation
  • One may then visualize and determine the types of EDA data by identifying whether it is univariate, or multivariate graphical or non-graphical

Exploratory Data Analysis Language

  • The most common programming languages are Python and the "R" language.
  • The data is gathered to a specific criteria or through functions for analysis.

Data Analysis Software Applications

  • Excel is a common tool for data analysis and analysis, such as calculating with the "analysis tool pack."
  • It includes graphing and functions for data performance.
  • Python, routinely ranked as the most programming in the world, use it to streamline models, visualize, and analyze data using built-in data analytics tools.
  • A key appeal to professionals is the amount of libraries such as Panda.
  • tableau is primarily for business analytics and intelligence due to the easy seamless turning of the data.
  • MySQL, used for websites is open and secures what's happening .
  • sas is used to retrieve an intuitive graphical interface(GUI), that enables and creates

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Descriptive Analytics in Business Intelligence
12 questions
Descriptive Analytics
37 questions

Descriptive Analytics

PeacefulExuberance avatar
PeacefulExuberance
Descriptive Analytics Explained
59 questions

Descriptive Analytics Explained

CharismaticEpiphany6818 avatar
CharismaticEpiphany6818
Use Quizgecko on...
Browser
Browser