Podcast
Questions and Answers
Which of the following is NOT a benefit of using graphs of time series data?
Which of the following is NOT a benefit of using graphs of time series data?
- Identifying the specific causes of fluctuations in a time series (correct)
- Understanding past events
- Identifying trends over time
- Projecting future values for the time series
Which of the following is NOT a source of data for time series analysis?
Which of the following is NOT a source of data for time series analysis?
- Government agencies
- Business database services
- Personal opinions and beliefs (correct)
- Internal company records
What is a primary use of time series data graphs?
What is a primary use of time series data graphs?
- To forecast future values for a single variable
- To identify patterns and trends over time (correct)
- To visualize relationships between different variables
- To compare different types of data
Which of the following is NOT a typical source of data for time series analysis?
Which of the following is NOT a typical source of data for time series analysis?
Which of the following is an example of a time series data set?
Which of the following is an example of a time series data set?
What is the range of the parts cost for the 50 tune-ups?
What is the range of the parts cost for the 50 tune-ups?
What is the sample size for the parts cost data?
What is the sample size for the parts cost data?
Which of these is a valid statistical question about the data?
Which of these is a valid statistical question about the data?
What is the relative frequency of the parts cost being $75?
What is the relative frequency of the parts cost being $75?
What is the frequency of the parts cost being $78?
What is the frequency of the parts cost being $78?
What is the difference between the maximum and minimum values in the data set represented by cells A2 to A71?
What is the difference between the maximum and minimum values in the data set represented by cells A2 to A71?
Which Excel function would be used to find the most frequent value in cells A2 to A71?
Which Excel function would be used to find the most frequent value in cells A2 to A71?
Why is it often necessary to use computer software for statistical analysis when dealing with large amounts of data?
Why is it often necessary to use computer software for statistical analysis when dealing with large amounts of data?
Which Excel function is used to calculate the average of the values in cells A2 to A71?
Which Excel function is used to calculate the average of the values in cells A2 to A71?
What is the purpose of using StatTools in statistical analysis using Excel?
What is the purpose of using StatTools in statistical analysis using Excel?
Which of the following is NOT one of the eight topic areas covered by the 67 guidelines?
Which of the following is NOT one of the eight topic areas covered by the 67 guidelines?
Which of the following is NOT a responsibility outlined in the 67 guidelines?
Which of the following is NOT a responsibility outlined in the 67 guidelines?
Which of the following is MOST likely to be addressed in the guidelines regarding 'Responsibilities to Research Subjects'?
Which of the following is MOST likely to be addressed in the guidelines regarding 'Responsibilities to Research Subjects'?
The 67 guidelines are organized into eight topic areas. Which of the following would be LEAST likely to be addressed in one of these topic areas?
The 67 guidelines are organized into eight topic areas. Which of the following would be LEAST likely to be addressed in one of these topic areas?
Which of the following would be MOST likely to be addressed in the section on 'Responsibilities of Employers Including Organizations, Individuals, Attorneys, or Other Clients'?
Which of the following would be MOST likely to be addressed in the section on 'Responsibilities of Employers Including Organizations, Individuals, Attorneys, or Other Clients'?
What type of statistical study is a survey?
What type of statistical study is a survey?
In a study of smokers and nonsmokers, what type of statistical study is being conducted?
In a study of smokers and nonsmokers, what type of statistical study is being conducted?
What is the main difference between observational and experimental studies?
What is the main difference between observational and experimental studies?
Which government agency is responsible for collecting data on customer spending?
Which government agency is responsible for collecting data on customer spending?
What kind of data is typically collected by the Bureau of Labor Statistics?
What kind of data is typically collected by the Bureau of Labor Statistics?
The Salk polio vaccine experiment is considered the largest experimental study ever conducted. What makes it an experimental study?
The Salk polio vaccine experiment is considered the largest experimental study ever conducted. What makes it an experimental study?
Which of the following is NOT a characteristic of observational studies?
Which of the following is NOT a characteristic of observational studies?
What is a key advantage of experimental studies over observational studies?
What is a key advantage of experimental studies over observational studies?
Which of the following is a characteristic of descriptive analytics?
Which of the following is a characteristic of descriptive analytics?
Data warehousing is essential for organizations because it enables them to:
Data warehousing is essential for organizations because it enables them to:
Which of the following examples best illustrates the concept of data mining?
Which of the following examples best illustrates the concept of data mining?
Which type of analysis is best suited for determining the impact of a marketing campaign on sales?
Which type of analysis is best suited for determining the impact of a marketing campaign on sales?
Which of the following is NOT a typical source of data for data warehousing?
Which of the following is NOT a typical source of data for data warehousing?
A company wants to optimize its pricing strategy to maximize profits. Which type of analytics would be most appropriate for this task?
A company wants to optimize its pricing strategy to maximize profits. Which type of analytics would be most appropriate for this task?
Which of the following is NOT a benefit of data warehousing?
Which of the following is NOT a benefit of data warehousing?
Which of the following is an example of a decision that might be made based on data mining analysis of data in a warehouse?
Which of the following is an example of a decision that might be made based on data mining analysis of data in a warehouse?
Flashcards
Time Series Data
Time Series Data
Data points indexed in time order, showing values over a specified period.
Purpose of Time Series Graphs
Purpose of Time Series Graphs
Helps analysts understand past events, identify trends, and project future values.
Trend Identification
Trend Identification
Observing patterns in time series data to recognize consistent movements over time.
Data Sources for Analysis
Data Sources for Analysis
Signup and view all the flashcards
Internal Company Records
Internal Company Records
Signup and view all the flashcards
Sample Data
Sample Data
Signup and view all the flashcards
Parts Cost Analysis
Parts Cost Analysis
Signup and view all the flashcards
Mean Cost Calculation
Mean Cost Calculation
Signup and view all the flashcards
Data Sample Size
Data Sample Size
Signup and view all the flashcards
Statistical Data Representation
Statistical Data Representation
Signup and view all the flashcards
Budget Data
Budget Data
Signup and view all the flashcards
Bureau of Labor Statistics
Bureau of Labor Statistics
Signup and view all the flashcards
Observational Studies
Observational Studies
Signup and view all the flashcards
Survey Example
Survey Example
Signup and view all the flashcards
Experimental Studies
Experimental Studies
Signup and view all the flashcards
Polio Vaccine Experiment
Polio Vaccine Experiment
Signup and view all the flashcards
Variable of Interest
Variable of Interest
Signup and view all the flashcards
Controlled Variables
Controlled Variables
Signup and view all the flashcards
Statistical Software
Statistical Software
Signup and view all the flashcards
Excel Data Sets
Excel Data Sets
Signup and view all the flashcards
Data Entry
Data Entry
Signup and view all the flashcards
Functions in Excel
Functions in Excel
Signup and view all the flashcards
AVERAGE Function
AVERAGE Function
Signup and view all the flashcards
MEDIAN Function
MEDIAN Function
Signup and view all the flashcards
MODE Function
MODE Function
Signup and view all the flashcards
Range Calculation
Range Calculation
Signup and view all the flashcards
Professionalism Guidelines
Professionalism Guidelines
Signup and view all the flashcards
Responsibilities to Funders
Responsibilities to Funders
Signup and view all the flashcards
Responsibilities to Research Subjects
Responsibilities to Research Subjects
Signup and view all the flashcards
Responsibilities Regarding Misconduct
Responsibilities Regarding Misconduct
Signup and view all the flashcards
Responsibilities of Employers
Responsibilities of Employers
Signup and view all the flashcards
Analytics
Analytics
Signup and view all the flashcards
Descriptive Analysis
Descriptive Analysis
Signup and view all the flashcards
Predictive Analysis
Predictive Analysis
Signup and view all the flashcards
Prescriptive Analysis
Prescriptive Analysis
Signup and view all the flashcards
Data Warehousing
Data Warehousing
Signup and view all the flashcards
Transaction Volume
Transaction Volume
Signup and view all the flashcards
Data Mining
Data Mining
Signup and view all the flashcards
Decision Support
Decision Support
Signup and view all the flashcards
Study Notes
Modern Business Statistics (6e)
- Published in 2018 by Cengage Learning
- Authors: Anderson, Sweeney, Williams, Camm, Cochran
- Includes Microsoft Excel® functionalities
Chapter 1: Data and Statistics
- Covers Statistics, applications in business and economics, data sources, descriptive statistics, statistical inference, statistical analysis using Microsoft Excel, data mining, and ethical guidelines for statistical practice.
What is Statistics?
- Statistics can be numerical facts (averages, medians, percentages, and maximums) for understanding business/economic situations.
- Statistics is the art and science of collecting, analyzing, presenting, and interpreting data.
Applications in Business and Economics
- Accounting: Public accounting firms use statistical sampling in audits.
- Economics: Economists use statistical information for economic forecasts.
- Finance: Financial advisors use price-earnings ratios and dividend yields for investment advice.
- Marketing: Electronic point-of-sale scanners collect data for marketing research.
- Production: Statistical quality control charts monitor production process output.
- Information Systems: Statistical information helps assess computer network performance.
Data and Data Sets
- Data are collected, analyzed, and summarized for presentation and interpretation.
- All data collected in a specific study is called the data set.
Elements, Variables, and Observations
- Elements are the entities from which data are collected.
- Variables are characteristics of interest for elements.
- Observations are the set of measurements from a specific element.
- A data set with 'n' elements has 'n' observations.
- Total data values = number of elements x number of variables.
Data, Data Sets, Elements, Variables, and Observations (example)
- Data presented in table format with nations, WTO status, per capita GDP, and Fitch ratings.
Scales of Measurement
- Scales of measurement determine the information contained in data.
- The scale defines appropriate summarization and statistical analyses.
- Scales include: nominal, ordinal, interval, and ratio.
Nominal Scale
- Data are labels or names to identify an attribute of the element.
- Non-numeric labels or numeric codes may be used.
- (Example given in text: WTO status categories “member” and “observer”.)
Ordinal Scale
- Data has the characteristics of nominal data, with order or rank being meaningful.
- Non-numeric labels or numeric codes may be used. (Example given in the text: Fitch ratings from AAA to F.)
Interval Scale
- Data has characteristics of ordinal data, with intervals between observations expressed in a fixed unit of measure.
- Interval data is always numeric. (Example given in the text: SAT scores.)
Ratio Scale
- Data has all interval properties, and the ratio of two values is meaningful.
- Variables such as distance, height, weight, and time use ratio scale.
- Zero value indicates nothing exists for the variable. (Example given in the text compares credit hours.)
Categorical and Quantitative Data
- Categorical data are labels/names for attributes of elements.
- Quantitative data indicate "how many" or "how much". Discrete measures "how many," and continuous data measures "how much".
- Statistical analysis depends on whether data is categorical or quantitative; quantitative data has more analysis alternatives.
Categorical Data
- Labels/names to identify attributes of elements.
- Often referred to as qualitative data.
- Use nominal or ordinal scale of measurement.
- Can be numeric or non-numeric.
- Statistical analysis is limited.
Quantitative Data
- Data indicates "how many" or "how much".
- Discrete data measures "how many" and continuous data measures "how much".
- Always numeric.
- Ordinary arithmetic operations are meaningful.
Scales of Measurement (Diagram)
- A hierarchical diagram showing the relationships between data types (categorical and quantitative) and numerical scales (nominal, ordinal, interval, and ratio).
Cross-Sectional Data
- Data collected at the same or approximately the same point in time.
- (Example provided: data on various variables for WTO nations.)
Time Series Data
- Data collected over several time periods.
- (Example provided: U.S. average price of gasoline between 2010 and 2015.)
- (Graph presented: shows time series data.)
Data Sources
- Existing Sources: Internal company records (almost any department), business database services (like Dow Jones), government agencies (like the U.S. Department of Labor), industry associations, special interest organizations (GMAT), and the internet.
- Data Available from Internal Company Records: Employee records, production records, inventory records, sales records, credit records, customer profiles (with associated data).
- Data Available from Selected Government Agencies: Census Bureau data (population, households, income), Federal Reserve data (money supply, exchange rates, discount rates), Office of Management and Budget data (federal government revenue/expenditures/debt), Department of Commerce and Bureau of Labor Statistics data (business activity, profit/industry data, customer spending/unemployment/earnings/safety records.
- Statistical Studies – Observational: No attempt to control or influence variables of interest. Surveys are a good example.
- Statistical Studies – Experimental: Variable of interest identified first. Then, other variables are identified and controlled to see how they influence the variable of interest. The 1954 Public Health Service polio vaccine experiment is cited as an example.
Data Acquisition Considerations
- Time Requirement: Searching for data and retrieving it can be time-consuming. Data might be outdated.
- Cost of Acquisition: Organizations may charge for data even if not their primary business.
- Data Errors: Using any available data without care can lead to misleading information.
Descriptive Statistics
- Most statistical information in publications/magazines/reports summarizes and presents data for easy understanding.
- These data summaries are called descriptive statistics; these are frequently tabular, graphical, or numerical.
Example: Hudson Auto Repair
- The manager wants to understand the cost of parts used for engine tune-ups.
- 50 customer invoices are examined.
- Costs of parts (rounded to the nearest dollar) are provided in a table.
Tabular Summary: Frequency and Percent Frequency (Example)
- Provides a frequency distribution for the parts costs from the Hudson Auto Repair example.
Graphical Summary: Bar Chart (Example)
- Visualizes the data from the Hudson Auto Repair example using a bar chart.
Numerical Descriptive Statistics
- The mean, the most common descriptive statistic, is the average.
- The mean describes the central tendency or location of data.
- For the Hudson Auto example, the mean cost of parts is $79.
Statistical Inference
- Processes of using data from a sample to make inferences/estimates about population characteristics.
- This may include making estimates or testing hypotheses.
- Concepts of population, sample, census, and sample survey are outlined. (Example in the text: Hudson Auto determining an average cost from a sample.)
Statistical Analysis Using Microsoft Excel
- Statisticians use software for statistical computations in large datasets.
- Datasets in the textbook are in Microsoft Excel format.
- The StatTools Excel add-in is available for download. (The document shows examples of how Excel functions are used.)
Analytics
- Scientific process of transforming data into insights for better decisions.
- Types of analytics:
- Descriptive: Describes what happened in the past.
- Predictive: Uses past data to predict future.
- Prescriptive: Yields a best course of action to take.
Data Warehousing
- Organizations accumulate huge amounts of data daily (e.g., Wal-Mart, Visa transactions).
- Data is captured, stored, and maintained—called data warehousing.
Data Mining
- Analysis of data in a warehouse aids in decision-making and generates higher profits.
- Data is used with procedures from statistics, mathematics, and computing which convert it into useful information.
- Data mining systems identify relationships, and predict future outcomes.
Data Mining Applications
- Primarily used by consumer-focused companies in fields like retail, finance, and communication.
- Useful for identifying related products, creating pop-ups based on previous purchases, and recommending discount offers based on customer purchasing volume.
Data Mining Requirements
- Statistical and computer science methodologies are used (multiple regression, correlation, artificial intelligence, machine learning).
- Significant investment in time and resources.
Data Mining Model Reliability
- A model that works well for a specific sample may not reliably apply to other data.
- Careful data partitioning (training and test sets) is used for model improvement and validation.
- Overfitting is a concern; careful interpretation and rigorous testing is important.
Ethical Guidelines for Statistical Practice
-
Unethical behavior in statistical studies takes various forms (e.g., improper sampling, misleading graphs, statistical result misinterpretations).
-
Statistical practitioners should strive for fairness, objectivity and neutrality in data collection, analysis, and presentation.
-
Consumers of statistics must be mindful of potential unethical behavior in statistical studies.
-
The American Statistical Association has guidelines for statistical practice.
End of Chapter 1
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.