Introduction to Statistics
61 Questions
0 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is meant by statistics?

Statistics is a branch of mathematics that has applications in almost every facet of our daily life. Statistical methods provide powerful tools to collect, analyze, summarize, and interpret data to reach informed decisions.

Explain descriptive statistics.

Descriptive statistics is the science of summarizing and describing the important characteristics of a set of data.

Explain inferential statistics.

Inferential statistics is the science of using information obtained from a sample to infer about the characteristics of a population of interest.

What is a population in statistics?

<p>A population is a set of the entire collection of units (individuals, objects).</p> Signup and view all the answers

What is a sample in statistics?

<p>A sample is a subset of the population selected in some prescribed manner.</p> Signup and view all the answers

Define a variable in statistics.

<p>A variable is a characteristic that changes or varies over time and/or for different individuals or objects under consideration.</p> Signup and view all the answers

Define an experimental unit.

<p>An experimental unit is the individual or object on which a variable is measured.</p> Signup and view all the answers

How do we obtain a data value?

<p>A single measurement or data value is obtained when a variable is actually measured on an experimental unit.</p> Signup and view all the answers

What type of data is obtained when a single variable is measured on a single experimental unit?

<p>Univariate data (B)</p> Signup and view all the answers

What type of data is obtained when two variables are measured on a single experimental unit?

<p>Bivariate data (C)</p> Signup and view all the answers

What type of data is obtained when more than two variables are measured?

<p>Multivariate data (B)</p> Signup and view all the answers

What type of variable is measured on each experimental unit?

<p>Qualitative variable (A)</p> Signup and view all the answers

What are some examples of qualitative variables?

<p>Political affiliation: Liberals, Conservatives, NDP, Green, Independent, The colour of a car, The size of T-shirts: XS, S, M, L, XL, Racers classified as coming in 1st, 2nd, 3rd, etc.</p> Signup and view all the answers

What type of variable measures a numerical quantity or amount on each experiment unit?

<p>Quantitative variable (A)</p> Signup and view all the answers

What are the two types of values that quantitative variables can assume?

<p>Continuous and Discrete (A)</p> Signup and view all the answers

Define a discrete variable.

<p>Discrete variables can assume only a finite or countable number of values; x=0,1,2,3,.....</p> Signup and view all the answers

Define a continuous variable.

<p>Continuous variables can assume the infinitely many values corresponding to the points on a line interval; 0&lt;x&lt;5.</p> Signup and view all the answers

The most frequent use of your microwave oven (reheating, defrosting, warming, other). Identify this variable as quantitative or qualitative.

<p>Qualitative (B)</p> Signup and view all the answers

The number of consumers who refuse to answer a telephone survey. Identify this variable as quantitative or qualitative.

<p>Quantitative (B)</p> Signup and view all the answers

The door chosen by a mouse in a maze experiment (A,B, or C). Identify this variable as quantitative or qualitative.

<p>Qualitative (A)</p> Signup and view all the answers

The winning time for a horse running at the Woodbine racetrack, Toronto. Identify this variable as quantitative or qualitative.

<p>Quantitative (A)</p> Signup and view all the answers

The number of children in a fifth-grade class who are reading at or above grade level. Identify this variable as quantitative or qualitative.

<p>Quantitative (A)</p> Signup and view all the answers

Average daily temperature for a small city in Quebec during a summer month. Identify this variable as discrete or continuous.

<p>Continuous (A)</p> Signup and view all the answers

Number of bees on a flower. Identify this variable as discrete or continuous.

<p>Discrete (B)</p> Signup and view all the answers

Driving time between Regina, Saskatchewan and Winnipeg, Manitoba. Identify this variable as discrete or continuous.

<p>Continuous (B)</p> Signup and view all the answers

Number of passengers (excluding the airline staff) on a flight from Edmonton to Vancouver. Identify this variable as discrete or continuous.

<p>Discrete (B)</p> Signup and view all the answers

Amount of propane gas for a BBQ cylinder filled at Costco in Montreal. Identify this variable as discrete or continuous.

<p>Continuous (A)</p> Signup and view all the answers

What are the two best ways to present qualitative/categorical data?

<p>Bar Chart and Pie Chart (C)</p> Signup and view all the answers

When presenting qualitative data, what is a statistical table?

<p>A statistical table is a list of the categories along with a measure of how often each category occurred.</p> Signup and view all the answers

What are the three ways the occurrence of qualitative data can be measured?

<p>Frequency, Relative Frequency, and Percentage (A)</p> Signup and view all the answers

Explain what frequency is.

<p>Frequency refers to the number of measurements in each specified category.</p> Signup and view all the answers

Explain what relative frequency is.

<p>Relative frequency is the proportion of measurements in each specified category.</p> Signup and view all the answers

Explain what percentage is.

<p>Percentage is the amount of a given quantity as a fraction of a hundred.</p> Signup and view all the answers

What does a bar chart show?

<p>A bar chart shows the distribution of measurements in categories, with the height of the bar measuring how often a particular category was observed..</p> Signup and view all the answers

In a survey concerning public education, 400 school administrators were asked to rate the quality of education in Canada. Their responses are summarized in the following table. Construct a pie chart and a bar chart for this set of data.

Rating Frequency
A 35
B 260
C 93
D 12
Total 400

<p>The bar chart would show each rating category on the x-axis and the frequency (number of administrators) on the y-axis, with each bar height representing the corresponding frequency. For example, a bar for 'A' would reach to the height of 35 on the y-axis. The pie chart would divide a circle into four sections, each representing a rating. The size of each section would be proportional to the frequency of that rating. For example, the section for 'B' would be much larger than the section for 'D' because 'B' has a much higher frequency.</p> Signup and view all the answers

Construct a bar chart for the following data. Brown Green Brown Blue Red Red Green Brown Yellow Orange Green Blue Brown Blue Blue Brown Orange Blue Brown Orange Yellow

<p>The bar chart would have six bars representing the colors: brown, green, red, yellow, orange, and blue. The height of each bar would represent the number of times each color appears in the data. For example, the bar for &quot;Brown&quot; would be the tallest, reaching to a height of 6 (since &quot;Brown&quot; appears 6 times in the data).</p> Signup and view all the answers

What are pie charts and bar charts used for in quantitative data?

<p>Pie charts and bar charts are used for quantitative data that are measured on different segments of the population, or different categories of classification.</p> Signup and view all the answers

Give an example of quantitative data that can be represented by pie charts and bar charts.

<p>An example would be the average income for people of different age groups, different genders, or living in different geographic areas of the country.</p> Signup and view all the answers

The following is the amount of money estimated for the fiscal year 2002-2003 budget ($billions) by the Department of National Defence, Government of Canada. Construct both a pie and a bar chart.

Category Amount
Maritime forces 2,053.21
Land forces 3,181.33
Air forces 2,828.76
Joint operations and civil emergency preparedness 1,086.31
Communications and information management 304.02
Support to the personal function 860.85
Materiel, infrastructure, and environment support 754.08
DND/CF executive 786.24
Total 11,834.80

<p>The bar chart would have bars representing each category on the x-axis and the amount (in billions of dollars) on the y-axis. The height of each bar would correspond to the amount allocated to that category. The pie chart would divide a circle into sections representing each category, with the size of each section proportional to the amount allocated to that category. For example, the section for &quot;Land forces&quot; would be the largest, as it has the highest amount allocated.</p> Signup and view all the answers

What are line charts used for?

<p>Line charts are used when a quantitative variable is measured over time at equally spaced intervals (daily, weekly, monthly, quarterly, or yearly).</p> Signup and view all the answers

What are time series?

<p>Time series are data that are measured over time at equally spaced intervals.</p> Signup and view all the answers

The following are Statistics Canada projections for age group 65-69. Construct a line chart to illustrate the data.

Year Population (Thousands)
2025 2,375.7
2026 2,395.2
2027 2,428.0
2028 2,472.4
2029 2,512.7
Year Population (Thousands)
------- --------
2030 2,553.4
2031 2,581.3
2032 2,599.5
2033 2,605.9
2034 2,608.5

<p>The line chart would have the year on the x-axis and the population (in thousands) on the y-axis. Each year would have a corresponding point representing the population for that year, and the points would be connected by a line. This line would demonstrate the trend in population projections over time for age group 65-69.</p> Signup and view all the answers

What are dotplots used for?

<p>Dotplots are used for small data sets which cannot easily be separated into categories or intervals of time.</p> Signup and view all the answers

Why are dotplots not easily interpretable for large datasets?

<p>For large data sets, dotplots can be difficult to interpret because of potential overcrowding and an excessive number of dots, making it hard to discern patterns.</p> Signup and view all the answers

Construct a dotplot for the following data: 2,6,9,3,7,6.

<p>The dotplot would have a number line from 2 to 9. There would be a dot above the number 2, two dots above the number 6, one dot above the number 3, one dot above the number 9, and one dot above the number 7, visually representing the frequency of each data value.</p> Signup and view all the answers

What are the steps for constructing a stem and leaf plot?

<p>Order the data from smallest to largest, Divide each observation/measurement into a stem (all digits of the observation except the least significant digit) and a leaf (the least significant digit of the observation), List the stems in a column, with a vertical line to their right, Put the leaf portion for each observation in the same row as its corresponding stem, and State your leaf unit.</p> Signup and view all the answers

The table below lists test completion time (in hours) for a 2-hour test. Construct a stem and leaf plot to display the distribution of the data. 1.89 1.92 1.9 1.95 1.5 1.6 1.95 1.97 1.96 1.9 1.8 1.95 1.95 1.45 1.33 1.95 1.96 1.77

<p>The stem-and-leaf plot for the data presented has stems representing the whole parts of the completion times (1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2), and leaves indicating the decimal parts of the completion times. For example, the first entry, 1.89, would have a stem of 1 and a leaf of 8. So, the plot would arrange the data like this: 1 | 3 4 5 5 5 5 6 7 8 9 9 9 9 9 2 | 0 0 0 0 0 0 0 0 0</p> Signup and view all the answers

What happens when a stem-and-leaf plot has too few stems and too many leaves?

<p>If a stem-and-leaf plot has too few stems and too many leaves, the data may be difficult to visualize and interpret. This is because the plot may become too crowded or spread out, making it hard to analyze the distribution of the data.</p> Signup and view all the answers

What are the two ways to divide stems in a stem-and-leaf plot when there are too few stems and too many leaves?

<p>Stems can be divided in one of two ways: into two lines, with leaves 0-4 and 5-9, or into five lines, with leaves 0-1, 2-3, 4-5, 6-7, 8-9.</p> Signup and view all the answers

Consider the following two plots for the same data set. | 99
2 0 1 1 2 3 4 4 5 5 5 5 5 6 6 7 7 7 7 7 8 8 9 9 Leaf unit=0.1
3 0 0 1 1 4

1 9 9
2 0 1 1
2 2 3
2 4 4 5 5 5 5 5 Leaf unit=0.1
2 6 6 7 7 7 7 7
2 8 8 9 9
3 0 0 1 1
3 4
Why would these two stem and leaf plots be constructed for the same data?

<p>These two stem-and-leaf plots are constructed for the same data set because the stems are divided differently in each plot to address the issue of having too few stems and too many leaves. The first plot has stems divided into three lines, while the second plot has stems divided into five lines. This different division of stems results in different representations of the data.</p> Signup and view all the answers

What is a histogram and what is it used for?

<p>A histogram is a graphical representation of the distribution of quantitative data, where the data is grouped into categories or classes, and the height of each bar represents the frequency or relative frequency of values in that class.</p> Signup and view all the answers

What are the rules for choosing classes in a histogram?

<p>Classes in a histogram should be mutually exclusive, meaning no data point belongs to more than one class, and they should be exhaustive, meaning all data points belong to one of the classes.</p> Signup and view all the answers

How many classes should a histogram typically have?

<p>As a rule of thumb, the number of classes in a histogram should range between 5 and 12, but the optimal number may vary depending on the specific data and the size of the dataset.</p> Signup and view all the answers

The following is grade point averages of 30 first-year university students. Construct a histogram using frequencies and using relative frequencies. 9.9 9.9 10 10.1 10.1 10.2 10.3 10.4 10.4 10.5 10.5 10.5 10.5 10.5 10.6 10.6 10.7 10.7 10.7 10.7 10.7 10.8 10.8 10.9 10.9 11 11 11 11 11 11 11 11 .4

<p>To construct a histogram with frequencies, group the GPA data into classes of equal size (e.g., 9.5-10.0, 10.0-10.5, 10.5-11.0, 11.0-11.5), count the number of GPAs within each class, and represent those counts as bars on the histogram. For relative frequencies, calculate the frequency of each class as a proportion of the total number of students. For example, the class 10.5-11.0 would have a frequency of 12 and a relative frequency of 12/30.</p> Signup and view all the answers

What does the height of each bar in a histogram represent?

<p>The height of each bar in a histogram represents the frequency or the relative frequency of values falling into that particular class.</p> Signup and view all the answers

What does it mean when adjacent bars in a histogram share sides?

<p>When adjacent bars in a histogram share sides, it means that the leftmost value of each bar is included in that class, indicating that there is no gap between the categories.</p> Signup and view all the answers

What are outliers in a graph?

<p>Outliers are individual values that fall outside the overall pattern of the data. These are unusually large or small data values.</p> Signup and view all the answers

What are the three key aspects of a graph's overall pattern?

<p>The three key aspects of a graph's overall pattern are its shape, centre, and spread.</p> Signup and view all the answers

Describe a symmetric distribution.

<p>A distribution is symmetric if the right and left sides of the distribution form mirror images.</p> Signup and view all the answers

Describe a skewed distribution.

<p>A distribution is skewed to the right if a greater proportion of the measurements lie to the right of the distribution peak, while a distribution is skewed to the left if a greater proportion of the measurements lie to the left of the distribution peak.</p> Signup and view all the answers

Describe a unimodal distribution.

<p>A distribution is unimodal if it has one peak.</p> Signup and view all the answers

Study Notes

Introduction to Statistics

  • Statistics is a branch of mathematics used in various aspects of daily life.
  • Statistical methods help collect, analyze, summarize, and interpret data for informed decision-making.
  • Two main types of statistics exist:
    • Descriptive statistics involves summarizing and describing data characteristics.
    • Inferential statistics uses sample data to infer about a larger population.

Populations and Samples

  • A population is the entire collection of individuals or objects of interest.
  • A sample is a subset of the population.
  • A variable is a characteristic that changes over time or among individuals/objects.
  • An experimental unit is the individual or object on which a variable is measured.
  • A single measurement or data value is obtained from measuring a variable on an experimental unit.

Types of Data

  • Univariate data involves measuring a single variable on a single experimental unit.
  • Bivariate data involves measuring two variables on a single experimental unit.
  • Multivariate data involves measuring more than two variables on a single experimental unit.

Types of Variables

  • Qualitative variables (categorical): represent qualities or characteristics, and measurements fall into categories.
    • Examples: political affiliation, color of a car, size of T-shirts, race finish position.
  • Quantitative variables: represent numerical quantities or amounts.
    • Examples: prime interest rate, weight, volume of orange juice.
    • Quantitative variables can be:
      • Discrete: can only take on a finite or countable number of values (e.g., number of bees on a flower).
      • Continuous: can take on any value within a given range (e.g., average daily temperature).

Graphs for Qualitative Data

  • Bar charts and pie charts are used to visually display qualitative data.
  • Frequency, relative frequency, and percentages can be used to measure the occurrences of each category.
  • In a bar chart, height of a bar represents how often a particular category occurred

Graphs for Quantitative Data

  • Pie charts and bar charts are used to display quantitative data that are segmented or categorized by population.
    • Examples include average income, age groups, geographical regions.

Line Charts

  • Line charts are used when a quantitative variable is measured over time at equally spaced intervals (daily, weekly, monthly, quarterly, yearly).
  • This type of data is referred to as a time series.

Dotplots

  • Used for small datasets that cannot be categorized.
  • Not ideal for large datasets as they lack interpretability.

Stem-and-Leaf Plots

  • Used to display the distribution of data by dividing each observation into a stem (all digits except the least significant digit) and a leaf (least significant digit).
  • Useful for visualizing data distribution.
  • Leaf units should be specified clearly, like 0.1.

Histograms

  • Visual representation of the distribution of quantitative data.
  • Variable is grouped into categories (classes) of equal size.
  • Classes should be mutually exclusive (non-overlapping) and exhaustive (cover all measurements).
  • Height of each bar reflects frequency or relative frequency of measurements falling into that class.

Interpreting Graphs

  • Check horizontal and vertical axes to know what is being measured.
  • Analyze overall pattern including shape, central tendency, and spread.
  • Look for deviations like outliers (unusual data points).

Describing the Shape of a Distribution

  • Symmetric: right and left sides mirror each other.
  • Skewed right: a greater proportion of measurements lie to the right of the peak.
  • Skewed left: a greater proportion of measurements lie to the left of the peak.
  • Unimodal: single peak.
  • Bimodal: two peaks.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

Description

Explore the fundamentals of statistics, including its types, populations, samples, and data measurement methods. This quiz covers descriptive and inferential statistics, and the differences between univariate and bivariate data. Perfect for students starting their journey in statistics.

More Like This

Use Quizgecko on...
Browser
Browser