Statistics Chapter 3: Central Tendency

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is the median of the following data set: 10, 2, 7, 14, 5, 9?

  • 8.5
  • 7.5
  • 8 (correct)
  • 7

Given the dataset: 3, 6, 6, 8, 10, 12. Which of the following statements is true?

  • The mean is equal to the mode
  • The median is equal to the mean
  • The mode is greater than the median (correct)
  • The median is greater than the mean

Which of the following best describes a disadvantage of using the median as a measure of central tendency?

  • It is significantly affected by extreme scores.
  • It requires more computational steps than the mean.
  • It may not accurately represent the typical value. (correct)
  • It cannot be applied to categorical data.

Which of the following datasets would have both a mode and a median equal to 9?

<p>7, 8, 9, 9, 10, 11 (C)</p> Signup and view all the answers

In a dataset, the mean is $250, the median is $200 and the mode is $150. Which measure is most resistant to extreme values?

<p>The median (A)</p> Signup and view all the answers

Which measure of central tendency is most susceptible to misrepresenting the center of a distribution due to extreme scores?

<p>Mean (C)</p> Signup and view all the answers

If a dataset has an even number of values, how is the median calculated?

<p>The average of the two middle numbers is derived. (C)</p> Signup and view all the answers

What does the symbol μ, represent in the context of central tendency calculations?

<p>Population mean (C)</p> Signup and view all the answers

Which measure of central tendency provides the most stable estimate of the population mean across repeated samples?

<p>Mean. (B)</p> Signup and view all the answers

Given the set of numbers: 2, 8, 5, 2, 6, what is the mode?

<p>2 (B)</p> Signup and view all the answers

What action is required before calculating the median of a dataset?

<p>List the numbers in ascending order (B)</p> Signup and view all the answers

In the distribution: 3, 3, 6, 8, 9, which value represents the median?

<p>6 (B)</p> Signup and view all the answers

A sample of values is: 10, 20, 30, and 40. What is the sample mean?

<p>25 (B)</p> Signup and view all the answers

Which of the following is an advantage of using the mode as a measure of central tendency?

<p>It can be applied to nominal data. (A)</p> Signup and view all the answers

What does a positively skewed distribution indicate?

<p>The distribution trails off to the right. (C)</p> Signup and view all the answers

In a skewed distribution, where is the median typically located?

<p>Typically but not always between the mean and the mode. (C)</p> Signup and view all the answers

What is a key characteristic of a normal distribution?

<p>Data are symmetrically distributed around the mean, median, and mode. (C)</p> Signup and view all the answers

Which of the following is true about a perfectly normal distribution?

<p>It is perfectly symmetrical and unimodal. (C)</p> Signup and view all the answers

Which measure of central tendency is most appropriate for normally distributed data?

<p>The mean. (D)</p> Signup and view all the answers

Which measure of central tendency is most appropriate for modal distributions?

<p>The mode. (A)</p> Signup and view all the answers

What does the concept of 'dispersion' refer to in a distribution?

<p>How scores are spread out on the x-axis. (A)</p> Signup and view all the answers

What does the range of a dataset represent?

<p>The difference between the highest and lowest scores. (D)</p> Signup and view all the answers

Why might the range be less informative for a dataset containing outliers?

<p>Because outliers significantly inflate the range, creating a misleading representation of score distribution. (C)</p> Signup and view all the answers

What is variance?

<p>The average squared distance that scores deviate from their mean. (B)</p> Signup and view all the answers

What is the sum of squares (SS)?

<p>The sum of the squared deviations of scores from their mean. (B)</p> Signup and view all the answers

In the formula for sample variance, why is the sum of squares (SS) divided by N-1, instead of N?

<p>To account for the degrees of freedom in the sample. (C)</p> Signup and view all the answers

What are degrees of freedom (df)?

<p>The number of independent pieces of information that are free to vary, minus the number of mathematical restrictions. (C)</p> Signup and view all the answers

If three numbers add up to 20, and two of the numbers are 5 and 8, how many degrees of freedom are there?

<p>2 (B)</p> Signup and view all the answers

Given the following numbers: 2, 4, 6, and 8, what is the range?

<p>6 (A)</p> Signup and view all the answers

What is the relationship between variance and standard deviation?

<p>Standard deviation is the square root of the variance. (B)</p> Signup and view all the answers

Why are computational formulas preferred over definitional formulas for calculating variance?

<p>Definitional formulas are more prone to rounding errors. (C)</p> Signup and view all the answers

The standard deviation is described as:

<p>the average distance that scores deviate from their mean (C)</p> Signup and view all the answers

In the provided data, what does a higher rating indicate?

<p>More attractiveness (C)</p> Signup and view all the answers

How many scores are there in Set 4 based on the provided information?

<p>8 (B)</p> Signup and view all the answers

What is the purpose of calculating the standard deviation?

<p>To measure the dispersion of scores around the mean (A)</p> Signup and view all the answers

According to the given information, what is the standard deviation of Set 4?

<p>1.69 (A)</p> Signup and view all the answers

Which of the following formulas represents the sample standard deviation?

<p>$ \sqrt{\frac{\sum x - \frac{(\sum x)^2}{N}}{N-1}}$ (C)</p> Signup and view all the answers

Flashcards

Central Tendency

A measure that represents the center of a dataset. Examples include mean, median, and mode.

Mean

The sum of all scores divided by the number of scores. Also known as the average.

Median

The middle value in a dataset that has been arranged in ascending order. For even datasets, it's the average of the two middle values.

Mode

The value that appears most frequently in a dataset.

Signup and view all the flashcards

Advantages of the Mean

The sample mean is a more consistent estimator of the population mean than the sample mode or median.

Signup and view all the flashcards

Disadvantages of the Mean

The mean can be skewed by outliers or extreme values, leading to misrepresentation of the center.

Signup and view all the flashcards

Median (advantage)

A measure of central tendency that is not easily influenced by outliers.

Signup and view all the flashcards

Mode (disadvantage)

While the mode gives the most frequent value, it is less informative than the mean and median for continuous data.

Signup and view all the flashcards

Median's Advantage

A measure of central tendency not influenced by extreme values. It represents the 'typical' value in a dataset, focusing on the middle data points.

Signup and view all the flashcards

Median's Disadvantage

A measure of central tendency that can be misleading when data has extreme values. It might not accurately represent the 'typical' value.

Signup and view all the flashcards

Positively Skewed Distribution

A distribution that trails off to the right, with more scores on the lower end than on the higher end.

Signup and view all the flashcards

Negatively Skewed Distribution

A distribution that trails off to the left, with more scores on the higher end than on the lower end.

Signup and view all the flashcards

Normal Distribution

A theoretical distribution where data is perfectly symmetrical around the mean, median, and mode.

Signup and view all the flashcards

Kurtosis

The degree to which a distribution is pointed or flat.

Signup and view all the flashcards

Variability

The spread of scores in a distribution.

Signup and view all the flashcards

Range

The difference between the highest and lowest scores in a distribution.

Signup and view all the flashcards

Variance

A measure of variability that describes the average squared deviation of scores from the mean.

Signup and view all the flashcards

Standard Deviation

A measure of variability that describes the average deviation of scores from the mean, expressed in the same units as the original scores.

Signup and view all the flashcards

Outlier

An extreme score that falls significantly above or below most other scores in a dataset.

Signup and view all the flashcards

Sum of Squares (SS)

The sum of the squared deviations of each score from the mean. It's the numerator in the variance formula.

Signup and view all the flashcards

Population Variance Formula

The formula used to calculate the variance of a population. Sigma squared (σ²) is the symbol for population variance.

Signup and view all the flashcards

Sample Variance Formula

The formula used to calculate the variance of a sample. S² represents sample variance.

Signup and view all the flashcards

Degrees of Freedom (df)

The number of independent pieces of information in a sample that are free to vary, minus the number of restrictions. It's used in the sample variance formula.

Signup and view all the flashcards

Population Standard Deviation (σ)

The average distance of scores from the mean in a population.

Signup and view all the flashcards

Sample Standard Deviation (SD)

The average distance of scores from the mean within a sample.

Signup and view all the flashcards

Computational Formulas for Variance

The computational formulas for population variance and sample variance.

Signup and view all the flashcards

Langlois & Roggman (1990) study

A study showing average faces are considered more attractive than non-average faces.

Signup and view all the flashcards

Calculating SD for Set 4

The calculations used to determine the standard deviation of Set 4 in the Langlois & Roggman (1990) study.

Signup and view all the flashcards

SD of Set 4

The standard deviation of Set 4 (the average face) in the Langlois & Roggman (1990) study was 1.69.

Signup and view all the flashcards

Study Notes

Chapter 3: Central Tendency

  • Central tendency measures the center of a distribution. Examples include mean, median, and mode.
  • Scientists, therapists, and educators frequently need to understand the central tendency of data. For example, finding the average number of symptoms in a disorder, or the most frequent symptom.

Mean

  • The mean is the sum of scores divided by the number of scores. This is also known as the average.
  • Population mean (μ): μ = Σx/N where μ is the Greek letter mu, Σx is the sum of all scores, and N is the population size.
  • Sample mean (M): M = Σx/N where N is the size of the sample.
  • The mean may be misleading if there are extreme values (outliers) in the data. For example, a psychotherapy school might claim a mean hourly rate of $500, but if a few psychotherapists charge $2,100/hour and the others are in the $100 range, the average will be inflated. The median will be more accurate in cases like this.

Median

  • The median is the middle number in an ordered set of numbers.
  • For odd-sized sets, it is the middle value after ordering the data. For even-sized sets it is the average of the two middle values.
  • Example: Find the median of the odd set 1, 0, 5, 4, 6; Order the numbers: 0, 1, 4, 5, 6; The middle number (3rd number) is 4.
  • Example: Find the median of the even set 2, 8, 0, 6, 4, 5; Order the numbers: 0, 2, 4, 5, 6, 8; The middle two numbers are 4 and 5; The average of these two numbers is 4.5.
  • The median is less sensitive to outliers than the mean.

Mode

  • The mode is the most frequently occurring score or value.
  • Example: Find the mode for the data 1, 2, 2, 2, 3, 4. The mode is 2.
  • Example: Find the mode for the data 1, 2, 2, 3, 4, 4. The modes are 2 and 4.
  • Example: Find the mode for the data blue, blue, pink, pink, gray, gray, gray. The mode is gray.
  • The mode can only be determined from nominal scale data.

Describing Distributions

  • Graphed distributions can vary in skew (symmetry) and kurtosis (pointedness).
  • Positively skewed distribution: the tail of the distribution trails off to the right.
  • Negatively skewed distribution: the tail of the distribution trails off to the left.
  • Normal distribution: a symmetrical distribution where the mean, median, and mode are all located at the center.

The Empirical Rule

  • For normally distributed data:
    • Approximately 68% of the data falls within 1 standard deviation of the mean.
    • Approximately 95% of the data falls within 2 standard deviations of the mean.
    • Approximately 99.7% of the data falls within 3 standard deviations of the mean.

Chapter 4: Variability

  • Variability describes how dispersed or spread out data points are.
  • Range: the difference between the highest and lowest score. The range is helpful if there are no major outliers.
  • Variance: the average squared distance that scores deviate from their mean.
  • Standard Deviation: the average distance that scores deviate from their mean. (The standard deviation is the square root of the variance).
  • The Empirical Rule is used to understand the proportion of data points falling within specific ranges of the mean using the standard deviation.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

Chapter 3 & 4 PDF

More Like This

Central Tendency: Mode, Median, and Mean
45 questions
Central Tendency: Mode, Median, and Mean
45 questions
Use Quizgecko on...
Browser
Browser