Week 2: descriptive statistics

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

A researcher is studying the effect of different teaching methods on student test scores. They calculate descriptive statistics for each group. What is the primary purpose of using descriptive statistics in this context?

  • To determine if the different teaching methods have a statistically significant effect on test scores.
  • To generalize the findings to a larger population of students.
  • To predict future test scores based on the teaching methods used.
  • To summarize and describe the test score data for each teaching method group. (correct)

In a dataset of student ages, which measure of central tendency would be most affected by the presence of a few extremely old students (outliers)?

  • Mean (correct)
  • Mode
  • Interquartile Range
  • Median

A data set includes the following values: 2, 3, 3, 4, 5, 6, 7, 8, 9. What is the median of this data set?

  • 3
  • 5.44
  • 5 (correct)
  • 6

For a symmetrical distribution, which of the following statements is true regarding the relationship between the mean, median, and mode?

<p>The mean, median, and mode are all equal. (D)</p> Signup and view all the answers

A researcher wants to quickly determine the spread of scores in a dataset. Which measure of dispersion is simplest to calculate?

<p>Range (A)</p> Signup and view all the answers

A dataset of test scores has a mean of 75 and a standard deviation of 5. Approximately what percentage of scores fall between 65 and 85, assuming a normal distribution?

<p>95% (B)</p> Signup and view all the answers

Which of the following is LEAST likely to be influenced by outliers in a data set?

<p>Median (B)</p> Signup and view all the answers

A researcher measures the happiness level of individuals on a scale of 1 to 10. What type of variable is 'happiness level' in this scenario?

<p>Ordinal (B)</p> Signup and view all the answers

What does a large standard deviation indicate about a dataset?

<p>The data points are spread out over a wider range of values. (D)</p> Signup and view all the answers

A researcher is analyzing the distribution of income in a city. The distribution has a long tail extending towards the higher income values. What type of skew does this distribution exhibit?

<p>Positive Skew (A)</p> Signup and view all the answers

The interquartile range (IQR) is a measure of:

<p>The spread of the middle 50% of the data. (D)</p> Signup and view all the answers

For a normally distributed dataset, approximately what percentage of data points fall within one standard deviation of the mean?

<p>68% (A)</p> Signup and view all the answers

A set of test scores is as follows: 60, 70, 70, 80, 90. What is the mode of this set of scores?

<p>70 (A)</p> Signup and view all the answers

Which of the following is a measure of dispersion?

<p>Standard Deviation (C)</p> Signup and view all the answers

If a distribution has a kurtosis value greater than zero, it is called:

<p>Leptokurtic (D)</p> Signup and view all the answers

A student scores 80 on a test. The mean score is 70, and the standard deviation is 5. What is the student's z-score?

<p>2 (C)</p> Signup and view all the answers

What type of statistics would be used to analyze the relationship between the type of note-taking method (longhand vs. laptop) and exam performance?

<p>Inferential Statistics (A)</p> Signup and view all the answers

Which type of variable is 'eye color' (e.g., blue, brown, green)?

<p>Nominal (C)</p> Signup and view all the answers

If a distribution is negatively skewed, which of the following relationships between the mean, median, and mode is most likely?

<p>Mode &gt; Median &gt; Mean (A)</p> Signup and view all the answers

A researcher wants to compare the variability in exam scores between two different classes. Which statistic would be most appropriate for this purpose?

<p>Standard Deviation (A)</p> Signup and view all the answers

Flashcards

Descriptive statistic

A type of statistic that describes data through central tendency, dispersion, skew, and kurtosis.

Central tendency

A measure of the 'average' value in a dataset.

Dispersion Statistic

A measure of how spread out the values are in a dataset.

Mean

The average of a set of values; sum all values and divide by the number of values.

Signup and view all the flashcards

Median

The middle value of a set of ordered values.

Signup and view all the flashcards

Mode

The most frequently occurring value in a dataset.

Signup and view all the flashcards

Range

Largest value minus the smallest value in a dataset.

Signup and view all the flashcards

Interquartile range (IQR)

Difference between the 75th and 25th percentile; range spanned by the middle half of the data.

Signup and view all the flashcards

Variance

Mean squared deviation; average of the squared differences from the mean.

Signup and view all the flashcards

Standard Deviation

The root mean square deviation; square root of the variance.

Signup and view all the flashcards

Nominal variables

Variable also known as categorical variables that uses counts or frequencies to describe data

Signup and view all the flashcards

Skew

Describes the symmetry of a distribution; can be negative (left), positive (right), or no skew (normal distribution).

Signup and view all the flashcards

Kurtosis

Measures the 'tailedness' of a distribution; can be platykurtic (too flat), mesokurtic, or leptokurtic (too pointy).

Signup and view all the flashcards

Standard score (z score)

Transforms raw scores into a standard scale with a mean of 0 and standard deviation of 1; expresses how many standard deviations a raw score is from the mean.

Signup and view all the flashcards

Study Notes

  • Descriptive statistics are being examined
  • Central tendency, dispersion, and distributions are covered
  • JASP is used for descriptive statistics

Research Study

  • Note-taking format affects lecture content recall
  • Note-taking format impacts memory of exam material

Types of Statistics

  • Descriptive statistics involve central tendency
  • Inferential statistics exist

Central Tendency

  • Central tendency is the measure of the typical value for a probability distribution

Mean

  • The mean is the the average of a set of values
  • The mean is calculated by summing all values and dividing by the number of values
  • Formula: 𝑋 = (Σ X_i) / N

Median

  • The median is the middle value of an ordered data set
  • For N=15 observations, the median is the 8th value
  • For N=14 observations, the median is the average of the 7th and 8th values

Median vs. Mean

  • The median of a data set with an outlier of 500 is 40
  • This is the same as a data set without the outlier

Mode

  • The mode is the most frequently occurring value in a data set

Nominal Variables

  • Nominal variables can also be called categorical variables
  • Counts or frequencies can describe categorical variables
  • An example of a categorical variable includes degree choice (Psychology, Engineering, Business, Arts, Other)
  • Unit of measure is student
  • Mode is a type of nominal variable

Dispersion

  • Dispersion is the extent to which a distribution is stretched or squeezed

Range

  • The range is the difference between the largest and smallest values in a data set
  • The range is calculated as: Xmax - Xmin

Interquartile Range

  • The interquartile range (IQR) is the difference between the 25th and 75th percentiles
  • The median is the 50th percentile
  • 25% of the data lies below the 25th percentile and one quarter of the data is above the 75th percentile
  • The IQR represents the range spanned by the middle half of the data

Variance

  • Variance is the mean squared deviation
  • Var(𝑋) = Σ(X_i − 𝑋)^2 / N, where N is the number of observations

Standard Deviation

  • Standard deviation is the root mean square deviation
  • s = √[Σ(X_i − 𝑋)^2 / N]
  • 68% of the data falls within 1 standard deviation of the mean
  • 95% of the data falls within 2 standard deviations of the mean
  • 99.7% of the data falls within 3 standard deviations of the mean

Distributions

  • Distributions include skew and kurtosis

Skewness

  • No skew (normal distribution): mean = median = mode
  • Negative skew (left skew): tail is on the left; Mean < Median < Mode
  • Positive skew (right skew): tail is on the right; Mode < Median < Mean

Kurtosis

  • Platykurtic ('too flat'): Kurtosis < 0
  • Mesokurtic: Kurtosis ≈ 0
  • Leptokurtic ('too pointy'): Kurtosis > 0

Standard (z) Score

  • Standard score indicates how many standard deviations an element is from the mean
  • z_i = (X_i - X) / s
  • Example: a score of 55, when the mean is 40.1 and standard deviation is 6.6, has a 𝑍 score of 2.26, and lies 2.26 Standard Deviations above the mean

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Week 2 Chemical Bonding Terms Flashcards
20 questions
Week 2 - Classical Ethical Theories
18 questions
Week 2 - Computer Memory Basics
4 questions
Use Quizgecko on...
Browser
Browser