Podcast
Questions and Answers
What type of variable is required for the outcome in a t-test?
What type of variable is required for the outcome in a t-test?
Interval scale or higher.
In the context of comparing two groups, what is the standard statistical approach?
In the context of comparing two groups, what is the standard statistical approach?
To use a t-test.
What is the practical purpose of the z-test, as described in Section 13.1?
What is the practical purpose of the z-test, as described in Section 13.1?
It serves as a stepping stone for teaching statistics, specifically toward understanding the t-test.
What assumptions are made about the population when using the z-test to compare if psychology students tend to get the same grades as everyone else?
What assumptions are made about the population when using the z-test to compare if psychology students tend to get the same grades as everyone else?
How is the test statistic in the z-test calculated?
How is the test statistic in the z-test calculated?
What is the key property of the z-score that makes it useful as a test statistic?
What is the key property of the z-score that makes it useful as a test statistic?
What are the three assumptions of the z-test?
What are the three assumptions of the z-test?
Why is assuming a known standard deviation of the population considered a 'stupid' assumption in the z-test?
Why is assuming a known standard deviation of the population considered a 'stupid' assumption in the z-test?
Under which condition a t-test is used instead of z-test?
Under which condition a t-test is used instead of z-test?
What parameter distinguishes the t-distribution from the normal distribution?
What parameter distinguishes the t-distribution from the normal distribution?
What is compared against during a one-sample t-test?
What is compared against during a one-sample t-test?
Besides different from zero, which other component is required for an effect to be qualified as 'statistically significant'?
Besides different from zero, which other component is required for an effect to be qualified as 'statistically significant'?
What assumptions are necessary while performing a one-sample t-test?
What assumptions are necessary while performing a one-sample t-test?
Define an independent samples t-test.
Define an independent samples t-test.
Explain which assumptions are necessary when referring to 'independent samples'.
Explain which assumptions are necessary when referring to 'independent samples'.
In the context of Students t-test, assuming for the moment you want to run a two-sided test, what is often the goal?
In the context of Students t-test, assuming for the moment you want to run a two-sided test, what is often the goal?
Under the null hypothesis, what should the difference between the population means be equal to?
Under the null hypothesis, what should the difference between the population means be equal to?
Apart from the value, what is also necessary for the formula of the standard error?
Apart from the value, what is also necessary for the formula of the standard error?
When is it valid to use a 'pooled estimate' instead of Student's?
When is it valid to use a 'pooled estimate' instead of Student's?
What the weight that is associated to each sample equal to?
What the weight that is associated to each sample equal to?
If a student were to fall under one of the two tutorials (from two different groups), how does this affect deviation from the group mean?
If a student were to fall under one of the two tutorials (from two different groups), how does this affect deviation from the group mean?
What does the Student t-test assume about variances?
What does the Student t-test assume about variances?
Describe some assumptions of Students' independent sample t-test.
Describe some assumptions of Students' independent sample t-test.
What is the assumption in place to resolve ambiguity in the t-test?
What is the assumption in place to resolve ambiguity in the t-test?
How is this assumption solved?
How is this assumption solved?
What does the Welsh T-test not assume?
What does the Welsh T-test not assume?
Regarding the Welch test, how is the t-statistic calculated?
Regarding the Welch test, how is the t-statistic calculated?
Define what exactly is a data set with repeated measures entail.
Define what exactly is a data set with repeated measures entail.
How can any repeated measures results be approached?
How can any repeated measures results be approached?
Why is the paired-samples t-test 'not really a new test at all'?
Why is the paired-samples t-test 'not really a new test at all'?
Give an example where is it appropriate to specify a one-sided test?
Give an example where is it appropriate to specify a one-sided test?
What do one-sided tests affect?
What do one-sided tests affect?
When is it appropriate to use Cohen's d?
When is it appropriate to use Cohen's d?
How is Cohen's d defined primarily?
How is Cohen's d defined primarily?
When is another time in which it makes sense to apply Cohen's d?
When is another time in which it makes sense to apply Cohen's d?
What is calculated with Cohen's d and provides reasons to why there are multiple versions?
What is calculated with Cohen's d and provides reasons to why there are multiple versions?
How can we check the normality of a sample?
How can we check the normality of a sample?
Describe a 'quantile-quantile plot (QQ plot)'
Describe a 'quantile-quantile plot (QQ plot)'
Apart from the visual aspect, how can QQ plots be used?
Apart from the visual aspect, how can QQ plots be used?
What's the hypothesis being tested?
What's the hypothesis being tested?
In what situation is a t-test typically used, and what kind of predictor variable is involved?
In what situation is a t-test typically used, and what kind of predictor variable is involved?
What assumption does the Student's t-test make that the Welch's t-test does not?
What assumption does the Student's t-test make that the Welch's t-test does not?
How do the null and alternative hypotheses differ when conducting a one-sided t-test compared to a two-sided t-test?
How do the null and alternative hypotheses differ when conducting a one-sided t-test compared to a two-sided t-test?
In a paired samples t-test, is it necessary to have an equivalent number of data points in each group? Explain why or why not.
In a paired samples t-test, is it necessary to have an equivalent number of data points in each group? Explain why or why not.
What is Cohen's d, and what does it measure?
What is Cohen's d, and what does it measure?
What is the purpose of running a Shapiro-Wilk test, and what type of data is needed?
What is the purpose of running a Shapiro-Wilk test, and what type of data is needed?
Describe the primary use case for Wilcoxon tests, and how do they differ conceptually from t-tests?
Describe the primary use case for Wilcoxon tests, and how do they differ conceptually from t-tests?
Why it it important to report the t-statistic, degrees of freedom, and p-value?
Why it it important to report the t-statistic, degrees of freedom, and p-value?
What information is provided from the formula for performing One Sample t-test?
What information is provided from the formula for performing One Sample t-test?
What are the two different version of Cohen's d you can calculate depending on the properties of the Sample, and what are the advantages of one versus the other?
What are the two different version of Cohen's d you can calculate depending on the properties of the Sample, and what are the advantages of one versus the other?
Flashcards
What is a t-test?
What is a t-test?
A test to compare the means of two groups.
What is a One-sample t-test?
What is a One-sample t-test?
Compares a single sample mean to a known or hypothesized population mean.
What is a z-test?
What is a z-test?
A mostly useless test that serves as a stepping stone to the t-test.
What is the null hypothesis (H0)?
What is the null hypothesis (H0)?
Signup and view all the flashcards
What is a z-score?
What is a z-score?
Signup and view all the flashcards
What is standard error?
What is standard error?
Signup and view all the flashcards
What is the t-distribution?
What is the t-distribution?
Signup and view all the flashcards
What is the Shapiro-Wilk Test?
What is the Shapiro-Wilk Test?
Signup and view all the flashcards
What is the Student's t-test?
What is the Student's t-test?
Signup and view all the flashcards
What is Cohen's d?
What is Cohen's d?
Signup and view all the flashcards
What is a non-parametric test?
What is a non-parametric test?
Signup and view all the flashcards
What is a Wilcoxon test?
What is a Wilcoxon test?
Signup and view all the flashcards
What are Paired samples t-test?
What are Paired samples t-test?
Signup and view all the flashcards
What is a QQ plot?
What is a QQ plot?
Signup and view all the flashcards
What is one-sided test?
What is one-sided test?
Signup and view all the flashcards
What is Welch's t-test?
What is Welch's t-test?
Signup and view all the flashcards
What is the paired samples t-test?
What is the paired samples t-test?
Signup and view all the flashcards
Study Notes
Comparing Two Means
- Compares the means of two groups where the outcome variable is interval or ratio scale and the predictor is binary
- A t-test is standard for comparing means; variations exist based on the question at hand
- Includes one-sample, independent samples, and paired samples t-tests
- Cohen’s d also covered as a standard measure of effect size for a t-test
The One-Sample Z-Test
- A test of limited real-world use, mainly serving as a transition to the more versatile t-test
Inference Problem Addressed
- Dr. Zeppo's class average is 67.5 with a standard deviation of 9.5
- A sample of 20 psychology students has a mean grade of 72.3
- It addresses the question: do psychology students score differently (higher or lower) than the average?
Constructing the Hypothesis Test
-
Null hypothesis (H0): The true population mean for psychology student grades is 67.5
-
Alternative hypothesis (H1): The population mean is not 67.5
-
Assumes psychology grades are normally distributed
-
Assumes scores have a known standard deviation is 𝛔 = 9.5
Diagnostic Test Statistic
- The difference serves as a starting point since hypotheses refer to population mean
- Calculate 𝑋 − 𝜇0, the difference between the sample mean 𝑋 and the hypothesized population mean 𝜇0
- If zero or close, the null hypothesis is supported
- Relies on knowing the raw data are normally distributed with population standard deviation 𝜎
Population Distribution
- If the null hypothesis is valid, the data follows a normal distribution: 𝑋 ~ Normal(𝜇0, 𝜎2)
- The sampling distribution of the mean 𝑋 is also normal and has mean 𝜇
- Standard error of mean is: 𝑆𝐸(𝑋) = 𝜎/√𝑁
Standard Score Conversion
- Converts sample mean 𝑋 into a standard score 𝑧𝑋
- Formula: 𝑧𝑋 = (𝑋 − 𝜇0) / 𝑆𝐸(𝑋) or 𝑧𝑋 = (𝑋 − 𝜇0) / (𝜎/√𝑁)
- This z-score is the test statistic, and like all z-scores, follows a standard normal distribution: 𝑍𝑋 ~ Normal(0, 1)
Critical Regions
- The 5% critical region for the z-test remains constant regardless of the population parameters for raw scores
- Researchers can calculate their z-statistic and reference a textbook table
Worked Example Using R
- Calculating sample mean:
sample.mean <- mean(grades)
- Define variables for population standard deviation (σ = 9.5) and null hypothesis population mean (μ0 = 67.5):
> mu.null <- 67.5
> sd.true <- 9.5
- Variable for sample size
> N <- length(grades)
- Calculate standard error of the mean:
> sem.true <- sd.true / sqrt(N)
- The z-score calculation:
>z.score <- (sample.mean - mu.null) / sem.true
- p-value calculation
- Calculating area under the curve (from 2.26 upwards):
> upper.area <- pnorm(q = z.score, lower.tail = FALSE)
- Calculating area from -2.26 downwards
> lower.area <- pnorm(q = -z.score, lower.tail = TRUE)
- The p-value is then
> p.value <- lower.area + upper.area
- Can report as z = 2.26, N = 20, p < .05).
Assumptions of the z-test
- Normality: Assumes the true population distribution is normal, but can be checked
- Independence: Assumes observations aren't related or correlated, but relies on good experimental design
- Known Standard Deviation: Assumes population standard deviation is known, typically not true in real data analysis
The One-Sample T-Test
- An alternative when population standard deviation (σ) is unknown
- Uses the estimated standard deviation
- Requires adjustment due to reliance on the estimate
Introducing The T-Test
- The ambiguity in the z-test caused by relying on true 𝛔 to be resolved by William Sealy Gosset in 1908
- Gossett, working for Guinness brewery, published under pseudonym "A Student"
- Recognized how the sampling distribution changes, t distribution and the t-statistic is calculated
T-statistic
- Calculated the same way with true mean is μ
- Sample mean 𝑋 and population standard deviation is Ô 𝑡 = 𝑋 − 𝜇 / 𝛔𝑁
- Sampling distribution turns into a t-distribution with N - 1 df
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.