Hypothesis Testing Explained

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

What is the purpose of a hypothesis-testing framework?

  • To provide an objective framework for making decisions using probabilities. (correct)
  • To ensure decisions are inconsistent.
  • To complicate the decision-making process.
  • To introduce subjective impressions into decision-making.

In hypothesis testing, what does the null hypothesis (H₀) represent?

  • The hypothesis that is to be tested. (correct)
  • The hypothesis that represents a new theory.
  • The hypothesis that is always true.
  • The hypothesis that contradicts the alternative hypothesis.

What does a Type I error in hypothesis testing refer to?

  • Accepting the alternative hypothesis when it is true.
  • Rejecting the null hypothesis when it is true. (correct)
  • Failing to reject the null hypothesis when it is false.
  • Accepting the null hypothesis when it is false.

What is the significance level (α) of a test?

<p>The probability of rejecting the null hypothesis when it is true. (D)</p> Signup and view all the answers

Which of the following describes the power of a test?

<p>The probability of correctly rejecting a false null hypothesis. (C)</p> Signup and view all the answers

In the context of hypothesis testing, what does the 'acceptance region' refer to?

<p>The range of values for which the null hypothesis is accepted. (A)</p> Signup and view all the answers

What characterizes a one-tailed test?

<p>The parameter under the alternative hypothesis can only be greater than the value under the null hypothesis. (B)</p> Signup and view all the answers

When is it more appropriate to use a one-sided test rather than a two-sided test?

<p>When alternatives on only one side of the null mean are of interest or possible. (D)</p> Signup and view all the answers

In a one-sample z-test for the mean with known variance, how is the test statistic (z) calculated?

<p>$z = (\bar{x} - \mu_0) / (\sigma / \sqrt{n})$ (C)</p> Signup and view all the answers

If, in a one-sample z-test (μ₁ < μ₀), the calculated z-value is less than zα, what decision should be made?

<p>Reject H₀. (C)</p> Signup and view all the answers

In a two-sided z-test, what is the condition for rejecting the null hypothesis H₀?

<p>z &lt; zα/2 or z &gt; z1-α/2 (B)</p> Signup and view all the answers

When conducting a one-sample t-test with unknown variance, what is the formula for the test statistic?

<p>$t = (\bar{x} - \mu_0) / (s / \sqrt{n})$ (A)</p> Signup and view all the answers

In a one-sample t-test, if the calculated t-value is greater than tn-1,1-α, what conclusion can be drawn?

<p>Reject H₀. (B)</p> Signup and view all the answers

What is the 'critical-value method' in hypothesis testing?

<p>A method that compares a test statistic to a critical value to determine the outcome of a test. (B)</p> Signup and view all the answers

What does the p-value represent in hypothesis testing?

<p>The probability of observing a test statistic as extreme as, or more extreme than, the one computed if the null hypothesis is true. (C)</p> Signup and view all the answers

According to the guidelines for judging the significance of a p-value, if 0.01 ≤ p < 0.05, the results are considered:

<p>Significant. (B)</p> Signup and view all the answers

What does it mean if a result is declared 'not statistically significant'?

<p>The p-value is greater than 0.05. (D)</p> Signup and view all the answers

For a one-sample t-test with an alternative hypothesis μ > μ₀, how is the p-value calculated?

<p>p = Pr(tn-1 &gt; t) (B)</p> Signup and view all the answers

In a two-tailed t-test with unknown variance, when do you reject the null hypothesis?

<p>When |t| &gt; tn-1,1-α/2 (D)</p> Signup and view all the answers

What should be done to avoid potentially biasing the conclusions in hypothesis testing?

<p>The decision on whether to use a one-sided or two-sided test must be made before beginning the data analysis or data collection. (D)</p> Signup and view all the answers

What is a key factor that influences the power of a test?

<p>How likely it is that a statistically significant difference will be detected if the alternative hypothesis is true. (C)</p> Signup and view all the answers

How does decreasing the significance level (α) affect the power of a test, assuming other factors remain constant?

<p>It decreases the power. (C)</p> Signup and view all the answers

What effect does increasing the sample size have on the power of a test, assuming other factors remain constant?

<p>The power increases. (B)</p> Signup and view all the answers

If the standard deviation of the distribution of individual observations increases, how does this typically affect the power of the test?

<p>Decreases power. (B)</p> Signup and view all the answers

How does the distance between the null and alternative means (|μ₀ - μ₁|) affect the power of a test?

<p>The power increases as the distance increases. (C)</p> Signup and view all the answers

How does an increase in the required power affect the necessary sample size?

<p>The sample size increases. (C)</p> Signup and view all the answers

What is the relationship between hypothesis testing and confidence intervals in the two-sided case?

<p>If the two-sided confidence interval contains μ₀, H₀ is accepted. (C)</p> Signup and view all the answers

In Bayesian inference, what is a 'prior distribution'?

<p>A probability distribution of possible values for a parameter specified before looking at available sample data. (B)</p> Signup and view all the answers

What is a 'flat' or 'noninformative' prior distribution?

<p>A prior distribution where all possible values of the parameter are equally likely. (D)</p> Signup and view all the answers

In Bayesian Inference with a normal distribution and known σ, what information is required to make inferences about μ?

<p>The sample mean. (B)</p> Signup and view all the answers

What information does a 100% x (1 - α) posterior predictive interval provide?

<p>A range of values in which we are (1 - α)% confident that a future observation will fall, based on the sample data. (B)</p> Signup and view all the answers

In the one-sample χ² test for variance, what is being tested?

<p>Whether the variance of a normally distributed population is equal to a specified value. (D)</p> Signup and view all the answers

In the one-sample χ² test for the variance of a normal distribution what is the test statistic?

<p>$X^2 = (n-1)s^2 / \sigma_0^2$ (C)</p> Signup and view all the answers

What is the condition for using a normal-theory method for the one-sample inference for the binomial distribution?

<p>np₀q₀ ≥ 5 (B)</p> Signup and view all the answers

Which distribution is used to approximate the binomial distribution when using the normal theory method?

<p>Standard Normal Distribution (B)</p> Signup and view all the answers

What is the standardized mortality ratio (SMR)?

<p>An index used to quantity risk in a study population relative to the general population. (B)</p> Signup and view all the answers

What test statistic is typically computed for the one-sample inference for the Poisson distribution (large-sample test)?

<p>χ² Statistic (B)</p> Signup and view all the answers

What is the rule of thumb for when to use the large-sample test for a Poisson distribution?

<p>μ₀ ≥ 10 (A)</p> Signup and view all the answers

Flashcards

Null hypothesis (Ho)

The hypothesis to be tested in hypothesis testing.

Alternative hypothesis (H₁)

The hypothesis that contradicts the null hypothesis.

Acceptance region

Range of values for which the null hypothesis (Ho) is accepted.

Rejection region

Range of values for which the null hypothesis (Ho) is rejected.

Signup and view all the flashcards

One-tailed test

Test where the parameter can be greater or less than null.

Signup and view all the flashcards

Type I error (α)

Probability of rejecting the null hypothesis when it is true.

Signup and view all the flashcards

Type II error (β)

Probability of accepting the null hypothesis when it is false.

Signup and view all the flashcards

Power of a test (1 - β)

The probability of correctly rejecting a false null hypothesis.

Signup and view all the flashcards

P-value

The a level at which you are indifferent between accepting or rejecting Ho.

Signup and view all the flashcards

Critical-value method

The method of comparing a test statistic to a critical value.

Signup and view all the flashcards

P-value method

Reject Ho if p < α, otherwise, Ho is accepted.

Signup and view all the flashcards

Two-tailed test

Test where values can be greater OR less than the null.

Signup and view all the flashcards

One-Sample t-Test (μ₁ > μ₀)

T test where the alternative mean is greater than the null mean.

Signup and view all the flashcards

One-Sample t-Test (μ₁ < μ₀)

T test where the alternative mean is less than the null mean.

Signup and view all the flashcards

Sample-Size Estimation Based on CI Width

Estimate mean with sample variance s^2

Signup and view all the flashcards

Standardized Mortality Ratio (SMR)

It quantifies risk in a study population.

Signup and view all the flashcards

Study Notes

Introduction

  • Hypothesis testing provides an objective decision making framework using probabilities, instead of subjective feelings
  • It offers a uniform and consistent approach to decision-making
  • In one-sample problems, hypotheses are about a single distribution
  • Two-sample problems involve comparison between two different distributions

Key Hypotheses

  • The null hypothesis (Ho) is the hypothesis to be tested
  • The alternative hypothesis (H₁) contradicts the null hypothesis
  • Only possible decisions are considered: whether Ho is true or H₁ true
  • Outcomes in hypothesis testing generally refer to the null hypothesis

Outcomes and Errors in Hypothesis Testing

  • If Ho is true and we decide Ho is true, we accept Ho
  • If H₁ is true and we decide H₁ is true, we reject Ho
  • A Type I error the probability of rejecting the null hypothesis when it is actually true, is denoted by α It's the significance level of a test
  • A Type II error is the probability of accepting the null hypothesis when the alternative hypothesis H₁ is true, is denoted by ß
  • The power of a test is the probability of rejecting Ho when H₁ is true, calculated as 1 - ß
  • Aim: Use statistical tests to minimize α and ß, thus minimizing errors in rejecting or accepting the null hypothesis

One-Sample Test for the Mean of a Normal Distribution with One-Sided Alternatives

  • Acceptance region: Values of x for which Ho is accepted
  • Rejection region: Values of x for which Ho is rejected
  • If x is sufficiently smaller than µ0, then Ho is rejected; otherwise, Ho is accepted.
  • When Ho is true: x values cluster around µo
  • When H₁ is true: x values cluster around μ₁
  • A test based on sample mean has the highest power, given a type I error of α
  • A one-tailed test allows values under the alternative hypothesis to be either greater or less than the null hypothesis, but not both

One-Sample z Test (One-Sided Alternative)

  • To test Ho: μ = μo vs. H₁: μ < μo, with a significance level of α and known standard deviation σ, the test statistic is z = (x - μo)/(σ/√n)
  • If z < zα, Ho is rejected; if z ≥ zα, Ho is accepted
  • The p-value is given by p = Φ(z)
  • To test Ho: μ = μo vs. H₁: μ > μo, with a significance level of α and known standard deviation σ, the test statistic is z = (x - μo)/(σ/√n)
  • If z > z1-α, Ho is rejected; if z ≤ z1-α , Ho is accepted
  • The p-value is given by p = 1 - Φ(z)

One-Sample z Test (Two-Sided Alternative)

  • To test H₀: μ = μ₀ vs. H₁: μ ≠ μ₀ with a significance level of α and known standard deviation σ, the test statistic is z = (x - μ₀) / (σ/√n)
  • If z < zα/2 or z > z1-α/2, then H₀ is rejected.
  • If zα/2 ≤ z ≤ z1-α/2, then H₀ is accepted.
  • To compute a two-sided p-value: p = 2Φ(z) if z ≤ 0; p = 2[1 - Φ(z)] if z > 0

One-sample t Test (Alternative Mean < Null Mean)

  • To test H₀: μ = μ₀, σ unknown vs. H₁: μ < μ₀, σ unknown with a significance level of α: t = (x - μ₀) / (s/√n)
  • If t < tn-1,α, then reject H₀.
  • If t ≥ tn-1,α, then accept H₀.
  • The value of t is called a test statistic, the value of tn-1,α is called a critical value
  • The general approach is to compute a test statistic and compare it with a critical value; this is the critical-value method

P-Value Significance

  • The p-value is the α level at which we would be indifferent between accepting or rejecting H₀
  • It is the α level at which the given value of the test statistic (such as t) is on the borderline
  • Solving for p gives p = Pr(tn-1 ≤ t)
  • p-value is the probability of obtaining a test statistic as extreme as or more extreme than the one obtained, assuming the null hypothesis is true
  • p-value indicates the significance of results, without repeated significance tests at different α levels
  • If 0.01 ≤ p < 0.05, results are significant
  • If 0.001 ≤ p < 0.01, results are highly significant
  • If p < 0.001, results are very highly significant
  • If p > 0.05, results are not statistically significant (NS)
  • If 0.05 ≤ p < 0.1, a trend toward statistical significance may be noted

Methods to Establish Statistical Significance

  • Compute the test statistic t and compare with the critical value tn-1,α at an α level of 0.05
  • If Ho: μ = μo vs. H1: μ < μo is tested and t < tn-1,0.05, then Ho is rejected (statistically significant with p < 0.05); otherwise Ho is accepted (not statistically significant with p ≥ 0.05); this is the critical-value method
  • Compute the exact p-value; if p < 0.05, then Ho is rejected and the results are statistically significant; otherwise Ho is accepted (results not statistically significant); this is the p-value method

One-Sample t Test (Alternative Mean > Null Mean)

  • To test Ho: μ = μo vs. H₁: μ > μo with significance level α, the test statistic is based on t = (x - μo)/(s/√n)
  • If t > tn-1,1-α, then Ho is rejected
  • If t ≤ tn-1,1-α, then Ho is accepted
  • The p-value for this test is given by p = Pr(tn-1 > t)

One-Sample Test with Two-Sided Alternatives

  • A two-tailed test values under the alternative hypothesis are allowed to be either greater or less than the values under the null hypothesis (µo)
  • To test for alternatives on either side of the null mean, reject Ho if t < c₁ or t > c₂ (c1, c2 are constants)
  • Accept Ho if c₁ ≤ t ≤ c₂
  • Pr(reject Ho | Ho true) = Pr(t < c₁ | Ho true) + Pr(t > c₂ | Ho true) = α
  • Pr(t < c₁ | Ho true) = Pr(t > c₂ | Ho true) = α/2

One-sample t Test (Two-Sided Alternative)

  • To test the hypothesis H₀: μ = μ₀ vs. H₁: μ ≠ μ₀ with a significance level of α, the test statistic used is t = (x̄ - μ₀) / (s/√n)
  • Reject H₀ if |t| > tn-1,1-α/2
  • Accept H₀ if |t| ≤ tn-1,1-α/2

P-Value (Two-Sided Alternative)

  • Let t = (x - μ0) / (s/√n)
  • p = 2 × Pr(tn-1 ≤ t), if t ≤ 0
  • p = 2 × [1 - Pr(tn-1 ≤ t)], if t > 0
  • The p-value is the probability under the null hypothesis of obtaining a test statistic as extreme as or more extreme than the observed test statistic
  • Extremeness is measured by the absolute value of the test statistic

One-Sided vs. Two-Sided Tests

  • The sample means falls in the expected direction from µ₀ and it's easier to reject H₀ using a one-sided test
  • A two-sided test does not require guessing the appropriate side of the null hypothesis for the alternative hypothesis.
  • If only alternatives on one side of the null mean are of interest or possible, a one-sided test is better because it has more power
  • Decision (one-sided or two-sided test) must be made before data analysis or collection to avoid biased conclusions.
  • Avoid changing from a two-sided to a one-sided test after looking at the data.

Power of a Test

  • The power of a test indicates the likelihood of detecting a statistically significant difference when the alternative hypothesis is true.

Formula

  • (x - μ₀) / (σ/√n) > z1-α; multiplying both sides with σ/√n and adding μ₀: x > μ₀ + z1-α σ/√n
  • Alternative formula x ≤ μ₀ ± z1-α σ/√n

Power for the One-Sample z Test (One-Sided Alternative)

  • For the hypothesis H₀: μ = μ₀ vs. H₁: μ = μ₁, assuming a normal distribution and known population variance (σ²): Φ(zα + |μ₀ - μ₁| √(n)/σ) = Φ(-z1-α + |μ₀ - μ₁|√(n)/σ)
  • Significance Levels: Smaller significance levels (α decreases) reduces test power
  • Alternative Mean: Greater difference between the alternative and null means (|μ₀ - μ₁| increases) increases test power
  • Standard Deviation: Larger standard deviation in individual observations (σ increases) decreases test power
  • Sample Size: Larger sample sizes (n increases) increases test power

Power for the One-Sample z Test (Two-Sided Alternative)

  • Power of the two-sided test Ho: μ = μo vs. H₁: μ ≠ μo with the specific alternative μ = μ₁, where the underlying distribution is normal and the population variance (σ²) is assumed known exactly by
  • Φ[-z1-α/2 + ((μo - μ₁) √n) / σ] + Φ[-z1-α/2 + ((μ1 - μo) √ n) / σ]
  • Or Approximately by Φ[-z1-α/2 + (μo-μ1)√n / σ]

Sample-Size Determination: One-Sided Alternatives

  • For testing Ho: μ = μo vs. H₁: μ = μ₁ with normal data with mean µ and known variance σ², the necessary sample size for a one-sided test with significance level α and the defined Power :
  • n = (σ² (z1-β + z1-α)² / (μ₀ - μ₁)²)

Factors Affecting Sample Size

  • The sample size increases as σ² increases
  • The sample size increases as the significance level is made smaller (α decreases)
  • The sample size increases as the required power increases (1- β increases)
  • The sample size decreases as the absolute value of the distance increases

Sample-Size Estimation (Two-Sided Alternative)

  • For testing H₀: μ = μ₀ vs. H₁: μ = μ₁ with normal data with mean µ and known variance σ², the necessary sample size for a one-sided test with significance level α and the defined power :
  • n = (σ² (z1-β + z1-α/2)² / (μ₀ - μ₁)² )

Sample-Size Estimation Based on CI Width

  • To estimate the mean of a normal distribution with sample variance s²: n = 4z²1-α/2 s² / L²
  • The number of subjects needed for the two-sided 100% × (1 - α) Confidence Interval be no wider than L

Hypothesis Testing and Confidence Intervals Relationship

  • H₀: μ = μ₀ vs. H₁: μ ≠ μ₀ is rejected with a two-sided level α test if the two-sided 100% × (1 - α) Confidence Interval for μ does not contain μ₀
  • H₀: μ = μ₀ vs. H₁: μ ≠ μ₀ is accepted with a two-sided level α test if the two-sided 100% × (1 - α) Confidence Interval for μ contains μ₀
  • Rejection region is a two-sided level a test

Bayesian Inference

  • We need to specify a prior distribution for μ in order to use that approach
  • A prior distribution generalizes the concept of a prior probability: where a probability distribution of a possible values for a specified before looking at the available sample data
  • This distribution is then modified after the data is collected to find a posterior for μ
  • In the absence of data, there is no knowledge of µ, so all possible values of u are equally likely This is called flat or noninformative prior distribution and is written as Pr(μ) ∝ c, where c is a constant
  • If a distribution is normal and σ is known, then all the information in the sample concerning μ is contained in x, that is, Pr(μ|x1,...,xn) = Pr(μ|x) -

Bayesian Formula

  • X is referred to as a sufficient statistic for μ
  • distribution, we use Baye's rule. Pr(μ|x) = Pr(x | μ)Pr(μ)/Pr(x).
  • the prior probability of µ, Pr(u), is the same for all values of μ
  • Pr(μ|x) x Pr(x| μ)
  • approximation for a continuous distribution
    • f(x|μ) =[1/(√2π σ/√n)]exp{-(1/2)[(x̄ -μ)/(σ/√n)]²} = [1/(√2π σ/√n)]exp{-(1/2)[(μ-x̄)/(σ/√n)]²]
  • Pr(μ | x) ∝ [1/(√2π σ/√n)]exp{-(1/2)[(μ - x̄)/(σ/√n)]²}
  • The distribution of µ given x is normally distributed with mean = x and variance σ²/n, or μ|x~ N(x, σ²/n)

Bayesian Distribution Formula

  • A 100% × (1 - a) posterior predictive interval for u: Pr(μ₁ < μ < μ₂) = (1 - α) where μ₁ = x̄ - z1- α/2 σ/√n and μ₂ = x̄ + z1- α/2 σ/√n

One-Sample x² Test (Two-Sided Alternative)

  • The test statistic X² = ((n − 1)s²) / σ₀²
  • if X² < X²n-1,α/2 or X² > X²n-1,1- α/2, Ho is rejected
  • If X²n-1,α/2 ≤ X² ≤ X²n-1,1- α/2 then Ho is accepted

P-Value for a One-Sample x² Test (Two-Sided Alternative)

  • Test statistic X² = [(n-1)s²]/σ₀²
  • If s² ≤ σ₀² then p-value = 2 × (area to the left of X² under a X²n-1 distribution)
  • If s² > σ₀² then p-value = 2 × (area to the right of X² under a X²n-1 distribution)

One-Sample Inference for the Binomial Distribution: Normal-Theory Methods

  • Test statistic z = (p^- po/√poqo/n
  • if z < za/2 or z > Z1-a/2 then Ho is rejected
  • if za/2 ≤ z ≤ Z1-a/2 then Ho is accepted
  • Should only be used if npoqo ≥ 5

Computation of the p-value (Two-Sided Alternative)

z (P^-Po/Pogo/n

  • Statistic Test
  • If P^≤ Po, then value = 2xÞ(z) = twice the area to the left of z under an N(0,1) curve
  • If P> Po, then z-value = 2x[1-Þ(z)] = twice the area to the right of z under an N(0,1) curve.

One Sample Poisson Distribution

  • An index used to quantify a populations risk relative to the another population is the standardized mortality ration (SMR)
  • It is defined by 100% x O/E = 100% * the observed number of deaths / the expected number of deaths under the assumption that the populations have the same mortality rates
  • For nonfatal conditions the SMR is known as the standardized morbidity ratio

Power for the One-Sample Binomial Test (Two-Sided Alternative) Formula

  • Ho: p = po vs. H₁: p ≠ po: Φ[√Polo/P191 (za/2 + |Po-P1|/√pogo)]

Formula

    • Test- 2 =(x - μ₂) for a two sided test. 2=H₂ is rejected if
  • IfHo 42.²²-a is given by Pr(k²>,)
  • This text should only be used if μ ₁, 10

Summary

  • specification of the null and alternative hypotheses
  • type I error, type II error, and the power of a hypothesis test; the p-value of a hypothesis test; distinction between on-sided and two sided tests
  • methods for estimating appropriate sample size as determined by the prespecified null and alternative hypotheses and type I and type II errors
  • One-sample hypothesis-testing cases
  • hypothesis tests are conducted through
    • Specifying critical values to determine the acceptance and rejection regions based on a specified type I error a
    • Computing p-values.
  • Bayesian inference methods
    • when no prior information exists concerning a parameter
    • when a substantial amount of prior information is available.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Alpha Level and Null Hypothesis Quiz
3 questions
Hypothesis Testing Overview
24 questions
Hypothesis testing
15 questions

Hypothesis testing

BetterThanExpectedThermodynamics1470 avatar
BetterThanExpectedThermodynamics1470
Use Quizgecko on...
Browser
Browser