CHAPTER 14

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

A researcher aims to estimate the average lifespan of a specific species of butterfly. They collect multiple random samples and calculate the sample mean lifespan for each. Which statement best describes how these sample means relate to the true population mean?

  • The average of all sample means will always be significantly different from the true population mean.
  • Virtually none of the sample means will be exactly equal to the true population mean, but will vary around it. (correct)
  • The sample means will cluster far from the population mean due to sampling error.
  • All sample means will be equal to the true population mean.

A statistician calculates a 95% confidence interval for the average body temperature of a population, assuming a normal distribution with a known standard deviation. Which action would most likely result in a narrower confidence interval?

  • Using a t-distribution instead of a normal distribution.
  • Decreasing the sample size.
  • Increasing the sample size. (correct)
  • Increasing the confidence level to 99%.

A researcher is studying the effectiveness of a new drug. They perform a hypothesis test and obtain a p-value of 0.08. Assuming a significance level (alpha) of 0.05, what is the correct interpretation of the p-value?

  • The null hypothesis is rejected because the p-value is less than the significance level.
  • There is not enough evidence to reject the null hypothesis. (correct)
  • The null hypothesis is accepted because the p-value is greater than the significance level.
  • The alternative hypothesis is rejected because the p-value is too high

In hypothesis testing for a population mean with known standard deviation, how does increasing the sample size affect the likelihood of rejecting a false null hypothesis (i.e., statistical power)?

<p>It increases the likelihood of rejecting a false null hypothesis. (D)</p> Signup and view all the answers

Imagine you construct a 95% confidence interval for a population mean based on a random sample. Which of the following statements best interprets what the 95% confidence level means?

<p>If you were to take 100 random samples and construct a confidence interval from each, approximately 95 of those intervals would contain the true population mean. (B)</p> Signup and view all the answers

A researcher calculates a 90% confidence interval for the average height of women in a certain city. Which of the following is the correct interpretation of this confidence level?

<p>We are 90% confident that the calculated interval contains the true average height of women in the city. (C)</p> Signup and view all the answers

Given a 95% confidence interval for a population mean, which of the following actions would not decrease the width of the interval?

<p>Increase the population standard deviation. (A)</p> Signup and view all the answers

A laboratory technician is measuring the concentration of a certain chemical in water samples. The measurement equipment has a known standard deviation $\sigma = 0.5$ mg/L. The technician takes four measurements: 10.1, 10.3, 10.5, and 10.7 mg/L. What is the margin of error for a 95% confidence interval for the true chemical concentration?

<p>0.490 mg/L (D)</p> Signup and view all the answers

Which of the following is the best definition of 'margin of error' in the context of confidence intervals?

<p>The maximum likely difference between the sample estimate and the true population parameter. (D)</p> Signup and view all the answers

A researcher wants to estimate the average time students at a university spend on social media per day. They want a 99% confidence interval with a margin of error of no more than 15 minutes. Assuming a population standard deviation of 60 minutes, what is the minimum sample size needed?

<p>Approximately 107 (A)</p> Signup and view all the answers

How does increasing the confidence level affect the margin of error, assuming all other factors remain constant?

<p>It increases the margin of error, leading to lower precision. (C)</p> Signup and view all the answers

A researcher aims to achieve both high confidence and a narrow confidence interval. What is the most effective strategy to accomplish this?

<p>Increase the sample size and then choose the confidence level. (C)</p> Signup and view all the answers

In hypothesis testing, what is the primary purpose of examining sample data?

<p>To decide on the validity of the hypothesis. (D)</p> Signup and view all the answers

If the average inorganic phosphorus level of a random sample of 12 healthy elderly subjects is 1.128 mmol/L, and the population mean is known to be 1.2 mmol/L with a standard deviation of 0.1 mmol/L, what question does hypothesis testing aim to answer?

<p>Is the smaller mean phosphorus level simply due to chance variation, or is it evidence that the true mean phosphorus level in elderly individuals is lower than 1.2 mmol/L? (C)</p> Signup and view all the answers

What is the key characteristic of the null hypothesis ($H_0$)?

<p>It is a specific statement about a parameter of the population(s). (A)</p> Signup and view all the answers

How does the alternative hypothesis ($H_a$) relate to the null hypothesis ($H_0$)?

<p>It is a more general statement that complements and is mutually exclusive with the null hypothesis. (A)</p> Signup and view all the answers

In the context of phosphorus levels in the elderly, if we are testing whether the mean phosphorus level is lower than 1.2 mmol/L, which of the following correctly represents the null and alternative hypotheses?

<p>$H_0$: $µ = 1.2$ mmol/L, $H_a$: $µ &lt; 1.2$ mmol/L (B)</p> Signup and view all the answers

What distinguishes a one-sided alternative hypothesis test from a two-sided test?

<p>A one-sided test is asymmetric and specific, while a two-sided test is symmetric. (B)</p> Signup and view all the answers

Which of the following alternative hypotheses is an example of a two-sided test?

<p>$H_a$: $µ \neq 10$ (C)</p> Signup and view all the answers

What is the primary factor that determines whether to use a one-sided versus a two-sided hypothesis test?

<p>The question being asked and what is known about the problem before performing the test. (B)</p> Signup and view all the answers

Flashcards

Sampling Distribution

The distribution of a statistic (like the sample mean) calculated from many samples taken from the same population.

Parameter

A numerical value summarizing a characteristic of a population (e.g., population mean µ).

Statistic

A numerical value summarizing a characteristic of a sample (e.g., sample mean x̄ ).

Central Limit Theorem

The sampling distribution of the sample mean approaches a normal distribution as the sample size increases, regardless of the population's distribution.

Signup and view all the flashcards

Confidence Interval

A range of values, calculated from sample data, used to estimate an unknown population parameter with a certain level of confidence.

Signup and view all the flashcards

Confidence Interval (CI)

A range of values, calculated from sample data, used to estimate a population parameter, expressed as center ± margin of error or (lower bound, upper bound).

Signup and view all the flashcards

Margin of Error (m)

The extent to which the sample estimate might differ from the population parameter.

Signup and view all the flashcards

Confidence Level (C)

The probability that the method will give a correct answer.

Signup and view all the flashcards

CI for Normal population mean

When sampling from a Normal population with known standard deviation σ, a level C confidence interval for µ is calculated with this formula.

Signup and view all the flashcards

Finding z-values

Values related to a given confidence level C. Appropriate z*-value can be found on a z-table.

Signup and view all the flashcards

Hypothesis

A statement about a population parameter that we want to test.

Signup and view all the flashcards

Test of Statistical Significance

A statistical test using sample data to evaluate the validity of a hypothesis.

Signup and view all the flashcards

Null Hypothesis (H0)

A specific statement about a population parameter assumed to be true unless evidence contradicts it.

Signup and view all the flashcards

Alternative Hypothesis (Ha)

A statement that contradicts the null hypothesis, representing what the researcher is trying to find evidence for.

Signup and view all the flashcards

Two-Sided Alternative

An alternative hypothesis that specifies the parameter differs from the null hypothesis in either direction (greater or less).

Signup and view all the flashcards

One-Sided Alternative

An alternative hypothesis that specifies the parameter differs from the null hypothesis in only one direction (either greater or less).

Signup and view all the flashcards

Confidence vs Precision

Wanting high confidence, but also narrow intervals.

Signup and view all the flashcards

Higher confidence

Less precision more accuracy.

Signup and view all the flashcards

Study Notes

  • Chapter 14 introduces statistical inference

Statistical Estimation

  • Different samples from a population yield different sample means
  • Virtually none of the sample means will equal the true population mean.
  • Sampling distribution is N(μ, σ/√n) if the population is N(μ, σ)
  • Sampling distribution is ~ N(μ, σ/√n) if n is large enough.
  • One random sample of size n is taken to rely on the known properties of the sampling distribution.

Visualization of Confidence

  • Random samples taken using a computer applet allow calculation of the sample mean and an interval
  • The interval size is plus-or-minus 2 σ/√n around the mean
  • Approximately 95% of computed intervals capture the parameter μ, based on the 68-95-99.7% rule.
  • The confidence interval is a range of values with an associated probability, also called the confidence level, C
  • The probability quantifies the likelihood that the interval contains the unknown population parameter
  • Confidence C indicates the probability that μ falls within the computed interval

Margin of Error

  • A confidence interval ("CI") can be expressed as a center ± a margin of error m, resulting in μ within m.
  • It can also be expressed as an interval, with μ falling within a range from (m) to (m)
  • The confidence level C (in %) indicates the area of corresponding size C under the sampling distribution.
  • A 95% confidence interval for mean body temperature (°F) computed to be (98.1, 98.4) from a sample of 130 healthy adults means: There is 95% confidence that the population mean body temperature is a value between 98.1 and 98.4°F
  • For the same 95% confidence interval (98.1, 98.4), the margin of error is 0.15°F.

Confidence Interval

  • Confidence interval, C, when taking a random sample from a Normal population with known standard deviation σ, the level C confidence interval for μ is: σ/√n
  • σ/√n is the standard deviation of the sampling distribution
  • C is the area under the N(0,1) between -z* and z*
  • Z- and t-values (Table C) can be used
  • The appropriate z*-value for a confidence level C is listed in the same column.

Confidence Interval Calculations

  • Measurement equipment has Normal distribution with standard deviation σ = 1 million bacteria/mL of fluid
  • Three measurements made: 24, 29, and 31 million bacteria/mL
  • Mean: 28 million bacteria/ml
  • Find the 99% and 90% CI.
  • 99% confidence interval for the true density, z* = 2. 576

Confidence Level and Margin of Error

  • Confidence level C determines the value of z* (in a table)
  • The margin of error depends on z*.
  • Higher confidence C implies a larger margin of error m (less precision more accuracy)
  • Lower confidence level C yields a smaller margin of error m (more precision less accuracy)
  • High confidence and narrow intervals are desired
  • Choose the confidence level first, then increase n to achieve a narrow interval.

Hypothesis Testing

  • A claim about the unknown value of a population parameter is made
  • An analysis is performed to check if this claim is reasonable based on the evidence gathered (sample data).
  • A test of statistical significance is performed to tests a specific hypothesis
  • The sample data is used to decide on the validity of the tested hypothesis.
  • Inorganic phosphorus blood levels vary Normally among adults, with mean 1.2 and standard deviation 0.1 mmol/L
  • Average inorganic phosphorus level of a random sample of 12 healthy elderly subjects is 1.128 mmol/L
  • Question to be asked: Is this smaller mean phosphorus level due to chance variation alone, or is it evidence that the true mean phosphorus level in elderly individuals is lower than 1.2 mmol/L?

Null and Alternative Hypothesis

  • Null hypothesis, Ho, is a specific statement about a parameter of the studied population(s)
  • Alternative hypothesis, Hâ‚‚, is a more general statement that complements and is mutually exclusive with the null hypothesis.
  • Phosphorus levels Example:
  • Hâ‚€:= μ = 1.2 mmol/L
  • Hâ‚‚: μ < 1.2 mmol/L (μ is smaller due to changing physiology)

One-Sided versus Two-Sided Alternatives

  • A two-tail (two-sided) alternative is symmetric: Hâ‚‚: μ [value]
  • A one-tail (one-sided) alternative is asymmetric and specific:
  • Hâ‚‚: μ < [value]
  • Hâ‚‚: μ > [value]
  • Choice of a one-sided versus two-sided test depends on the question being asked and prior knowledge about the problem.
  • If asymmetric, Hâ‚‚ should be one-sided, otherwise, Hâ‚‚ should be two-sided.
  • Examples of hypothesis:
  • Is active ingredient concentration as stated on the label (325 mg/tablet)?
  • Hâ‚€: μ = 325 vs Hâ‚‚: μ ≠ 325
  • Is Nicotine content > written value of 1mg/cigarette on average?
  • Hâ‚€: μ = 1 vs Hâ‚‚: μ > 1
  • Does a Drug cause a change in Blood Pressure on average?
  • Hâ‚€: μ = 0 vs Hâ‚‚: μ ≠ 0
  • Does a particular stream have an unhealthy mean Oxygen content (a level below 5mg/liter)? Ecologists collect a liter of water from each of 45 locations along and find a mean of 4.62
  • Hâ‚€:=5 vs Hâ‚‚:<5

P-Value and Hypothesis Testing

  • Phosphorus levels vary Normally with standard deviation s = 0.1 mmol/L
  • Hâ‚€: μ = 1.2 mmol/L versus Hâ‚‚: μ < 1.2 mmol/L
  • Mean phosphorus level from 12 elderly subjects is 1.128 mmol/L
  • Evaluate probability of drawing a random sample with a mean as small as this one (or even smaller), if Ho is true
  • P-value: Probability (Ho was true) of obtaining a sample statistic at least as extreme (in the direction of Hâ‚‚) as the one obtained.
  • Visualizing the P-value can also be done
  • The chance of seeing a sample mean less than our observed average of 1.128 is 0.0063.

Interpreting the P-Value

  • Check for amount of random variation that alone accounts for the difference between Ho observations from a random sample
  • Small P-values are strong evidence AGAINST Ho
  • In that case we reject Ho, and the findings are statistically significant
  • P-values that are not small fail to give enough evidence against Ho
  • There is a failure to reject Ho and Ho cannot be proved

Range of P-value

  • P-values are probabilities and therefore always between 0 and 1.
  • The order of magnitude of the P-value matters more than its exact numerical value.
  • Significance level, α, is the largest P-value is tolerated for rejecting Ho
  • This value is decided arbitrarily before conducting the test.
  • When P-value ≤ α, reject Ho
  • When P-value > α, it leads to a failure to reject Ho
  • Example:
    • If industry standards require significance level α of 5% and the two-sided is performed on a sample of data where
    • P-value is 4.56%, then the results are statistically significant at significance level 0.05.
  • To test Hâ‚€: μ = μ₀ using a random sample of size n from a Normal population with known standard deviation σ, usethe null sampling distribution N(μ₀, σ/√n).
  • P-value represents the area under N(μ₀, σ/√n) for values of x at least as extreme in the direction of Hâ‚‚ as that of a random sample.
  • Calculate the z-value then consult Table B or C or use appropriate technology

P-Value Symmetry

  • Calculate the P-value for a two-sided test using the symmetry of the normal curve.
  • The P-value for a one-sided test is found and doubled for asymmetry
  • Examples on slides 33 and 34 test the hypothesis:
  • Are there statistically significant differences blood phosphorus levels between healthy adults (1.2mmol/L) and elderly adults (1.128mmol/L)?
  • P-value=0.0063
  • This shows the mean phosphorus level among the elderly is significantly less than 1.2 mmol/L

Testing Confidence Intervals

  • Because a two-sided test is symmetric, a confidence interval can easily test a two-sided hypothesis
  • In a two-sided test, C = 1 − α, where C is the confidence level and α is the significance level.

Logic of Confidence Interval

  • A 99% CI found for the true bacterial density is 26.5 to 29.5 million bacteria/mL.
  • With 99% confidence, could the population mean be μ = 25 million/mL or μ = 29?
  • A confidence interval offers a yes/no answer and an estimated range of likely estimates the range for the true population mean μ.
  • A P-value quantifies how strong the evidence is against Hâ‚€
  • But if Hâ‚€ is rejected, it provides no information about the true population mean μ.
  • This section discussed the concept of statistical inference, covering statistical estimation, margin of error, confidence levels, confidence intervals, hypothesis testing, P-values, and tests for a population mean

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team
Use Quizgecko on...
Browser
Browser