Statistics II: Normal Distribution and Sampling

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What does a 95% confidence interval of a particular quantity, estimated from a sample, indicate?

  • The range of possible values for a population parameter in which one can be 95% confident. (correct)
  • The range within which the sample mean is expected to fall 95% of the time.
  • The range of possible values for the sample statistic with 95% certainty.
  • The range of values for which the null hypothesis cannot be rejected at a significance level of 0.05.

In hypothesis testing, what is the purpose of the null hypothesis?

  • To assume there is no effect or no difference, which the test attempts to disprove. (correct)
  • To prove that there is a significant difference between the sample and the population.
  • To determine the exact value of the population parameter.
  • To establish the researcher's belief about the population parameter.

What does a small p-value (e.g., p < 0.05) typically indicate in hypothesis testing?

  • The null hypothesis is true.
  • The sample size is too small.
  • Strong evidence against the null hypothesis. (correct)
  • Strong evidence in favor of the null hypothesis.

Which statement best describes the relationship between sample size and confidence interval width?

<p>As sample size increases, confidence interval width decreases. (A)</p> Signup and view all the answers

Under what condition is a paired t-test most appropriate?

<p>When comparing the means of two related groups, where each subject is measured twice. (C)</p> Signup and view all the answers

If the p-value in a hypothesis test is 0.03, what can you conclude if using a significance level of 0.05?

<p>Reject the null hypothesis. (A)</p> Signup and view all the answers

What is the primary difference between a one-tailed and a two-tailed hypothesis test?

<p>A one-tailed test assesses the possibility of an effect in one direction, while a two-tailed test assesses the possibility of an effect in both directions. (C)</p> Signup and view all the answers

In the context of statistical hypothesis testing, what does 'statistical significance' indicate?

<p>The observed effect is likely not due to chance alone. (C)</p> Signup and view all the answers

If researchers are comparing the effect of a new drug to a placebo on blood pressure, and they use a t-test, what assumption must be met regarding the populations from which the samples are drawn?

<p>The populations must be normally distributed. (A)</p> Signup and view all the answers

What is the formula for Standard Error of the Mean (SEM)?

<p>$SEM = \frac{SD}{\sqrt{n}}$ (B)</p> Signup and view all the answers

A study compares glucose levels in students who drink diet coke versus regular coke. The null hypothesis is: μ(diet coke) = μ(coke). The alternative hypothesis is: μ(diet coke) ≠ μ(coke). An unpaired t-test yields a p-value of 0.0044. If the significance level is set at 0.05, what conclusion can be drawn?

<p>Reject the null hypothesis; there is a statistically significant difference in mean glucose levels. (A)</p> Signup and view all the answers

In a paired t-test examining the effect of a new diet on weight loss, the mean difference in weight before and after the diet is found to be statistically significant. However, the 95% confidence interval for the mean difference includes zero. What is the most appropriate interpretation of these findings?

<p>The results are inconclusive; the statistical significance and confidence interval contradict each other, so more data is needed. (B)</p> Signup and view all the answers

A researcher conducts a series of experiments and consistently obtains p-values between 0.05 and 0.10. Despite these results not reaching the conventional significance level of 0.05, the researcher decides to combine the data from all experiments in a meta-analysis. What would be the most concerning potential issue with this approach?

<p>The meta-analysis might amplify any biases or flaws present in the individual studies, leading to a misleading overall conclusion. (B)</p> Signup and view all the answers

A study examines the relationship between exercise and blood pressure. The researchers initially plan to use a two-tailed t-test to compare the blood pressure of those who exercise regularly versus those who do not. However, after reviewing prior literature, they find overwhelmingly strong evidence suggesting that exercise reduces blood pressure. Under what specific condition would it be MOST justifiable to switch to a one-tailed t-test?

<p>If the researchers have a strong, pre-existing conviction, based on solid evidence, that exercise <em>only</em> lowers blood pressure, and they are not concerned with detecting a potential increase. (A)</p> Signup and view all the answers

A highly skilled statistician is reviewing a research paper and notices that the authors have reported a statistically significant result using an independent samples t-test. However, the Levene's test for equality of variances was significant (p < 0.05), and the authors did not adjust their t-test or report using Welch's t-test. What is the MOST critical concern regarding the validity of the authors' conclusion?

<p>The statistically significant result may be a Type I error (false positive) because the assumption of equal variances was violated, potentially invalidating the t-test. (D)</p> Signup and view all the answers

Flashcards

95% Confidence Interval

Shows the possible values for a quantity, estimated from a sample, with a 95% confidence level that the population value lies within it.

Confidence Interval (CI)

A range of values likely to contain the population mean, offering a degree of certainty.

Standard Error (SEM)

Estimates the standard error of a statistic. Calculated by standard deviation divided by square root of sample size.

Hypothesis testing

A test to determine if sample-to-sample variations plausibly explain the results of a research study.

Signup and view all the flashcards

Null hypothesis (H0)

The initial assumption in hypothesis testing that there is no difference between the means of two populations.

Signup and view all the flashcards

Alternative hypothesis (H1)

The hypothesis that suggests there is a difference between the means of two populations.

Signup and view all the flashcards

P-value

The probability of observing the results (or more extreme) if the null hypothesis is true.

Signup and view all the flashcards

Statistical Significance

A test providing evidence of a difference between the groups being compared.

Signup and view all the flashcards

Significance level (alpha)

The probability (alpha) below which the p-value is considered statistically significant (typically 0.05 or 5%).

Signup and view all the flashcards

One-tailed test

There is a difference between the means of two populations, but it is known which group is larger.

Signup and view all the flashcards

Two-tailed test

There is a difference between the means of two populations, but it is not known which group is larger.

Signup and view all the flashcards

Student's t-test

Compares the means of two independent groups using a t-statistic and its t-distribution.

Signup and view all the flashcards

Unpaired t-test

An unpaired t-test compares the means of two independent samples to determine if they are statistically significant.

Signup and view all the flashcards

Paired t-test

Test that measures each subject twice (once under each condition) and calculates the difference. Requirements: Observations are independent (different pairs)

Signup and view all the flashcards

Study Notes

  • Statistics II

Normal Distribution

  • The normal distribution shows percentages within standard deviations from the mean.
  • 68.2% of data falls within one standard deviation (1σ) of the mean (μ).
  • 95.4% of data falls within two standard deviations (2σ) of the mean.
  • 99.7% of data falls within three standard deviations (3σ) of the mean.

Statistical Concept: Sampling

  • Parameters in a population (μ, σ2, σ) are estimated using statistics (xÌ„, s2, s) from a sample.

Variation in Between Sample Means

  • Determining how well sample values estimate underlying population values is a key issue
  • Confidence Interval (CI) is a range of values likely to contain the population mean with some degree of certainty: CI = [XÌ„ ± T × SEM]
  • SEM (Standard Error) = SD/√n, where SD is the standard deviation and n is the sample size.
  • T-value comes from the t-distribution and depends on sample number -1 (degrees of freedom or df) and confidence level (95%), this will be provided in the exam
  • A 95% Confidence Interval means that can be 95% confident that the population mean falls within this interval.

Confidence Intervals

  • Confidence intervals are useful for means and differences in means (e.g., change in blood sugar levels after treatment).
  • A 95% confidence interval, estimated from a sample, shows the possible values of population with 95% confidence.
  • This is valid if the sample is representative of the population and the study is unbiased.
  • Repeating an experiment many times and constructing many CIs, expect the true population value to lie in 95% of these intervals.
  • A confidence interval is a standard way to show how accurately sample means reflect population means.
  • Narrower confidence intervals indicate better estimation of population parameters by sample means.

Comparing Two Groups

  • Focus on one value: the average change.

Hypothesis Testing

  • Core question: can observed differences be simply due to chance?
  • Hypothesis Testing addresses this core question
  • To answer this question, determine the probability of getting such a difference, assuming no difference exists: p-value.
  • A scientific/medical question addresses whether the difference is biologically important.
  • Bioscience research size of scientifically important difference is often not known, but the confidence interval is important.
  • The general goal of hypothesis testing is to understand if sample to sample variations are plausible from a research study.
  • Hypothesis testing can help determine whether a specific treatment affects individuals in a population.

The Steps of Hypothesis Testing

  • Assume there is no difference (null hypothesis, H0) or that both samples belong to the same population.
  • Consider samples from different population(alternative hypothesis, H1)
  • Determine the p-value, or the probability of seeing the observed difference
  • Accept H0 and reject H1 if there is no difference
  • Reject H0 and accept H1 if there is a difference.

Statistical Significance

  • A statistical test is significant with evidence of a group difference.
  • Standard definition of "significant": p-value is less than significance level or α (0.05 or 5%).
  • Researchers must state the hypothesis precisely.
  • The null hypothesis (H0): no difference between population means, groups are the same, treatment had no effect.
  • The alternative hypothesis (H1): difference between population means, implying treatment had an effect.
  • The hypothesis test measures the probability of observed results if H0 is true.
  • Smaller p-value indicates greater evidence against H0.
  • P <0.05 is typically considered a significant result, so there is sufficient evidence to reject H0 and accept H1.
  • A confidence interval should be included, but it is often omitted.

Student's t-test

  • Gossett t-statistic and t-distribution compare 2 group means
  • Suitable for comparing 2 sample means.
  • Calculate if no difference is obtained
  • Normal distribution is typically required, but can tolerate skewness in larger sample size.
  • Types of Student's t-test
    • Unpaired t-test: independent samples, two-tailed or one-tailed
    • Paired t-test: related samples, two-tailed or one-tailed

Unpaired t-tests

  • Compare 2 independent populations, by determining if there is a difference in the means of two groups
  • Compare the means of the two groups after measuring each subject once.
  • Example: blood pressure on different diets.

Unpaired t test requirements

  • Independent observations are each dataset has different set of individuals.
  • Drawn at random from normally distributed populations.
  • Assume equal variance (corrected by Welch's Test if unequal).
  • When comparing two groups in a glucose experiment, each student should only be in one group, making it an unpaired test

Using a t-test

  • Are the groups different: two-tailed?
  • Unpaired: each student in one group only?
  • Null Hypothesis (H0): no difference between means at 30 minutes: μ (diet coke) = μ (coke).
  • Alternative Hypothesis (H1): difference between means at 30 minutes: μ (diet coke) ≠ μ (coke) .

Unpaired t-test example

  • Start by taking a significant level of 0.05(5%)
  • Calculate that p = 0.0044
  • Reject the null hypothesis since p-value <0.05
  • There is statistically significant difference at 5% between mean blood sugar level with diet coke and regular coke.
  • A low probability makes an effect unlikely, repeat in other groups and conditions if the sample size is small

Paired t-tests requirements

  • It will be known if there is a difference between the means under two conditions.
  • Differences are measured twice and should be independent

Paired t-test example

  • Each student measured at both time points used to comparing glucose levels at 0 and 120 in the diet coke group.
  • Test if sugar levels between base and 30 min statistically differ, if drinking diet coke raises blood sugar levels > 30 minutes

Paired t-test statistical analysis

  • H0: is mean difference is zero, μα= 0
  • H1: mean difference is not zero, μα ≠ 0
  • Mean difference = - 0.27 mmol/L
  • SD (differences) = 0.63 mmol/L
  • SEM = 0.26 mmol/L

Paired t-test Conclusion?

  • The test is a result of p=0.34
  • Cannot reject the null hypothesis since p>0.05
  • Accept the null hypothesis
  • The mean difference is not significantly different above 0 in glucose above 30 min from drinking diet coke.
  • Diet coke affects blood glucose levels above 30 minutes

One-tailed vs Two-tailed tests

  • Two-tailed are in either direction
  • One-tailed are in only one direction.
  • A two-tailed test is better due to correspond confidence interval which is two tailed and always does
  • Only use One-tailed in the following - exceptional circumstances or strong direction

Interpreting p-values

  • p-value does not equal the chance of being true
  • P says obtaining/greater difference is that if difference does not exist
  • If p= 0.02, there is 2% chance of obtaining such or greater difference if there were no difference (i.e. under H0)

Summary

  • Confidence intervals show how an accurate estimate might be
  • Setup the null and alternative
  • Prefer two tail tests
  • Tests are P<0.05, remember the P value may not be enough and look for supporting evidence
  • Small studies can important differences
  • A larger study needed research

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser