BIOSTATS 3.2 - CH. 18: COMPARING TWO MEANS

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

In the context of comparing two means, what is a key factor in determining the appropriate statistical test?

  • Whether the samples are independent or related. (correct)
  • Whether the populations are normally distributed.
  • Whether to avoid inferences on standard deviations.
  • Whether the sample sizes are large.

What does the two-sample t statistic primarily reflect?

  • The sample size of the two groups.
  • The variability within each sample.
  • The degrees of freedom.
  • The magnitude of the difference between the sample means relative to the variability within the samples. (correct)

Why is it generally recommended to avoid using pooled procedures when comparing two means?

  • They are less robust against outliers.
  • They assume equal variances, which is often difficult to verify. (correct)
  • They require equal sample sizes.
  • They are computationally complex.

What condition should be met to ensure the robustness of the two-sample t statistic?

<p>The sample sizes are equal and the sample distributions are similar. (B)</p> Signup and view all the answers

In hypothesis testing for two independent samples, what does the null hypothesis typically state?

<p>The population means are equal. (A)</p> Signup and view all the answers

In a two-sample t-test, what does the degrees of freedom (df) influence?

<p>The shape of the <em>t</em>-distribution and the critical value. (D)</p> Signup and view all the answers

How are matched pairs samples defined?

<p>Samples where individuals in both samples are related or assessed twice. (C)</p> Signup and view all the answers

For two independent samples, one with $n_1 = 10$ and the other with $n_2 = 15$, what is a reasonable guideline for ensuring the robustness of the two-sample t-test?

<p>Ensure both sample distributions have similar shapes and no strong outliers. (B)</p> Signup and view all the answers

When performing a two-sample t-test, what is the primary purpose of calculating the p-value?

<p>To assess the probability of observing the data (or more extreme data) if the null hypothesis is true. (B)</p> Signup and view all the answers

In the context of hypothesis testing, failing to reject the null hypothesis indicates:

<p>Insufficient evidence to reject the null hypothesis. (A)</p> Signup and view all the answers

Researchers are comparing the effectiveness of two different drugs on reducing blood pressure. They measure the blood pressure of patients before and after treatment with each drug. Which statistical approach is most appropriate?

<p>Paired <em>t</em>-test. (B)</p> Signup and view all the answers

How does the confidence interval approach relate to hypothesis testing?

<p>If the null hypothesis value falls within the confidence interval, we fail to reject the null hypothesis. (A)</p> Signup and view all the answers

A study compares the average test scores of students taught by two different methods. The output from a statistical software package shows a p-value of 0.04 for a two-sample t-test. Assuming a significance level of $\alpha = 0.05$, what conclusion can be drawn?

<p>The null hypothesis should be rejected. (A)</p> Signup and view all the answers

In SPSS, most p-values are calculated assuming what type of hypothesis?

<p>Two-sided hypothesis. (A)</p> Signup and view all the answers

If you want the p-value for a one-sided test, and you are using SPSS, what should you do?

<p>Divide the p-value by 2. (D)</p> Signup and view all the answers

When conducting a two-sample t-test to compare the means of two groups, what is the key assumption regarding the data?

<p>A and D (A)</p> Signup and view all the answers

What key information does Levene's test provide when you are comparing the means of two groups?

<p>Whether to use equal variance or unequal variance. (C)</p> Signup and view all the answers

What is the primary use of a paired sample t-test?

<p>To compare the means of two related groups. (D)</p> Signup and view all the answers

Why is important to use Paired Sample T-test on dependent groups?

<p>A T-test assumes all data are independent and makes it non accurate for dependent groups. (C)</p> Signup and view all the answers

Calculate a 95% confidence intervals using: $\overline{X_1}=10$, $\overline{X_2}=8$ t_a(2),v=2.2, $S_{\overline{X_1}-\overline{X_2}} = .5$

<p>6.9, 9.1 (B)</p> Signup and view all the answers

Flashcards

Independent Samples

Samples where individuals are chosen separately from each group.

Matched Pairs Samples

Samples where individuals are related (e.g., twins).

Two-Sample t-statistic

The statistic used for comparing two independent sample means, following a t-distribution.

Two-Sample t-Test

A test used to determine if there is a significant difference between the means of two independent groups.

Signup and view all the flashcards

P-value

The probability of observing a test statistic as extreme as, or more extreme than, the one computed, assuming the null hypothesis is true.

Signup and view all the flashcards

Confidence Interval

A range of values likely to contain the true difference between two population means.

Signup and view all the flashcards

Robustness

The extent to which a statistical test produces accurate results even when assumptions are not perfectly met.

Signup and view all the flashcards

Pooled t-Procedures

A two-sample t procedure that assumes equal variance and thus is not used.

Signup and view all the flashcards

Equal Variance

In SPSS: Look at Levene's Test. If the p-value is greater than alpha, fail to reject Ho which is equal variances.

Signup and view all the flashcards

Paired Sample t-test

A method for comparing two means where data points are related or paired.

Signup and view all the flashcards

Study Notes

  • Chapter focuses on comparing two means from different samples.

Learning Objectives

  • Differentiate between two-sample situations.
  • Understand and apply two-sample t procedures.
  • Assess the robustness of these procedures.
  • Know when to avoid pooled procedures.
  • Avoid making inferences on standard deviations.

Two Sample Situations

  • Comparison of two treatments or conditions may be needed.
  • Determination of sample independence is important.
  • Independent samples involve individuals chosen separately in both samples.
  • Matched pairs samples involve related individuals, like subjects assessed twice or siblings.

T Distribution for Two Independent Samples

  • Two independent Simple Random Samples, or SRSs, come from two populations with unknown (μ₁, σ₁) and (μ₂, σ₂).
  • (X₁, s₁) and (X₂, s₂) are used to estimate (μ₁, σ₁) and (μ₂, σ₂) respectively.
  • In theory, both populations should be normally distributed.
  • In practice, both distributions should have similar shapes without strong outliers for the t procedures to be valid.

T Statistic

  • The two-sample t statistic follows approximately a t distribution.
  • Standard error (SE) in the denominator reflects variation from both samples.
  • Degrees of freedom are computed with a complex formula, often done by statistical software.

Two-Sample T-Test

  • For two independent random samples, the goal is to test the null hypothesis.
  • Null hypothesis is: H₀: μ₁ = μ₂ ↔ μ₁ – μ₂ = 0.
  • Alternate hypothesis can be one-sided or two-sided.
  • Compute the t statistic and appropriate degrees of freedom (df).
  • Obtain and interpret the P-value based on the alternate hypothesis.

Lung Capacity and Parental Smoking Example

  • Forced Vital Capacity, or FVC, is the volume of air exhaled in 6 seconds, measured in milliliters.
  • FVC was obtained from samples of children exposed and not exposed to parental smoking.
  • In a sample of children exposed to parental smoking the mean FVC was 75.5 with a standard deviation of 9.3 from a sample size of 30.
  • In a sample of children not exposed to parental smoking the mean FVC was 88.2 with a standard deviation of 15.1 from a sample size of 30.
  • A two-sample t test was used to assess the impact of parental smoking on child lung capacity.
  • The t statistic for the lung capacity test example is equal to -3.92
  • The null hypothesis was rejected as the P value was less than 0.0005, based off a df of 40
  • Lung capacity is significantly impaired in children exposed to parental smoking compared to those not exposed.

Hypothesis

  • Geckos have specialized toe pads for climbing slick surfaces.
  • Male mean toe pad area in Tokay geckos is 6.0 cm², and female is 5.3 cm².
  • Need to formulate the null hypothesis for testing if male and female geckos differ significantly in toe pad size

Two Sample Confidence Interval

  • Uses the difference between sample averages (X1 - X2) to estimate (μ₁ - μ₂).
  • C is the area between -t* and t*.
  • Find t* in Table C for the computed degrees of freedom.
  • The margin of error m is calculated as m = t*√(s₁²/n₁ + s₂²/n₂) = t*SE.

Pesticide and Seedling Growth Example

  • Seeds planted in pots with pesticide-treated soil vs. untreated soil.
  • Seedling growth (in mm) recorded after 2 weeks.
  • The degrees of freedom (df) are calculated to be 37.86.
  • A 95% confidence interval for (µ₁ – µ₂) is (X1 - X2) ± t*√(s₁²/n₁ + s₂²/n₂).
  • Using df = 30 from Table C, resulting in (51.48 – 41.52) ± 2.042√(11.01²/21 + 17.15²/23) equating to 9.96 ± 8.80 mm.

Software Examples

  • For df = 38, m is approximately 2.024 * 4.31 ≈ 8.72.
  • Software outputs provide values for t Statistic, P (T<=t) one-tail, t Critical one-tail, P (T<=t) two-tail, and t Critical two-tail.
  • A 95% level of confidence using pesticide yields seeds that are 1.2 to 18.7 mm longer on average after 2 weeks.

Robustness

  • Two-sample statistic is most robust with equal sample sizes and similar sample distributions.
  • Two-sample tests remain quite robust even when deviating from these conditions.
  • A combined sample size (n₁ + n₂) of 40 or more allows working even with the most skewed distributions.

Pooled Two-Sample T Procedures

  • Two versions of two-sample t procedures: one assuming equal variance ("pooled") and one not assuming equal variance for the two populations.
  • Pooled two-sample t test has degrees of freedom n₁ + n₂ – 2.

Inference and Standard Deviations

  • The pooled t test assumes equal variances.
  • Procedures exist to test whether two population variances are equal, such as F procedures.
  • Avoid using F procedures without proper guidance due to a lack of robustness.

T Procedures Overview

  • One-sample t procedure summarizes one sample by its mean X and standard deviation s to make inferences about population parameter μ when σ is unknown.
  • Matched Pairs t Procedure summarizes two paired data sets (from a matched pairs design) from n pairwise differences ( ) to make inferences about population parameter
  • Two-Sample t Procedure summarizes two independent samples (unrelated individuals) based on with ( to make inferences about -

Procedure Selection

  • Blood pressure change with oral contraceptives: Compare women using/not using OC, a two-sample or two-sample t procedure.
  • Cholesterol level in adults with "high cholesterol" parents: Average cholesterol level in general adult population is 175 mg/dL.
  • Compare a sample of adults with "high cholesterol" parents, a one-sample t procedure.
  • Take a sample of bread loaves, and compare the vitamin content right after baking and again 3 days later with a matched pairs sample t procedure.
  • Bread loaves just baked compared to bread loaves stored for 3 days: Compare vitamin content, a two-sample t procedure.

Paired Sample T-Test

  • The test deals with the previous two sample t-test with independent samples.
  • Each datum in one sample is in no way associated with any specific datum in the other sample.
  • Test when the test has an observation that can be correlated in some way to an observation in another sample - dependence.
  • Can have dependence because the test looks at individual at repeated time points, at the same individual but different locations, or in case of a patient study when patients are paired based on age, gender, and other factors to get a better control for your study.
  • The assumption is that the differences between the samples come from a normally distributed population, instead of the individual values themselves.
  • Since the regular two sample t-test assumes all data are independent, a new test statistic is needed for dealing with dependent samples.

Deer Example

  • Null hypothesis: the left foreleg and left hindleg lengths of deer are equal.
  • Testing would entail the measurement on a number of deer involving variation among the data owing to the null hypothesis being false and the leg lengths differ.
  • There would also be variation as deer are of different sizes, and a deer with longer front leg is likely to have a longer hindleg.
  • Degrees of freedom are calculated as t = d/SEM = 3.3/0.97 = 3.402 vs to.05(2),9 = 2.262.
  • Hypothesis is rejected, that p – value = .008.

Confidence Intervals

  • Probability is that some interval around the mean or instance of interest actually contains the mean.
  • Contains an interval with a lower and upper value.
  • Formula:
  • Can be used to determine whether to reject Ho.
  • With the confidence interval, If 0 is present then Ho is rejected.

Example 1 Continued

  • Confidence intervals illustrate this concept.
  • Null hypothesis is rejected, since 0 does not fall within the CI.

P-Values in SPSS

  • Most p-values are calculated assuming a two-sided hypothesis.
  • The p-value is found in the Sig (2-tailed) column.
  • Divide the obtained output by 2 to get the p-value for a one-sided test.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Independent Samples T-Test
21 questions

Independent Samples T-Test

LargeCapacityAntigorite4770 avatar
LargeCapacityAntigorite4770
Independent Samples T-Test Explained
5 questions

Independent Samples T-Test Explained

LargeCapacityAntigorite4770 avatar
LargeCapacityAntigorite4770
Independent Samples T-Test
5 questions

Independent Samples T-Test

LargeCapacityAntigorite4770 avatar
LargeCapacityAntigorite4770
Use Quizgecko on...
Browser
Browser