T3 Power & Mid Sem Exam Revision (PSYC2010)

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

Which of the following is the most accurate definition of 'power' in statistical hypothesis testing?

  • The probability of making a Type I error.
  • The probability of correctly rejecting a false null hypothesis. (correct)
  • The probability of failing to reject a false null hypothesis.
  • The probability of rejecting a true null hypothesis.

How does increasing the alpha level (e.g., from 0.05 to 0.10) typically affect the power of a statistical test, assuming all other factors remain constant?

  • The effect on power depends on the specific test being used.
  • It decreases power because it makes the test more conservative.
  • It increases power because it enlarges the rejection region. (correct)
  • It has no effect on power; alpha is unrelated to power.

Assuming all other factors are held constant, how does decreasing the population variance generally affect the power of a statistical test?

  • The impact depends on the type of test being conducted.
  • Decreasing variance increases power as it sharpens the precision of the sample estimate. (correct)
  • Variance has no direct impact on power.
  • Decreasing variance decreases power as it reduces the effect size.

In the context of statistical power, what does 'effect size' primarily indicate?

<p>The practical significance or magnitude of the observed effect. (A)</p> Signup and view all the answers

In hypothesis testing, increasing sample size while holding other factors constant typically leads to:

<p>An increase in statistical power. (B)</p> Signup and view all the answers

What is the primary purpose of calculating Cohen's $d$ in statistical analysis?

<p>To measure the effect size or the standardized difference between two means. (B)</p> Signup and view all the answers

Which of the following best describes the purpose of a power analysis before conducting a study?

<p>To estimate the sample size needed to achieve a desired level of statistical power. (D)</p> Signup and view all the answers

In the context of hypothesis testing, a Type II error is:

<p>Failing to reject a false null hypothesis. (B)</p> Signup and view all the answers

How does the noncentrality parameter ($\delta$) relate to the statistical power of a test?

<p>Higher values of $\delta$ are always associated with greater statistical power. (B)</p> Signup and view all the answers

In an independent groups t-test, if the sample sizes of both groups increase, the power of the test will typically:

<p>Increase. (A)</p> Signup and view all the answers

A Type I error, also known as a false positive, is the incorrect rejection of a ______ null hypothesis.

<p>true</p> Signup and view all the answers

Decreasing the measurement error in a study generally decreases the statistical power, all else being equal.

<p>False (B)</p> Signup and view all the answers

Explain why, in a repeated measures design, controlling individual differences often leads to greater statistical power compared to an independent groups design.

<p>By using the same participants across all conditions, repeated measures designs eliminate between-subject variability, making the design more sensitive to detecting within-subject effects, thus increasing power.</p> Signup and view all the answers

In a scenario where you are comparing two independent means and you have a medium effect size (d=0.5), what is the implication for the required sample size to achieve a desired level of power, compared to a small effect size (d=0.2)?

<p>The required sample size will be smaller. (C)</p> Signup and view all the answers

According to Cohen's conventions, which of the following Cohen's d values represents a large effect size?

<p>0.9 (D)</p> Signup and view all the answers

If a researcher sets the alpha level to 0.01 instead of the typical 0.05, what happens to power, assuming other parameters stay the same?

<p>Power decreases because the criterion for significance is more stringent. (D)</p> Signup and view all the answers

When calculating power for an independent samples t-test, 'n' in the formula $ \delta = d \sqrt{\frac{n}{2}}$ always refers to the total sample size across both groups.

<p>False (B)</p> Signup and view all the answers

Briefly explain why 'rounding down' the noncentrality parameter ($\delta$) when consulting a power table is generally recommended.

<p>Rounding down provides a conservative estimate of power, avoiding overestimation and ensuring that the study is adequately powered.</p> Signup and view all the answers

In cases where you have variance instead of standard deviation, you can calculate standard deviation by taking the ______ root of the variance.

<p>square</p> Signup and view all the answers

Match the following statistical concepts with their corresponding definitions:

<p>Power = The probability of correctly rejecting a false null hypothesis. Type I error = Rejecting a true null hypothesis (false positive). Type II error = Failing to reject a false null hypothesis (false negative). Effect size = A measure of the strength of a phenomenon, independent of sample size.</p> Signup and view all the answers

How does increasing the effect size influence the required sample size needed to achieve a power of 0.80?

<p>An increase in the effect size leads to smaller required sample sizes. (D)</p> Signup and view all the answers

In a study comparing two independent groups, a researcher finds that increasing the sample size per group from 20 to 80 substantially increases the observed power. What is the primary reason for this increase in power?

<p>The standard error decreases, making it easier to detect true differences. (C)</p> Signup and view all the answers

If a statistical test is underpowered, it is more likely to commit a Type I error.

<p>False (B)</p> Signup and view all the answers

When consulting a power table, if the calculated statistical effect ($\delta$) falls exactly between two values listed in the table, it is conventional to ______ down to the lower value to obtain a conservative estimate of power.

<p>round</p> Signup and view all the answers

How does using a repeated measures design, when appropriate, impact the required sample size needed to obtain a certain level of statistical power, compared to an independent groups design?

<p>Repeated measures designs reduce the required sample by controlling for individual differences. They are more sensitive.</p> Signup and view all the answers

Which of the following is NOT a way to increase power in a statistical test?

<p>Increase the degrees of freedom. (A)</p> Signup and view all the answers

How does increasing the sample size affect Type II error rate?

<p>Decreases Type II error rate. (A)</p> Signup and view all the answers

Power calculations are performed using the formula: $Power = 1− \beta$, where $ \beta$ = the probability of being convicted of murder.

<p>False (B)</p> Signup and view all the answers

Cohen defined a small effect size as d = .2. What is the primary problem with small effect sizes in experiments?

<p>Small effect sizes are harder to detect because they may not be meaningful. Small samples also have difficulty detecting them.</p> Signup and view all the answers

Match the following phrases:

<p>Increase Sample Size = A larger number to work with Raise the Alpha Level = Lower your standards Low Variance = When score are more stable Increasing the Effect Size = When groups behave more different than one another</p> Signup and view all the answers

Why should you use the t-test instead of the z-test?

<p>The deviation is unknown (B)</p> Signup and view all the answers

Under the null hypothesis assumed a small effect size; what is assumed under the H1, alternative hypothesis?

<p>We assume no difference so the distributions are overlapping entirely (B)</p> Signup and view all the answers

Sample size has no impact on the power of the statistical test.

<p>False (B)</p> Signup and view all the answers

A researcher intends to use an independent samples t-test with samples: 9 old participant and 5 young participants. What is the degrees of freedom?

<p>12</p> Signup and view all the answers

Match each design type to its key characteristic:

<p>Repeated measures design = Each participant undergoes all conditions Independent groups design = Different participants are used for each of the conditions</p> Signup and view all the answers

How can you estimate the sample size needed using power?

<p>Estimate sample by using power table tools (D)</p> Signup and view all the answers

You want to measure change over time and need to increase sensitivity with fewer participants. How can you increase this?

<p>Matched Pairs / Repeated Measures Design (D)</p> Signup and view all the answers

It's fine to always check the means when interpreting any differences.

<p>True (A)</p> Signup and view all the answers

In hypothesis testing, which of the following statements is true regarding the alternative hypothesis ($H_1$)?

<p>It specifies the values that the researcher believes to be true. (D)</p> Signup and view all the answers

Increasing the alpha level (e.g., from 0.05 to 0.10) reduces the probability of making a Type I error.

<p>False (B)</p> Signup and view all the answers

Explain how increasing sample size typically impacts statistical power.

<p>Increasing the sample size typically increases statistical power, leading to a higher chance of detecting a real effect if it exists.</p> Signup and view all the answers

To compute the number of participants needed for a power of 83% in a single-sample t-test, you need to find the ______ using the power table on page 126.

<p>statistical effect</p> Signup and view all the answers

Match each statistical concept with its appropriate definition:

<p>Effect Size (Cohen's d) = A standardized measure of the magnitude of a treatment effect, independent of sample size. Statistical Power = The probability of correctly rejecting a false null hypothesis. Alpha Level (α) = The probability of making a Type I error, or a false positive, set by the researcher. Noncentrality Parameter (δ) = A measure combining effect size and sample size to quantify the separation between null and alternative distributions.</p> Signup and view all the answers

Flashcards

What is Power?

The probability of finding a significant effect if one exists.

Type I error (alpha)

A "false alarm" or "false positive," the probability of incorrectly rejecting the null hypothesis when it is true. Usually set at .05.

Type II error (beta)

A "miss" or "false negative," The probability of incorrectly accepting the null hypothesis when it is false.

Power (1 - beta)

The probability of correctly rejecting a false null hypothesis.

Signup and view all the flashcards

Ways to Increase Power

Increasing the a level, Increase the effect size, Reduce variance, Increase the sample size.

Signup and view all the flashcards

Effect Size (d)

How many standard deviations apart the distribution of scores are under Ho and H₁.

Signup and view all the flashcards

Statistical effect (δ)

Combines effect size and sample size; how many standard errors apart the distribution of scores are under Ho and H₁.

Signup and view all the flashcards

Effect Size (d) value determination

Estimated based on past research or convention (Cohen's standards)

Signup and view all the flashcards

Cohen's d values

Small (d=0.20), Medium (d=0.50), Large (d=0.80)

Signup and view all the flashcards

What is Example 1b (p. 35 of the workbook)

Compute the number of participants required for a power of 83%.

Signup and view all the flashcards

What is Determine the effect size (d)

As with a single sample t-test, determine the effect size (d) independent of sample size.

Signup and view all the flashcards

Type II Error

An error where you conclude there is no effect when there actually is one.

Signup and view all the flashcards

What is 8?

Calculating the statistical effect (8)

Signup and view all the flashcards

What is N?

Calculating sample size

Signup and view all the flashcards

What is H₀?

The assumption of no difference between groups is made under the null hypothesis.

Signup and view all the flashcards

Sample size increase?

What happens to sample size when power is increased?

Signup and view all the flashcards

The likelihood is false...

What is beta?

Signup and view all the flashcards

What are Matched Pairs

Is used when each participant does both conditions

Signup and view all the flashcards

What is lower your...

Alpha - raises the level increases the error

Signup and view all the flashcards

Influence power

Alpha level(a), Sample size, Population variance

Signup and view all the flashcards

Differences

Always check the means.

Signup and view all the flashcards

Top graph

A small theft, where guilty people try hard to hide what they did

Signup and view all the flashcards

SD is unknown

Use the t-test that is for smaller samples or the population

Signup and view all the flashcards

Study Notes

  • Psychological Research Methodology II, Tutorial 3

Mid-Semester Quiz Details

  • Opens at 9am on Monday, March 31st and closes at 5pm on the same day
  • The quiz is online and open-book
  • Once started, the quiz auto-submits at 5pm
  • There is a 90-minute time limit to complete the quiz once it has started
  • The exam auto-submits once the timer runs out
  • Covers material from the first four lectures and first three tutorials
  • The quiz is worth 20% of the final grade
  • The format is multiple choice
  • Some questions require calculations, while others are conceptual
  • There are 14 single-point questions
  • There are 4 one and a half-point calculation questions
  • Each calculation question has 3 parts, each worth half a mark

Quiz Preparation and Resources

  • A practice quiz is available on Blackboard to help prepare
  • Tutors offer consult hours to assist with preparation
  • Formulae, z-, t-, and power tables from the tutorial workbook should be used for calculations
  • Use the formulae taught in this course and avoid using computational formulae learned elsewhere
  • Retain 3 decimal places at every step during calculations
  • Answers must be your own work (do not collude)
  • Questions are randomised between students during the quiz
  • Consulting with the lecturer or tutors regarding quiz questions during the quiz is not permitted
  • Review lecture slides, tutorial slides, the workbook, and the textbook
  • Non-PSYC2010 internet resources should be used with caution

Emergency Contact

  • For emergency situations preventing completion of the quiz, submit an application for a deferred quiz with documentation to [email protected]
  • A form is available on Blackboard for this purpose
  • This email also handles applications for assignment extensions later in the semester

Pooling Variance Example

  • Study to investigate the impact of a lead smelter on lead levels in children's blood
  • Comparing 10 children near the smelter to 7 children in an unpolluted area
  • Children near the smelter show an average lead level of 18.40 (SD = 3.17) mg/100ml
  • Children in the unpolluted area show an average lead level of 12.14 (SD = 3.18) mg/100ml
  • Formula calculates the pooled variance in a different way: Sp2 = ((N1-1)s12 + (N2-1)s22) / (N1 + N2 - 2)
  • Based on previous figures the numbers input accordingly
  • Calculating the pooled variance yields a value of 10.074

Further calculations on the same variance

  • The standard error of the difference between means can be calculated as 1.565
  • Calculate t using the formula: t = (X1 - X2) / sX1-X2
  • In this blood level example t = 4.000
  • Degrees of freedom are calculated as such: df = 10 + 7-2 = 15
  • Making a statistical decision can be done by comparing the absolute calculated t against the t critical value: |tobt(15) = 4.000| > |tcrit(15) = 2.131|
  • Reject null hypothesis
  • An independent-groups t-test revealed that blood lead levels were significantly higher in kids living closer to the smelter (M = 18.4)
  • The levels are higher than the children living in the unpolluted region (M =12.1), t(15) = 4.00, p < .05

Power definition

  • Power: Probability of finding a significant effect if one exists

Type I Error

  • Type I error (α): A "false alarm" or "false positive."
  • Alpha is usually set at .05.
  • It represents the probability of incorrectly rejecting Ho when Ho is actually true.
  • Example: The probability of finding a significant difference when no reliable difference exists.
  • Can only occur when Ho is true.

Type II Error

  • Type II error (β): A “miss” or “false negative."
  • Probability of incorrectly accepting Ho when it is false.
  • Probability of NOT finding a significant difference when a reliable difference does exist.
  • Can only occur when Ho is false.

Power Continued

  • Power is (1 - β)
  • Probability of correctly rejecting a false null hypothesis; power assumes H₁ is true

How Power Influences Decisions

  • Decision: Accept Ho, Reality: No difference is a correct choice and a true negative
  • Decision: Reject Ho, Reality: No difference is an error and a false positive called type I
  • Decision: Accept Ho, Reality: Difference exists is an error, and a false negative called type II
  • Decision: Reject Ho, Reality: Difference exists, and a correct rejection

Power Illustration Notes

  • The left curve represents innocent people (null)
  • The right illustrates the guilty people (alternative)
  • The small areas on both ends of the first, innocent, curve are alpha by 2
  • This is the chance of wrongly accusing an innocent person, also known as Type I error
  • The confusing part is where the two are overlapping, and where you can behave in similar ways
  • If a guilty person does this, identifying them as innocent is a type II error
  • If a guilty person crosses the decision line correctly you call them guilty; this is power
  • As the curves overlap more, it gets harder to tell who is truly guilty or innocent and the more mistakes that can be made

Increasing Power

  • Raising alpha from 0.05 to 0.10 lowers the stringency for guilt
  • You will only find someone guilty if they act guilty at "10"
  • When alpha is 0.05, then they must act guilty at "5"
  • Increasing power means being better at correctly identifying guilty people
  • One method is to increase the effect size
  • You can not control how guilty act to make your work easier
  • Another is to reduce variance, meaning making people's behavior more consistent
  • You can not always reduce variance always
  • Last is to increase the sample size, like more interviewing more people or collecting more evidence
  • These three methods cannot always be practically applied in real investigations

Calculating Power Considerations

  • Three considerations when calculating power include experimental design, effect size (d), and statistical effect (δ, noncentrality parameter)
  • Effect size measures how man standard deviations the distribution of scores are under Ho and H₁
  • Estimated based on past research
  • Effect size tells us how big the actual difference is in the real world
  • The statistical effect tells us how clear that difference is in sample data
  • Also called the noncentrality parameter (δ)
  • This measures how many standard errors apart the distribution of scores are under Ho and H₁
  • With alpha level chosen delta tells how far the true mean Is from the zero in the shifted distribution
  • Even if the real difference is substantial, missing It is possible if sample is to small etc

Cohen's d

  • Commonly used to describe how big or meaningful an effect is
  • If d is around 0.2, the effect is considered small; if it's around 0.5, it's medium; and if it's 0.8 or more, it's large
  • However, these are not hard rules, its more of a guideline

Single Sample t-test for Finding Power

  • Determine the effect size (d), which is independent of sample size: d = (μ1 - μ0)/σ
  • Calculate the statistical effect (δ), which is dependent on sample size: δ = d√N
  • For a single sample t-test, N = n
  • Look up the power in the table (p. 126) for the calculated δ and given α

Single Sample t-test to Find Sample Size:

  • Determine the effect size (d)
  • Use the table (p. 126) to look up the statistical effect required to yield a particular power for the given alpha
  • Calculate the required sample size: N = (δ/d)^2

Single Sample t-test Example

  • Example 1a comes from page 35 of the workbook
  • Data from tests on young adult memory suggest that, on average, they can remember 11.50 objects out of 20, with a variance of 19.03
  • Collected from random samples of 22 older adults, it showed that older adults were able to remember an average of, 13.25
  • Calculate the power of the test in order to replicate study with older adults can be replicated
  • Null hypothesis: Older adults' memory is the same as the young; Alternative hypothesis: older is actually 13.25
  • μ0 = 11.50, μ1 = 13.25, σ2 = 19.03, N = 22
  • First determine effect size: σ2 = √19.030 = 4.362, d = μ1-μ0/ σ 13.250 - 11.500/4.362= 0.40
  • Second calculate what statistical effect is: δ= d√N= .400√22 = .401 x 4.689 = 1.881
  • Then look up the power table on (p. 126)
  • Therefore, round “down” to be a conservative i.e 1.80, power = 44%

Sample t-Test Interpretation

  • The single t-test has given results of the memory test showing that if true mean of other adulate 13.3, then there is a 44% chance correctly rejecting this, with a sample of 22
  • If H0 is false, power is undefined Power calculations make sense when hOh0 is false
  • Defining power requires the formula form and expressing percentage

Finding Sample t-Test to Find Sample Participants

  • Determining the participants needed to achieve the specific power is done with this formula: N =(δ/d)^2
  • Determine d effect size we know
  • Finding delta is done with the power table and alpha level of given.
  • This finds what statistical effect is required for the desired power

Independent Groups t-test Notes

  • Finding power from a given sample size involves a few steps
  • Determine size effect (d): μ-μ0/σ
  • Calculate statistical effect: 8 = d n/ 2, n = sample of each group Look up in power table (p. 126) the result in given aplha

Independent Groups t-test continued

  • The independent test can be used for a specific power, or the sample can be used from particular power
  • μ − μ0
  • Determine effect size dependent of sample size d
  • Find the table to locate the required to yield specific power of the given α test
  • calculate this using calculated required sample site

Independent Groups T-test Example

  • Test of whether people use more positive words then negative
  • medium effect of .05, a new independent test needed with 40 people and a
  • What is power of this result ( = .05, with n for each group is 40 )

Effect size determination

  • previous research would be ""medium"" effect, = ,50. sample means populations of score which has 1/2

- Statistical Effect

  • the formulas are:. d equal equal to d: n1 = 4 , 2 = 40 =N
  • d 2 == .,500.

Final Independent Calculation

  • A give of is 0 5, a a statical effect from the result ,(0 has the power of or rounded down by 2.2
  • note: we don't h t have 2: but we he hve s so use at
  • make sure that your you is, and the d n1 n and If false, and the word, with ample which 4 group
  • use is look give has so test word,
  • So a give for, this if give
  • To

Important multiple choice points:

  • power equals finding of a. of
  • Alapha and of we call

Revision Questions Tutorial 1 and 2:

  • Identify the independent and dependent variables, experimental design, and the appropriate statistical test.
  • Example 1: Studying the effects of a new memory-enhancing drug on Alzheimer's patients by testing before and after administration.
    • IV: Drug administration or time (pre/post)
    • DV: Memory test scores
    • Design: Repeated measures
    • Test: Repeated measures t-test
  • Example 2: Finding out if the mean IQ of statisticians differs from the general population (mean IQ 100, standard deviation 15) by giving an IQ test to 50 statisticians.
    • IV: Job type (statistician vs. population)
    • DV: IQ
    • Design: Single sample
    • Test: Single sample z-test
  • Example 3: Studying the effects of alcohol on motor coordination by comparing a group given a moderate dose of alcohol to another group given no alcohol.
    • IV: Alcohol dosage (moderate vs. none)
    • DV: Motor coordination
    • Design: Independent groups
    • Test: Independent groups t-test
  • Example 4: Examining if males and females are persuaded differently by a female speaker, with each male participant matched to a female participant of the same age, IQ, and occupation.
    • IV: Participant gender (males vs. female)
    • DV: Speaker persuasion
    • Design: Matched samples
    • Test: Repeated measures t-test
  • Example 5: A student comparing the number of driving lessons his friends took to the average number of lessons reported by a driving school (23 lessons).
    • IV: Friends vs. population
    • DV: Number of lessons
    • Design: Single sample
    • Test: Single sample t-test

Mid-Semester Quiz: Scope and Resources

  • All content from first 4 lectures
  • All content from first 3 tutorials
  • Introductory statistics will be useful
  • Include lecture slides, tutorials videos, workbooks, and textbooks
  • Use non-PSYC2010 resources carefully

Key Concepts to Review

  • Normal distributions: Shape, center, and spread
  • z-scores and z-transformations: Definition, calculation, and interpretation
  • z-tables: How to find probabilities/percentiles
  • Using z-scores for significance testing: Single scores and single samples
  • Sampling error and sampling bias: Definition and impact
  • Inferential versus descriptive statistics: Difference and application
  • Sampling distribution of means: Definition and usefulness
  • Hypothesis testing: Types of hypotheses, formulation, and null hypothesis testing
  • Choosing between z and t-tests: Criteria and assumptions for selection

t-Test Types and How to Conduct Them

  • Single sample: Comparing a sample mean to a known population mean
  • Independent groups: Comparing means of two unrelated groups
  • Repeated measures: Comparing means from the same subjects under different conditions
  • Select the correct test
  • Review your formulas
  • Consider the degrees of freedom
  • Review the means when interpreting the results
  • Ensure they meet the basic assumptions regarding the normality of data and equality of variance

More Concepts to Review

  • Advantages and disadvantages of independent vs. repeated measures designs/tests
  • Power and its relationship to key variables (d, N, δ)
  • How power and effect sizes are used by researchers
  • Types of errors (Type I, Type II) and associated probabilities
  • Factors that influence power: explanation and reasons
  • Common rules from Cohen (effect sizes)
  • Method for calculating power for different t-tests
  • Consider the N needed for a given power result

Additional Notes

  • Effect Size (Cohen's d): It's the measure of the distance between two means
  • A small effect size: d≈ 0.2 results in an overlapping of groups
  • If variance given: Standard Deviation variance equals the square root result
  • Delta Combines effect size plus sample size, the detectable outcome from that sampling result
  • Independent samples, Where n per group and independent groups created
  • Under null hypothesis, with distribution curves overlapping the effect size from the h1 curve moves from HO
  • Power is under h, but only if this result rejects H0, also known as a false result

Other Reminders

  • It is best practice to down the delta table in order to be a more conservative
  • Conservative helps in avoiding overestimation
  • Ex delta equals two point to for look up the two point.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Test Construction
30 questions

Test Construction

StunnedBarbizonSchool avatar
StunnedBarbizonSchool
Item Analysis in Psychology
44 questions
Use Quizgecko on...
Browser
Browser