Podcast
Questions and Answers
In the context of experimental design, disentangling the effect of the independent variable (IV) from extraneous variables often necessitates maintaining specific factors constant across all conditions. Which of the following exemplifies such a 'control variable' when investigating the impact of ambient noise on cognitive task performance?
In the context of experimental design, disentangling the effect of the independent variable (IV) from extraneous variables often necessitates maintaining specific factors constant across all conditions. Which of the following exemplifies such a 'control variable' when investigating the impact of ambient noise on cognitive task performance?
- Randomly varying the complexity of cognitive tasks to mirror real-world scenarios.
- Systematically altering the experimenter's demeanor to assess its influence on participant motivation.
- Ensuring all participants complete the cognitive task in a room with consistent temperature and lighting. (correct)
- Allowing participants to self-select their preferred testing time to maximize comfort.
A researcher aims to operationalize 'academic resilience' in a longitudinal study. Considering the multifaceted nature of resilience, which of the following operational definitions would be MOST comprehensive and ecologically valid?
A researcher aims to operationalize 'academic resilience' in a longitudinal study. Considering the multifaceted nature of resilience, which of the following operational definitions would be MOST comprehensive and ecologically valid?
- The number of times a student visits the university counseling center for academic-related stress, tracked across the study period.
- The cumulative GPA of students at the end of each academic year, focusing solely on quantitative performance metrics.
- A composite measure incorporating GPA, frequency of help-seeking behaviors, qualitative analysis of reflective journals detailing coping strategies, and teacher ratings of persistence. (correct)
- A single self-report measure assessing students' perceived ability to bounce back from academic setbacks, administered annually.
Consider a hypothetical study examining the effects of a novel cognitive training program on working memory capacity. Post-intervention, participants in the training group exhibit significantly improved performance on complex span tasks compared to a control group. However, closer inspection reveals that the training group also demonstrated a pre-existing higher baseline performance on these tasks, despite random assignment. What specific threat to internal validity is MOST salient in this scenario?
Consider a hypothetical study examining the effects of a novel cognitive training program on working memory capacity. Post-intervention, participants in the training group exhibit significantly improved performance on complex span tasks compared to a control group. However, closer inspection reveals that the training group also demonstrated a pre-existing higher baseline performance on these tasks, despite random assignment. What specific threat to internal validity is MOST salient in this scenario?
- Selection effects, indicating a systematic difference between groups that existed prior to the intervention. (correct)
- Maturation threat, suggesting that the training group naturally improved over time irrespective of the intervention.
- Attrition threat, assuming that lower-performing participants in the training group disproportionately dropped out of the study.
- Instrumentation threat, due to potential calibration drift in the cognitive tasks.
In a study purportedly investigating the impact of mindfulness meditation on test anxiety, participants are informed about the hypothesized benefits of meditation before commencing the intervention. During post-intervention interviews, many participants in the meditation group report feeling less anxious and more focused during exams, attributing these changes directly to the meditation techniques. However, a physiological measure of anxiety (e.g., cortisol levels) reveals no significant differences between the meditation and control groups. Which of the following phenomena is MOST likely influencing the self-report data in this study?
In a study purportedly investigating the impact of mindfulness meditation on test anxiety, participants are informed about the hypothesized benefits of meditation before commencing the intervention. During post-intervention interviews, many participants in the meditation group report feeling less anxious and more focused during exams, attributing these changes directly to the meditation techniques. However, a physiological measure of anxiety (e.g., cortisol levels) reveals no significant differences between the meditation and control groups. Which of the following phenomena is MOST likely influencing the self-report data in this study?
A researcher develops a new self-report scale designed to measure 'intellectual humility.' To establish convergent validity, the researcher correlates scores on the new scale with scores on several established measures. Which of the following correlation patterns would provide the STRONGEST evidence of convergent validity?
A researcher develops a new self-report scale designed to measure 'intellectual humility.' To establish convergent validity, the researcher correlates scores on the new scale with scores on several established measures. Which of the following correlation patterns would provide the STRONGEST evidence of convergent validity?
When evaluating the test-retest reliability of a newly developed measure of 'grit' (defined as perseverance and passion for long-term goals), a researcher administers the measure to the same group of participants at two time points, separated by a six-month interval. The resulting correlation coefficient is found to be statistically significant but only moderate (r = 0.45). Which of the following interpretations BEST accounts for this finding, considering the nature of the 'grit' construct?
When evaluating the test-retest reliability of a newly developed measure of 'grit' (defined as perseverance and passion for long-term goals), a researcher administers the measure to the same group of participants at two time points, separated by a six-month interval. The resulting correlation coefficient is found to be statistically significant but only moderate (r = 0.45). Which of the following interpretations BEST accounts for this finding, considering the nature of the 'grit' construct?
A research team is designing a study to investigate the effect of a novel drug on reaction time. To minimize systematic variability, they decide to use a within-subjects design. However, they are concerned about potential order effects. Which of the following strategies would be the MOST effective in mitigating order effects in this context?
A research team is designing a study to investigate the effect of a novel drug on reaction time. To minimize systematic variability, they decide to use a within-subjects design. However, they are concerned about potential order effects. Which of the following strategies would be the MOST effective in mitigating order effects in this context?
In a study examining the impact of stereotype threat on women's performance in advanced mathematics, participants are randomly assigned to either a stereotype threat condition (where the stereotype about women's math abilities is made salient) or a control condition (where it is not). The results indicate that women in the stereotype threat condition perform significantly worse on a challenging math test. However, subsequent analysis reveals that the effect is only present for women who strongly identify with their gender. What type of effect is exemplified by gender identification in this scenario?
In a study examining the impact of stereotype threat on women's performance in advanced mathematics, participants are randomly assigned to either a stereotype threat condition (where the stereotype about women's math abilities is made salient) or a control condition (where it is not). The results indicate that women in the stereotype threat condition perform significantly worse on a challenging math test. However, subsequent analysis reveals that the effect is only present for women who strongly identify with their gender. What type of effect is exemplified by gender identification in this scenario?
A study utilizes observational measures to assess social interaction among preschool children. Two independent coders observe the same children and record the frequency of prosocial behaviors. To assess interrater reliability, the researchers calculate Cohen's kappa. Which of the following scenarios would yield the HIGHEST Cohen's kappa coefficient, indicating the strongest interrater reliability?
A study utilizes observational measures to assess social interaction among preschool children. Two independent coders observe the same children and record the frequency of prosocial behaviors. To assess interrater reliability, the researchers calculate Cohen's kappa. Which of the following scenarios would yield the HIGHEST Cohen's kappa coefficient, indicating the strongest interrater reliability?
A researcher is investigating the efficacy of a new therapeutic intervention for social anxiety disorder. Participants are randomly assigned to either the intervention group or a waitlist control group. To control for observer bias, the researchers implement a double-blind study design. What specific measures should the researchers take within this design to ensure a rigorous implementation of blinding?
A researcher is investigating the efficacy of a new therapeutic intervention for social anxiety disorder. Participants are randomly assigned to either the intervention group or a waitlist control group. To control for observer bias, the researchers implement a double-blind study design. What specific measures should the researchers take within this design to ensure a rigorous implementation of blinding?
A researcher seeks to adapt an existing, well-validated measure of depression for use with an adolescent population. The original measure primarily uses language and examples relevant to adults. What crucial step should the researcher undertake to ensure content validity in the adapted measure?
A researcher seeks to adapt an existing, well-validated measure of depression for use with an adolescent population. The original measure primarily uses language and examples relevant to adults. What crucial step should the researcher undertake to ensure content validity in the adapted measure?
In a within-subjects experiment examining the effects of different types of music on cognitive performance, participants complete a series of tasks while listening to classical music, rock music, and silence. The researcher observes that participants consistently perform best during the silence condition. However, upon closer examination, it's revealed that the silence condition always occurred last in the sequence. What specific type of order effect is MOST likely influencing the results?
In a within-subjects experiment examining the effects of different types of music on cognitive performance, participants complete a series of tasks while listening to classical music, rock music, and silence. The researcher observes that participants consistently perform best during the silence condition. However, upon closer examination, it's revealed that the silence condition always occurred last in the sequence. What specific type of order effect is MOST likely influencing the results?
A researcher is designing a study to investigate the effectiveness of a new intervention aimed at reducing test anxiety among college students. The researcher plans to use a pretest/posttest design with a control group. To enhance statistical power while controlling for individual differences, which of the following statistical techniques would be MOST appropriate?
A researcher is designing a study to investigate the effectiveness of a new intervention aimed at reducing test anxiety among college students. The researcher plans to use a pretest/posttest design with a control group. To enhance statistical power while controlling for individual differences, which of the following statistical techniques would be MOST appropriate?
A researcher seeks to measure 'emotional intelligence' using a performance-based task that requires participants to accurately identify emotions displayed in facial expressions. However, pilot testing reveals that nearly all participants achieve near-perfect scores on the task, regardless of their actual emotional intelligence levels. What is the MOST likely explanation for this phenomenon?
A researcher seeks to measure 'emotional intelligence' using a performance-based task that requires participants to accurately identify emotions displayed in facial expressions. However, pilot testing reveals that nearly all participants achieve near-perfect scores on the task, regardless of their actual emotional intelligence levels. What is the MOST likely explanation for this phenomenon?
A researcher is conducting a longitudinal study on the development of moral reasoning in adolescents. Participants complete a series of moral dilemma tasks at ages 13, 15, and 17. The researcher observes that participants' scores on the moral reasoning tasks tend to become less extreme (i.e., closer to the average) over time, regardless of any specific interventions or experiences. What statistical artifact is MOST likely contributing to this pattern?
A researcher is conducting a longitudinal study on the development of moral reasoning in adolescents. Participants complete a series of moral dilemma tasks at ages 13, 15, and 17. The researcher observes that participants' scores on the moral reasoning tasks tend to become less extreme (i.e., closer to the average) over time, regardless of any specific interventions or experiences. What statistical artifact is MOST likely contributing to this pattern?
A researcher designs a study to investigate the impact of a mindfulness intervention on stress levels among healthcare workers. Participants are randomly assigned to either a mindfulness intervention group or a control group. Stress levels are measured using a self-report questionnaire administered before and after the intervention. However, during the study period, a major organizational change occurs within the healthcare system, affecting all workers regardless of group assignment. Which of the following threats to internal validity is MOST salient in this context??
A researcher designs a study to investigate the impact of a mindfulness intervention on stress levels among healthcare workers. Participants are randomly assigned to either a mindfulness intervention group or a control group. Stress levels are measured using a self-report questionnaire administered before and after the intervention. However, during the study period, a major organizational change occurs within the healthcare system, affecting all workers regardless of group assignment. Which of the following threats to internal validity is MOST salient in this context??
When designing a between-groups experiment, a researcher is confronted with the challenge of potential individual difference confounds, particularly given a relatively small sample size. Which of the following design strategies would be MOST effective in mitigating individual differences??
When designing a between-groups experiment, a researcher is confronted with the challenge of potential individual difference confounds, particularly given a relatively small sample size. Which of the following design strategies would be MOST effective in mitigating individual differences??
A researcher is interested in examining the relationship between conscientiousness and academic achievement. The researcher collects data on these variables from a sample of undergraduate students. However, the researcher suspects that the relationship may be influenced by students’ perceived level of social support. In this scenario, what statistical technique would be MOST appropriate?
A researcher is interested in examining the relationship between conscientiousness and academic achievement. The researcher collects data on these variables from a sample of undergraduate students. However, the researcher suspects that the relationship may be influenced by students’ perceived level of social support. In this scenario, what statistical technique would be MOST appropriate?
A researcher aims to assess the impact of a new cognitive training program on working memory capacity among older adults. However, the program requires participants to attend multiple sessions over several weeks, and the researcher is concerned about potential attrition bias. What are the BEST strategies for minimizing attrition bias?
A researcher aims to assess the impact of a new cognitive training program on working memory capacity among older adults. However, the program requires participants to attend multiple sessions over several weeks, and the researcher is concerned about potential attrition bias. What are the BEST strategies for minimizing attrition bias?
A researcher designs a study to investigate the effect of sleep deprivation on cognitive performance. Participants are randomly assigned to either a sleep-deprived group (24 hours without sleep) or a control group (8 hours of sleep). However, the researcher suspects that individual differences in caffeine consumption habits may confound the results. What steps should be taken to address caffeine use?
A researcher designs a study to investigate the effect of sleep deprivation on cognitive performance. Participants are randomly assigned to either a sleep-deprived group (24 hours without sleep) or a control group (8 hours of sleep). However, the researcher suspects that individual differences in caffeine consumption habits may confound the results. What steps should be taken to address caffeine use?
A researcher is conducting a study on the effect of social media use on self-esteem. Participants are asked to report their daily social media usage and complete a self-esteem scale. However, the researcher suspects that participants may be underreporting their social media usage due to social desirability bias. What are the BEST strategies for mitigating this bias?
A researcher is conducting a study on the effect of social media use on self-esteem. Participants are asked to report their daily social media usage and complete a self-esteem scale. However, the researcher suspects that participants may be underreporting their social media usage due to social desirability bias. What are the BEST strategies for mitigating this bias?
A researcher is adapting a well-established measure of anxiety for use with a culturally diverse population. The researcher wants to ensure that the adapted measure is culturally sensitive and maintains its validity across different cultural groups. What steps should be taken to establish cultural validity?
A researcher is adapting a well-established measure of anxiety for use with a culturally diverse population. The researcher wants to ensure that the adapted measure is culturally sensitive and maintains its validity across different cultural groups. What steps should be taken to establish cultural validity?
In single and double blind studies, researchers are UNAWARE of which condition participants are put into. What can this help to reduce?
In single and double blind studies, researchers are UNAWARE of which condition participants are put into. What can this help to reduce?
Two observers count how many times a child shows aggression. If interrater reliability is high, then which statement would be true?
Two observers count how many times a child shows aggression. If interrater reliability is high, then which statement would be true?
A new anxiety scale should correlate HIGHLY with what kind of questionnaire?
A new anxiety scale should correlate HIGHLY with what kind of questionnaire?
What coefficient indicates nearly no relationship?
What coefficient indicates nearly no relationship?
In a depression questionnaire with 10 questions, what should happen with the scores if the scale is internally reliable?
In a depression questionnaire with 10 questions, what should happen with the scores if the scale is internally reliable?
How to operationalize a conceptual variable of 'hunger'?
How to operationalize a conceptual variable of 'hunger'?
What is a concrete construct?
What is a concrete construct?
What are measures based on direct observation of behavior?
What are measures based on direct observation of behavior?
What emphasizes the consistency of a measure?
What emphasizes the consistency of a measure?
What kind of scale is finishing places in a race?
What kind of scale is finishing places in a race?
Which action ensures high internal validity?
Which action ensures high internal validity?
The time of day differing between conditions is considered
The time of day differing between conditions is considered
What kind of study has one group studies with classical music, and another with no music, and then compares test performance?
What kind of study has one group studies with classical music, and another with no music, and then compares test performance?
Participants are randomly assigned to groups and then tested once on the dependent variable.
Participants are randomly assigned to groups and then tested once on the dependent variable.
In measurement contexts, what does ‘internal validity’ sometimes mean?
In measurement contexts, what does ‘internal validity’ sometimes mean?
What indicates how narrow an estimate is around an effect?
What indicates how narrow an estimate is around an effect?
Flashcards
Abstract Construct
Abstract Construct
A mental or theoretical concept (e.g., love, hunger, intelligence).
Concrete Construct
Concrete Construct
A construct that is directly observable or measurable (e.g., height).
Conceptual Definition
Conceptual Definition
Specifies precisely what the researcher means when referring to that variable.
Operationalization
Operationalization
How a conceptual variable is measured or manipulated in a study.
Signup and view all the flashcards
Self-Report Measures
Self-Report Measures
Measurements based on participants' verbal or written responses such as surveys or interviews.
Signup and view all the flashcards
Observational Measures
Observational Measures
Measurements based on direct observation of behavior.
Signup and view all the flashcards
Physiological Measures
Physiological Measures
Measures that record biological data.
Signup and view all the flashcards
Reliability
Reliability
The consistency or repeatability of a measure.
Signup and view all the flashcards
Test-Retest Reliability
Test-Retest Reliability
The consistency of a measure across two or more testing occasions.
Signup and view all the flashcards
Interrater Reliability
Interrater Reliability
The degree to which two or more observers agree on their observations.
Signup and view all the flashcards
Internal Reliability
Internal Reliability
The extent to which multiple items on the same measure consistently measure the same construct.
Signup and view all the flashcards
Nominal Scale
Nominal Scale
Categorical, with no numerical meaning.
Signup and view all the flashcards
Ordinal Scale
Ordinal Scale
Ranked order, but intervals are not necessarily equal.
Signup and view all the flashcards
Interval Scale
Interval Scale
Numeric scales with equal intervals, but no true zero.
Signup and view all the flashcards
Ratio Scale
Ratio Scale
Numeric scales with equal intervals and a true zero.
Signup and view all the flashcards
Scatterplots
Scatterplots
Graphs displaying the relationship between two variables to visualize correlation.
Signup and view all the flashcards
Correlation Coefficients
Correlation Coefficients
A statistical measure indicating the direction and strength of the relationship between two variables.
Signup and view all the flashcards
Validity of a Measure
Validity of a Measure
The extent to which a measure assesses what it is intended to measure.
Signup and view all the flashcards
Face Validity
Face Validity
Whether a measure appears, at face value, to measure what it claims.
Signup and view all the flashcards
Content Validity
Content Validity
The extent to which a test includes all parts of the construct it aims to assess.
Signup and view all the flashcards
Criterion Validity
Criterion Validity
The extent to which a measure predicts or relates to an outcome it should theoretically predict.
Signup and view all the flashcards
Convergent Validity
Convergent Validity
The degree to which a measure correlates strongly with other measures of the same construct.
Signup and view all the flashcards
Discriminant Validity
Discriminant Validity
The extent to which a measure does not correlate strongly with measures of different constructs.
Signup and view all the flashcards
Internal Validity
Internal Validity
The measure is free from confounding factors so that you can be confident the measure alone is capturing your construct.
Signup and view all the flashcards
Between-Groups Designs
Between-Groups Designs
Different groups of participants are placed into different levels of the independent variable.
Signup and view all the flashcards
Posttest-Only Design
Posttest-Only Design
Participants are randomly assigned to groups, then tested once on the dependent variable.
Signup and view all the flashcards
Pretest/Posttest Designs
Pretest/Posttest Designs
Measured on the DV before exposure to the IV (pretest) and then again after exposure (posttest).
Signup and view all the flashcards
Control Variable
Control Variable
Any variable the experimenter holds constant across conditions.
Signup and view all the flashcards
Dependent Variable
Dependent Variable
The variable that is measured to see whether it is affected by the independent variable.
Signup and view all the flashcards
Independent Variable
Independent Variable
The variable that the experimenter manipulates; has distinct levels/conditions.
Signup and view all the flashcards
Design Confounds
Design Confounds
When a second variable systematically varies with the IV.
Signup and view all the flashcards
Effect Size
Effect Size
A quantitative measure of the strength or magnitude of the relationship between variables.
Signup and view all the flashcards
Internal Validity (Experiments)
Internal Validity (Experiments)
A study rules out alternative explanations for a causal relationship between the IV and DV.
Signup and view all the flashcards
Matched-Groups Design
Matched-Groups Design
Participants are matched on a particular characteristic (e.g., IQ, age) and then randomly assigned.
Signup and view all the flashcards
Order Effects
Order Effects
Being exposed to one condition changes how participants respond to subsequent conditions.
Signup and view all the flashcards
Selection Effects
Selection Effects
Occur when participants in one IV level systematically differ from those in another IV level.
Signup and view all the flashcards
Systematic Variability
Systematic Variability
A variable's levels coincide in a predictable (non-random) manner with the IV.
Signup and view all the flashcards
Unsystematic Variability
Unsystematic Variability
Random fluctuations across conditions that contribute noise to data.
Signup and view all the flashcards
Within-Groups Designs
Within-Groups Designs
The same group of participants experiences all levels/conditions of the independent variable.
Signup and view all the flashcards
Concurrent Measures
Concurrent Measures
Participants are exposed to all IV levels at the same time, then a single attitudinal or behavioral preference is the DV.
Signup and view all the flashcardsStudy Notes
- Identifying Good Measurement
Abstract vs. Concrete Constructs
- An abstract construct refers to a mental or theoretical concept (e.g., love, hunger, intelligence).
- A concrete construct can be directly observed or measured (e.g., number of items recalled, reaction time, height).
- "Love" is an example of an abstract concept.
- "Blood pressure" can be a concrete physiological index of stress.
Conceptual Definitions
- The conceptual definition of a variable, also known as the construct definition, clarifies the researcher's meaning when referring to that variable.
- A conceptual definition of "hunger" could be "the subjective feeling of needing food."
Operationalization
- Operationalization, also called an operational definition, is how a conceptual variable gets measured or manipulated in a study.
- "Hunger" might be operationalized as "hours since last meal" or "total calories consumed in a day.”
Self-Report Measures
- These measures gather data through participants' verbal or written responses, like surveys, questionnaires, or interviews.
- Parent or teacher reports can also be included when studying children.
- A questionnaire asking, "On a scale of 1–5, how hungry are you right now?" is an example.
Observational Measures (Behavioral)
- These measures are based on direct observation of behavior or physical traces.
- Physical traces, like counting wrappers in a trash can, can indicate snacking behavior.
- Counting the number of times someone opens a refrigerator gives an indication of hunger through observable behavior.
Physiological Measures
- These measures record biological data, such as fMRI, EEG, hormone levels, and heart rate.
- Measuring salivary cortisol indicates stress levels, while saliva production can operationalize hunger.
Reliability
- Reliability refers to the consistency or repeatability of a measure.
- It encompasses test-retest, interrater, and internal reliability.
Test-Retest Reliability
- This is the consistency of a measure across two or more testing occasions for constructs expected to remain stable (e.g., intelligence).
- Measuring someone's intelligence in January and June should yield strongly correlated scores if the measure is reliable for "IQ".
Interrater Reliability
- Interrater reliability indicates the degree to which two or more observers or coders agree on their observations of the same behavior.
- If two observers count instances of aggression on a playground, their tallies should be very similar if interrater reliability is high.
Internal Reliability
- Internal reliability is the extent to which multiple items on the same measure or scale consistently measure the same construct.
- Cronbach's alpha is often used to assess it.
- Scores should be correlated if a depression questionnaire with 10 questions targeting depression symptoms is internally reliable.
Scales of Measurement (Levels of Measurement)
-
Nominal scales are categorical with no numerical meaning (e.g., types of soda like Coke, Sprite, Pepsi).
-
Assigning "1" to Coke, "2" to Sprite, and so on, is for labeling only.
-
Ordinal scales indicate ranked order, though intervals are not necessarily equal (e.g., finishing places in a race: 1st, 2nd, 3rd).
-
The time gap between 1st and 2nd place may differ from the gap between 2nd and 3rd.
-
Interval scales include numeric scales that have equal intervals between values, but no true zero (zero doesn't mean "none" of the construct).
-
For example, 0°C doesn't mean “no temperature.”
-
Ratio scales feature numeric scales with equal intervals and a true zero (i.e., zero means "nothing" of that variable).
-
Examples include height, weight, and reaction time (0 ms signifies no reaction time).
Scatterplots
- Scatterplots are graphs that display the relationship between two variables, often to visualize correlation.
Correlation Coefficients
- A statistical measure, like Pearson's r, that indicates the direction and strength of a relationship between two variables, ranging from -1.0 to +1.0.
- An r of +0.80 indicates a strong positive correlation, while r = -0.05 indicates nearly no relationship.
Validity (of a Measure)
- This refers to the extent to which a measure assesses what it is intended to measure.
Face Validity
- Face validity assesses whether a measure appears, on the surface, to measure what it claims to measure.
- A survey titled "Depression Questionnaire" which asks about mood and energy has face validity since it looks like it's measuring depression.
Content Validity
- This refers to the extent to which a test or measure includes all parts of the construct it aims to assess.
- A math test covering algebra, geometry, and calculus (not just algebra) captures the entire domain.
Criterion Validity
- Criterion validity is the extent to which a measure predicts or relates to an outcome it should theoretically predict.
- SAT scores serving as a measure to predict college performance is an example.
Convergent Validity
- Convergent validity refers to the degree to which a measure correlates strongly with other measures of the same construct.
- A new anxiety scale correlating highly with an established anxiety questionnaire demonstrates convergent validity.
Discriminant (Divergent) Validity
- This refers to the extent to which a measure does not correlate strongly with measures of different constructs.
- A depression scale shouldn't correlate strongly with a measure of physical fitness.
Internal Validity (as a measurement term)
- In measurement contexts, "internal validity" can mean the measure is free from confounding factors.
- It assures that the measure alone captures the construct.
Between-Groups Designs (Independent-Groups or Between-Subjects Designs)
- Different groups of participants are placed into different levels of the independent variable.
- Group 1 studies with classical music, and Group 2 studies with no music to compare test performance.
Equivalent Groups, Posttest-Only Design
- Participants are randomly assigned to groups to ensure equivalence, then tested once (posttest) on the dependent variable.
- Randomly assigning people to watch either a funny or a serious video, then measuring their mood afterward, exemplifies a posttest-only design.
Equivalent Groups, Pretest/Posttest Designs
- Participants are randomly assigned to at least two groups, measured on the DV before exposure to the IV (pretest) and then again after exposure (posttest).
- When two groups both take a mood pretest, later Group 1 gets the "funny" video and Group 2 gets the "serious" video, followed by a mood posttest.
Control Variable (and Control Variables)
- Any variable the experimenter holds constant across conditions to isolate the independent variable's effect on the dependent variable.
- Maintaining consistent room temperature, time of day, and lighting for all participants.
Dependent Variable
- This is the variable measured to determine whether it is affected by the independent variable.
- Test scores, reaction time, and mood ratings are examples of the dependent variable.
Independent Variable
- The variable that the experimenter manipulates; it has distinct levels/conditions.
- Classical music versus no music in a memory study is an example.
Design Confounds
- A second variable systematically varies with the IV, providing an alternative explanation for the results.
Effect Size
- Effect size is a quantitative measure of the strength or magnitude of the relationship (e.g., Cohen's d, Pearson's r) between variables.
- A large Cohen's d (e.g., 0.80) indicates a big difference between groups.
Internal Validity (in Experiments)
-
This is the degree to which a study rules out alternative explanations for a causal relationship between the IV and DV.
-
Also it establishes that the IV alone caused changes in the DV.
-
A well-controlled study with no confounds and random assignment ensures high internal validity.
Matched-Groups Design
- Participants are matched on a particular characteristic (e.g., IQ, age) and then randomly assigned to different conditions.
- This design is often used to reduce individual difference confounds in small samples.
- Matching on reading level and then assigning matched pairs to different teaching-method conditions.
Order Effects (Carryover & Practice Effects)
- A within-subjects phenomenon: being exposed to one condition changes how participants respond to subsequent conditions.
Types of Order Effects:
- Carryover Effects: Residual influence from a previous condition (e.g., drug still in the system).
- Practice (or Fatigue) Effects: Participants get better (practice) or worse (fatigue) over repeated tasks.
Pretest/Posttest Design
- This is a type of between-groups design where participants are measured on the DV both before and after exposure to the IV.
Selection Effects
- Selection effects occur when participants in one IV level systematically differ from those in another IV level.
- This often arises when participants self-select or when assignment isn't properly random.
- Volunteers for the "stressful condition" may already be more thrill-seeking compared to those placed in a neutral condition.
Systematic Variability
- This is when, in experiments, a variable's levels coincide in a predictable (non-random) manner with the IV, potentially creating confounds if uncontrolled.
- When all enthusiastic research assistants run the "treatment" group while all bored assistants run the "control" group.
Unsystematic Variability
- Random or haphazard fluctuations across conditions contribute noise to data but don't provide a systematic alternative explanation.
- Individual differences in mood fluctuate daily, adding random variation unrelated to the IV group.
Within-Groups Designs (Within-Subjects Designs)
- The same group of participants experiences all levels/conditions of the independent variable.
Concurrent Measures
- Participants are exposed to all IV levels (or multiple stimuli) simultaneously, then a single attitudinal or behavioral preference is the DV.
- Infants view two faces simultaneously (male vs. female), and the time spent looking at each face is measured.
Repeated Measures
- Participants are measured on the DV multiple times, after each distinct level of the IV.
- The same group rates the taste of Cookie A, then Cookie B, then Cookie C.
Attrition Threats
- Attrition threatens occur when certain participants (often extreme scorers) drop out of a study systematically.
- This threatens internal validity if one condition loses more participants than another.
- Participants with severe anxiety dropping out of the "candy therapy" condition while calmer individuals stay.
Blind Studies (and Double-Blind Studies)
- In a single-blind (masked) study: Participants or researchers are unaware of which condition participants are in.
- In a double-blind study: Neither the participants nor the researchers evaluating them know who's in which condition.
- In a drug trial: neither participants nor experimenters know who receives the real drug versus a placebo.
Ceiling and Floor Effects
- Ceiling effect: occurs when all scores cluster at the top of the scale because a task is too easy or the measurement is capped.
- Floor effect: occurs when all scores cluster at the bottom because a task is too hard or the measurement starts above zero.
- A math test that is far too easy could produce near-100% scores for everyone (ceiling effect).
Combined Threats
- Combined threats occur when two or more internal validity threats overlap.
- For example, selection-history, where a historical event only affects participants in one condition.
- When only the experimental group experiences a campus wellness event, leading them to change their behavior differently from the control group.
Demand Characteristics
- These are cues in the research setting that allow participants to guess the hypothesis or expectations, potentially altering their behavior to "help" the study.
- A participant notices the researcher's excitement about one condition and starts trying to please them in that condition.
History Threats
- External events happening during the study affect all (or most) participants in the treatment group.
- A campus-wide stress-reduction campaign might reduce everyone's stress, not just the group receiving "therapy.”
Instrumentation Threats
- These occur if the measurement instrument (e.g., coding guidelines, calibration) changes over time.
- The changes make pretest/posttest scores not directly comparable.
- A researcher becoming more lenient in scoring anxiety over the course of the study.
Maturation Threats
- A natural change in participants (like spontaneous improvement) occurs over time, not due to the IV.
- Students becoming less nervous by the end of the semester simply because they've adjusted to school demands.
Measurement Error
- Measurement error consists of factors that inflate or deflate a person's true score on the DV.
- Using a poorly calibrated scale to weigh participants introduces random error in weight measurements.
Observer Bias
- Observer bias occurs when researcher expectations influence how they interpret outcomes or record behaviors.
- A researcher unconsciously rates participants they "expect" to improve as more improved.
Placebo Effects
- Improvement or change occurs simply because participants believe they are receiving a valid treatment, not from the treatment's "active" ingredients.
- Taking a sugar pill for headache relief and feeling better purely due to the belief that the pill is real medicine.
Precision & Power
- Power is the likelihood of finding a statistically significant effect when one truly exists.
- Precision refers to how narrow the estimate (e.g., a confidence interval) is around an effect.
- Larger sample sizes and well-controlled methods increase power by reducing random noise.
Regression Threats (Regression to the Mean)
- This is a tendency for extreme scores at one measurement to be less extreme (closer to average) at the next measurement.
- Students with unusually high nervousness scores on the first day might naturally score closer to average by the second day, regardless of any treatment.
Reverse Confounds
- When an unaccounted-for variable pushes the effect of the IV in the opposite direction, potentially masking a real difference.
- If the "no coffee" group accidentally all had intense sugar rushes, it might cancel out the caffeine advantage in the "coffee" group.
Sample Sizes
- The number of participants in each condition.
- Larger sample sizes reduce random error and increase the precision of estimates, thereby boosting power.
- A study with 200 participants per group typically has higher power than one with only 20 participants per group.
Situation Noise
- Any external distractions or variability in the testing environment that increase unsystematic variability within each group.
- Running one group of participants in a noisy hallway and the other group in a quiet lab could add extra "noise" to the data.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.