Podcast
Questions and Answers
What distinguishes psychological measurements from measurements in classical sciences like physics or chemistry?
What distinguishes psychological measurements from measurements in classical sciences like physics or chemistry?
- Psychological measurements use complex mathematical formulas, while classical sciences rely on direct observation.
- Classical sciences focus on external phenomena, while psychology studies internal, unmeasurable traits.
- Psychological measurements involve subjective interpretations, whereas classical sciences are entirely objective.
- Classical sciences have instruments with specified error margins, unlike psychology which measures constructs not directly observable. (correct)
In the context of psychological testing, what is the primary goal of employing mathematical or statistical theories?
In the context of psychological testing, what is the primary goal of employing mathematical or statistical theories?
- To eliminate subjectivity in the evaluation of psychological traits.
- To infer trait levels from observed performance, since psychological traits are not directly measurable. (correct)
- To provide a standardized method of administering psychological tests.
- To directly measure psychological traits without relying on observation.
According to the Classical Test Theory (TCT), what is the main objective when using a test?
According to the Classical Test Theory (TCT), what is the main objective when using a test?
- To predict future behavior.
- To determine the degree to which a test reflects the true levels of a trait. (correct)
- To measure the error in psychological measurement.
- To diagnose psychological disorders.
Within the framework of Classical Test Theory (TCT), what are the components that constitute an individual's observed score (X) on a test?
Within the framework of Classical Test Theory (TCT), what are the components that constitute an individual's observed score (X) on a test?
In the context of Classical Test Theory (TCT), which element is directly observable?
In the context of Classical Test Theory (TCT), which element is directly observable?
What is the definition of the 'true score' (V) of a person according to the Classical Test Theory (TCT)?
What is the definition of the 'true score' (V) of a person according to the Classical Test Theory (TCT)?
According to Classical Test Theory (TCT), what is the nature of the measurement error (E)?
According to Classical Test Theory (TCT), what is the nature of the measurement error (E)?
According to Classical Test Theory (TCT), what is the difference between the empiric score and the true score?
According to Classical Test Theory (TCT), what is the difference between the empiric score and the true score?
Which of the following is an example of a source of error related to transient factors, according to Classical Test Theory (TCT)?
Which of the following is an example of a source of error related to transient factors, according to Classical Test Theory (TCT)?
According to Classical Test Theory (TCT), what describes errors due to the specificity of the contents of the items?
According to Classical Test Theory (TCT), what describes errors due to the specificity of the contents of the items?
According to Classical Test Theory (TCT), what would errors due to sporadic or aleatory factors mean?
According to Classical Test Theory (TCT), what would errors due to sporadic or aleatory factors mean?
What are parallel forms in the context of Classical Test Theory (TCT)?
What are parallel forms in the context of Classical Test Theory (TCT)?
What is the purpose of using parallel test forms?
What is the purpose of using parallel test forms?
If parallel forms of a test are administered to the same individuals during the same session, what does the correlation between the observed scores on both forms estimate?
If parallel forms of a test are administered to the same individuals during the same session, what does the correlation between the observed scores on both forms estimate?
Which of the following is an assumption of Classical Test Theory (TCT)?
Which of the following is an assumption of Classical Test Theory (TCT)?
What does it mean if the correlation between true scores and errors is zero in Classical Test Theory (TCT)?
What does it mean if the correlation between true scores and errors is zero in Classical Test Theory (TCT)?
According to Classical Test Theory (TCT), what is the implication if the correlation between the errors of two parallel forms is non-zero?
According to Classical Test Theory (TCT), what is the implication if the correlation between the errors of two parallel forms is non-zero?
What is a limitation of Classical Test Theory (TCT) regarding comparisons between individuals?
What is a limitation of Classical Test Theory (TCT) regarding comparisons between individuals?
According to Classical Test Theory (TCT), what is a limitation in generalizing the results?
According to Classical Test Theory (TCT), what is a limitation in generalizing the results?
What does the limitation regarding inconsistency of measurement error in Classical Test Theory (TCT) mean?
What does the limitation regarding inconsistency of measurement error in Classical Test Theory (TCT) mean?
Why is the Classical Test Theory (TCT) considered infalsifiable?
Why is the Classical Test Theory (TCT) considered infalsifiable?
How does Item Response Theory (IRT) differ from Classical Test Theory (TCT) with regard to the sample?
How does Item Response Theory (IRT) differ from Classical Test Theory (TCT) with regard to the sample?
Within Item Response Theory (IRT), what does the item characteristic curve (ICC) represent?
Within Item Response Theory (IRT), what does the item characteristic curve (ICC) represent?
In Item Response Theory (IRT), what does the 'discrimination index' (a) of an item indicate?
In Item Response Theory (IRT), what does the 'discrimination index' (a) of an item indicate?
Within the context of Item Response Theory (IRT), what does the 'difficulty index' (b) of an item signify?
Within the context of Item Response Theory (IRT), what does the 'difficulty index' (b) of an item signify?
In Item Response Theory (IRT), what does the pseudo-guessing parameter (c) refer to?
In Item Response Theory (IRT), what does the pseudo-guessing parameter (c) refer to?
What parameter does the Rasch model use?
What parameter does the Rasch model use?
In Item Response Theory (IRT), which of the following is a fundamental assumption?
In Item Response Theory (IRT), which of the following is a fundamental assumption?
What does 'local independence' mean in Item Response Theory (IRT)?
What does 'local independence' mean in Item Response Theory (IRT)?
What is one of the benefits of computerized adaptive testing (CAT) based on Item Response Theory (IRT)?
What is one of the benefits of computerized adaptive testing (CAT) based on Item Response Theory (IRT)?
Flashcards
Psychological Trait Measurement
Psychological Trait Measurement
Theories that use mathematical or statistical methods to infer psychological traits from performance.
Classical Test Theory (CTT)
Classical Test Theory (CTT)
A theory proposing a formal model to determine how well a test reflects true trait levels.
Empirical Score (X)
Empirical Score (X)
The direct, observed value obtained from a test.
True Score (V)
True Score (V)
Signup and view all the flashcards
Measurement Error (E)
Measurement Error (E)
Signup and view all the flashcards
Components of an empirical score
Components of an empirical score
Signup and view all the flashcards
Transient Error
Transient Error
Signup and view all the flashcards
Specificity Error
Specificity Error
Signup and view all the flashcards
Random Errors
Random Errors
Signup and view all the flashcards
Parallel Forms
Parallel Forms
Signup and view all the flashcards
Expected Error Value
Expected Error Value
Signup and view all the flashcards
Error Correlation
Error Correlation
Signup and view all the flashcards
Parallel Forms Error Correlation
Parallel Forms Error Correlation
Signup and view all the flashcards
Comparing scores
Comparing scores
Signup and view all the flashcards
Generalizing results
Generalizing results
Signup and view all the flashcards
Measurement variation
Measurement variation
Signup and view all the flashcards
Indicators of adjustment
Indicators of adjustment
Signup and view all the flashcards
Item Response Theory (IRT)
Item Response Theory (IRT)
Signup and view all the flashcards
Latent Trait
Latent Trait
Signup and view all the flashcards
Item Characteristic Curve (ICC)
Item Characteristic Curve (ICC)
Signup and view all the flashcards
Discrimination Index (a)
Discrimination Index (a)
Signup and view all the flashcards
Difficulty Index (b)
Difficulty Index (b)
Signup and view all the flashcards
Pseudo-guessing (c)
Pseudo-guessing (c)
Signup and view all the flashcards
Rasch Model
Rasch Model
Signup and view all the flashcards
Two-Parameter Model
Two-Parameter Model
Signup and view all the flashcards
Three-Parameter Model
Three-Parameter Model
Signup and view all the flashcards
Unidimensionality
Unidimensionality
Signup and view all the flashcards
Local Independence
Local Independence
Signup and view all the flashcards
Test Improvement
Test Improvement
Signup and view all the flashcards
Cross population comparison
Cross population comparison
Signup and view all the flashcards
Study Notes
- The topic is about measurement theories in psychometrics.
Introduction
- In classical sciences like medicine, physics, and chemistry, instruments with specified error margins exist to measure characteristics like temperature, blood pressure, weight, and chemical concentrations.
- Psychology lacks such direct measurement instruments for traits like introversion, attitudes towards abortion, spatial aptitude, or reading ability.
- Mathematical or statistical theories are developed to measure psychological traits by inferring trait levels from observed performance.
- The question is whether a test score represents a good manifestation of the authentic trait of attention in that person.
- Psychologists need to be certain that the scores from tests produce good quantifications of the levels in their patients.
Classical Test Theory (CTT)
- Classical Test Theory (CTT) emerged at the beginning of the 20th century from the early work of psychologist Charles E. Spearman.
- CTT proposes a formal model to determine to what extent a test provides information about true trait levels.
CTT Scores and Types of Error
- A person's direct empirical score on a test (X) comprises two theoretical components which are the person's true trait level or score (V) and the error of measurement (E) that occurs when measuring the trait with the test: X = V + E
- X is the total score and empirically derived.
- V is the true score.
- E is the Measurement error.
- The total score (X) is the only element that can be observed.
- A true score (V) is constant for each person, or what they would score if no error existed.
- Measurement error (E) is a random variable that causes variability in scores.
Types of Error
- Errors occur due to transitory factors that are dependent on environmental factors, time, and motivation.
- Errors arise due to the specificity of the content of the items, where items should be representative and varied.
- Errors are from random factors that occur due to sporadic circumstances.
Parallel forms in the CTT
- Parallel forms refer to creating equivalent versions of a measurement test.
- These instruments measure the same construct with equivalent psychometric properties.
- Parallel forms have similar characteristics like the number of items, content, difficulty, instructions, and administration time.
- The versions are used interchangeably to measure the same construct.
- Repeated test administrations are permitted without risks of individuals recalling their responses from the previous administration.
- When administered to the same people simultaneously, the correlation between observed scores correlates in both forms, estimating reliability coefficient.
CTT Assumptions
- The expected value of errors is 0; the average is zero if applied infinitely: E(e) = 0.
- The correlation between true scores of people and their errors is zero: PVE = 0.
- The correlation between errors of both forms is zero: PE1E2 = 0.
- The correlation between the true scores of people and their errors in each form is zero: PVE1= PVE2 = 0.
CTT Limitations
- It is difficult to make comparisons because both subjects have distinct true scores because there is variability between subjects.
- It is difficult to generalize results because item parameters depend on the sample, such as difficulty and discrimination.
- There is inconsistency in the measurement error because the error is not constant between scores and can vary between skill levels.
- There is a lack of model fit indicators, which makes the CTT model unfalsifiable, so there are no mechanisms to verify if it functions.
Item Response Theory (IRT)
- Item Response Theory (IRT) emerged in the 1950s, in response to the limitations of CTT.
- IRT relies on the relationship between the latent trait (θ), representing an individual's ability taking the test, and the probability of correctly answering item P(θ).
- IRT offers a more flexible approach that doesn't rely on the sample and adapts to an evaluated person's different skill or trait levels.
Item Characteristic Curve
- Y-axis: probability of success, or P(0)
- X-axis: level of a trait (0)
Parameters of Items
- Discrimination Index (a) measures the item's capacity to distinguish between subjects with better or worse scores.
- Difficulty Index (b) signals the point at which one is more likely to get the item right around 0.5.
- Pseudo-Randomness (c) measures the probability of getting the item correctly by chance, even when they do not know the correct answer.
TRI: Main Models
- Models employ different complexity, parameters, and utility.
- The Rasch model uses parameter b for difficulty, is simple, and has easy interpretation.
- The 2-parameter (ML2P) model uses parameters b for difficulty and a for discrimination to better represent real data with discriminative capacity.
- The 3-parameter (ML3P) mode uses parameters b for difficulty, a for discrimination, and c for pseudo-guessing, to more closely model multiple-choice items by adjusting the probabilities for guessing.
IRT Assumptions
- Unidimensionality ensures that the items measure a single construct.
- Local independence occurs when the responses to an item do not depend on other items.
IRT Applications
- Developing tests allows for more precise measurement, which allows for improvements in the study of psychometric properties.
- The different models allow for different evaluations to assess more complex, multidimensional traits.
- It allows for comparison between populations that does not depend on estimating the sample parameters.
- It develops adaptive computer-based tests (CAT) where each question is different per evaluated subject and adapts to their trace level.
- Examines the differential functioning of Items (DIF), which determines whether an item functions better or worse based on a person's group.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.