1st Diagnostic Examination BPSY 198 (Psychological Assessment) PDF
Document Details
Uploaded by SuperFriendship
Summary
This is a past paper for a psychology course, including questions and answers on topics such as psychological assessment, internal validity, and reliability. It appears to cover various aspects of psychological testing and measurement.
Full Transcript
DIAGNOSTIC EXAMINATION: BPSY 198 (PSYCHOLOGICAL ASSESSMENT)

Name: ___________________________________
Year & Section:____________________________
Date:_____________________________________
Score:____________________________________

Instructions: Read the following questions carefully and encircle the correct answer. Strictly no erasures allowed; erasures are automatically considered wrong.

1. A study reveals that as screen time on social media among adolescents increases, mental health tends to decrease. This kind of result shows a
a. Negative relationship
b. Positive relationship
c. Significant difference
d. Correlation relationship

2. What is the best statistical tool to use in establishing the split-half reliability of a test with a limited number of items?
a. Spearman-Brown formula
b. Cronbach's Coefficient Alpha
c. Kuder-Richardson 20
d. Pearson r
Reliability - Consistency, accuracy, dependability of test results
Test-retest reliability
- Administering a test at 2 different times
- Time sampling (consistency of test over time)
- Pearson r
Parallel Forms reliability
- Compares 2 equivalent forms of a test that measure the same attributes
- Item sampling (different items, but same difficulty/number of items/content)
- Alternate forms/equivalent forms
Internal consistency (measures only one construct)
- Split-half reliability
  - Divide into halves then score separately (odd-even or random)
  - Spearman-Brown formula (the more items, the higher the reliability); good if you have a limited number of items
- Kuder-Richardson 20
  - Used for dichotomous items; only one correct answer
  - Test has varying degrees of item difficulty
- Kuder-Richardson 21
  - Used for dichotomous items; only one correct answer
  - Test has the same level of item difficulty (usually for speed tests)
- Cronbach's Coefficient Alpha
  - Used for polychotomous items (several possible answers); no one correct answer
  - Used for Likert scales
Dichotomous items: only 1 correct answer; Polychotomous: no one correct answer; does NOT refer to number of choices
Interrater Reliability
- Consistency of judges/raters evaluating the same behavior
- Observer differences
- Kappa statistics

3. What is the best course of action when our ethical principles come into conflict with law?
a. Maintain our position regardless of legal requirements, as our ethical guidelines are designed to protect the safety and well-being of our clients and patients.
b. Comply with the law while simultaneously making efforts to reduce any inconvenience it may cause to those affected.
c. Follow the law, as it represents the supreme authority in the country.
d. Address the conflict while remaining dedicated to upholding the code of ethics.
- Do D (resolve conflict while adhering to ethics) first as much as possible
- If you've tried everything and it really cannot be done, ADHERE TO THE LAW

4. The test taker neglected to drink water before the test. As he proceeds with the examination, he feels so thirsty. What type of challenge to internal validity does this situation pose?
a. Testing
b. Instrumentation
c. History
d. Selection
Threats to internal validity
- History - occurrence of events (before they took the test)
- Maturation - internal/physical changes (longitudinal studies)
- Testing - effects of pretest on the posttest (practice effect)
- Instrumentation - inconsistent use of measurement instrument (mistake in test material or administration)
- Statistical regression - tendency of extreme scores to regress toward the mean score
- Selection - no random assignment
  - Quasi experiment: no random assignment or no control group
- Subject mortality - loss of subjects
- Selection interaction - family of threats (multiple threats)

5. The House-Tree-Person test was developed by:
a. Lewis Terman - Revised the Binet-Simon scale into the Stanford-Binet
b. John Buck
c. R.B. Cattell - Developed CFIT, 16 PF; conceptualized fluid and crystallized intelligence; popularized factor analysis
d. Florence Goodenough - DAP
James Cattell - Coined the term "mental test" and launched the beginning of mental testing

6. Which of the following elements must be present before an experiment can be called a true experiment?
a. Random assignment
b. Control group
c. Random sampling
d. All of the above
A true experiment is random assignment + control group. If either one is missing, it's a quasi experiment.

7. When the distribution of scores includes outliers, it is better not to use
a. Mean
   - Interval/ratio (not skewed)
b. Median
   - Odd number of scores = 1 middle score; even number = 2 middle scores (divide into two)
   - Ordinal
   - Interval/ratio (skewed)
c. Mode
   - Nominal
d. All of the above
- Never get the mean if there are outliers
- Get the median if there are outliers; least affected by extreme scores

8. The Children Personality Questionnaire-R is an example of what kind of test?
a. Unstructured test
b. Projective test
c. Structured test
d. Intelligence test
Ability test (maximal)
- Achievement test - previous learning (past)
- Aptitude test - potential for learning or acquiring a new skill (future)
- Intelligence test - general potential (present)
Personality test (typical) - overt and covert dispositions
- Structured - usually self-report; evaluate yourself
- Projective - either stimulus or response is ambiguous; unstructured

9. Niks and Cari are scheduled to get their height and weight measured at the clinic. They mentioned it was unnecessary since they had just taken their measurements the day before. To their surprise, Niks' weight increased from 45kg to 50kg, and Cari's weight went from 55kg to 57kg, while both maintained their height. This could imply that:
a. The weighing scale has a systematic error
b. The weighing scale has a random error
c. Niks ate more than Cari did the night before
d. None of the above
Systematic error - error is fixed; still possible to get the true score (e.g., consistently adding 5kg to all scores)
Random error - error is not consistent; now difficult to get the true score

10. The split-half reliability is used to determine
a. If the test can disregard bias despite the number of factors being measured
b. If all the items in the test measure the same dimension
c. Whether consistent scores would be obtained regardless of the characteristics of the test taker
d. All of the above
See the reliability notes above
- Split-half - internal consistency

11. The 16-PF is a personality test that could also determine unusual responses. Specifically, which of the following?
a. Impression Management
b. Infrequency
c. Acquiescence
d. All of the above
16 PF can detect 3 kinds of unusual responses
- Impression management: social desirability
  - IM score is 95% and above; person is faking good
  - IM score is 5% and below; person is faking bad
- Infrequency: person is playing safe
  - IN score is 95% and above; person is playing safe
- Acquiescence: tendency to agree to most questions
  - AC score is 95% and above; person is agreeing to everything

12. This is the test used to determine the current developmental level of infants.
a. Apgar test - given to newborn babies to check if they have abnormalities; given twice (1 min after birth, then 5 mins after birth); given a third time at 10 mins if results still aren't good
b. Kaufman Assessment Battery - Intelligence test for young children
c. Woodcock-Johnson III - Intelligence test; for detecting learning disabilities
d. Bayley Scale

13. In order to determine the concurrent validity of a test, the statistical tool to be employed is
a. Spearman-Brown formula
b. Kuder-Richardson 20
c. Point-biserial correlation
d. Pearson r
Validity - meaning and usefulness of results; if test is appropriate
Criterion validity
- How well it corresponds to a particular criterion
- Types:
  - Criterion test - well-established test, already known to be valid
  - Criterion data - any data you can use that's related to your test/as a basis (e.g., performance appraisal, diagnosis, records)
- Predictive validity - forecasting function
- Concurrent validity - simultaneous relationship between test and criterion; no significant time has passed
Content validity
- Adequacy of representation of the conceptual domain the test is designed to cover
- Experts judge validity of test items, using critical/logical thinking skills
Construct validity
- Degree to which a test measures what it purports to measure
- Used if test measures abstract variables (exists but hard to measure)
- Based on theoretical perspective
- All encompassing; if you establish construct validity, you establish other validities
- Convergent validity - correlates well with other related constructs; theory said constructs are related
- Divergent/discriminant validity - low correlations with unrelated constructs; theory said constructs are unrelated
Face validity
- Test subjectively viewed that it measures what it purports to measure
- Physical appearance of test

14. For individuals who are legally unable to give consent, we must:
a. Still provide an appropriate explanation to the client
b. Seek informed assent from them
c. Obtain proper authorization from their legal representative
d. All of the above
- Informed assent: for minors (17 and below)
- Informed consent: for adults (18 and above)
- If incapable of providing consent, get both informed consent and assent

15. When releasing test data, we divulge which of the following?
a. Release raw and scaled scores
b. Release client's responses to the test questions
c. Observation notes
d. None of the above
- Don't release raw and scaled scores because the client does not know the interpretation
- Don't release responses because that's confidential
- Don't release observation notes because you're not obligated to give them (they might also be messy or judgmental)

16. In creating a research, which one of the following types of statements is always false?
a. Analytical statement - always true
b. Falsifiable statement - can be disproved by research
c. Contradictory statement - always false
d. Hypothetical statement
- We don't want analytical and contradictory statements
- We WANT falsifiable statements

17. One of the primary objectives in developing this scale was to provide an intelligence test suitable for adults, as previously available tests were all designed for school children.
a. Wechsler-Bellevue Intelligence Scale
b. Raven's Progressive Matrices
c. Stanford-Binet Intelligence Scale
d. Culture Fair Intelligence Test
- Stanford-Binet was first developed for school children (gifted or not) and was verbal
- Wechsler said we need a test for adults and with a nonverbal component
- In future revisions, SB added nonverbal
Wechsler Preschool and Primary Scale of Intelligence - 2 1/2 to 7 yrs
Wechsler Intelligence Scale for Children - 6 to 16 yrs
Wechsler Adult Intelligence Scale - 16 to 90

18. States that measurement error is always random, and advocates standardization of tests.
a. Domain Sampling Model
b. Classical Test Score Theory
c. Item Discriminability Analysis
d. Item Response Theory
In reliability, we have two theories: CTT and IRT
Classical test score theory
- Assumes that each person has a true score that would be obtained if there were no errors in measurement
- Total score = true score + error (X = T + E)
- Advocates standardization - minimize error by standardizing the test
- Domain sampling model - considers the problem created by using a limited number of items (the more items, the higher the reliability)

19. A company is conducting a study to evaluate the effectiveness of a new training program for improving employee productivity. The new group receives the new training program, while the old group continues with the standard procedures. After learning about the new program, the old group starts working harder than usual to demonstrate they can perform just as well without the new training.
a. Reactivity
b. Pygmalion effect
c. Rosenthal effect
d. John Henry effect
Reactivity - altering behavior due to awareness of being observed
- Hawthorne Effect - know they're being studied/observed
- John Henry Effect - control group in competition with experimental group

20. How do you establish Alternate Forms reliability?
a. You administer the form A of your personality inventory to your sample. After they are finished, you administer the form B.
b. You administer the form A to your sample, wait a couple of weeks, and then administer the form B to the same sample.
c. All of the above
d. None of the above

21. Subjects serve more than one condition of the independent variable.
a. Between-subjects design
b. Within-subjects design
c. Mixed design
d. Factorial design
Experimental designs
Between-subjects - subjects serve only one condition of IV
- Two/more different groups, only one condition of IV each
Within-subjects - aka repeated measures; subjects serve more than one condition of IV
- One group, more than one condition of IV
- Longitudinal studies
Mixed - one factor is within-subject, the other is between-subject; has two IVs
- Each group only experiences one condition of 1 IV, but also multiple conditions of the other IV
Ex:
SOAP          2x    4x    6x
jennie soap   Group 1: 30
lisa soap     Group 2: 30
jisoo soap    Group 3: 30
rose soap     Group 4: 30

22. Krisha developed a test with two sets. In order to identify its reliability, she should employ what statistical tool?
a. Pearson r - two sets, correlation to one another
b. Kuder-Richardson 20 - internal consistency (same level)
c. Cronbach's alpha - polychotomous, no correct answer
d. Kappa statistics - interrater

23. In reliability, what range estimate is acceptable in the clinical setting?
a. .50
b. .80
c. .70
d. .95
It has to be really certain/almost perfect because when it comes to the clinical setting, it's determining a person's fate
If basic research, .70 is acceptable

24. Elijah wants to increase the reliability of his 35-item test. In order to do so, what should he do?
a. Conduct pilot testing
b. Add more items
c. Find experts to validate the test
d. Correlate it to similar tests
Domain Sampling Model

25. According to the code of ethics, which of the following is true when it comes to disclosing information?
a. We disclose information only when the client provides permission to do so
b. We disclose information to the source of referral even without the consent of the client
c. When the people need to be protected from harm
d. All of the above
A - true; disclose with consent
B - true; need to give result to source of referral
C - true; duty to protect

26. Jhunar asked his professor for help regarding a sensitive case of a research subject he is currently handling in his study. He gave his professor the necessary information about the case, but not the name of the subject. Jhunar is protecting his subject's
a. Anonymity - protect identity
b. Confidentiality - protect information (test scores)
c. Obscurity
d. Privacy

27. Rics was given an intelligence test on Monday and she obtained a score of 98. She took the same test on Wednesday, and she obtained a score of 120. Based on this, the intelligence test is therefore
a. Reliable but not valid
b. Valid but not reliable
c. Not reliable and not valid
d. Reliable and valid
A test can be reliable but not valid; but a test CANNOT be valid unless it's reliable
Reliability limits the validity of the test

28. Obtained when the test measures what it purports to measure.
a. Criterion validity
b. Construct validity
c. Reliability
d. Concurrent validity

29. Which of the following personality tests does not score ambiguous responses?
a. Sack's Sentence Completion Test - sentence completion test
b. Rotter Incomplete Sentence Blank - sentence completion test
c. Purpose in Life test - scores ambiguous responses
d. None of the above

30. If there is evidence that the association between two variables is not significantly different from 0, then we
a. Reject the null hypothesis
b. Reject the alternative hypothesis - fail to reject null hypothesis
c. Accept the alternative hypothesis
d. Both a and b
Not significantly different from 0 - there is NO significant difference
Null: There is NO significant difference
Alternative: There IS a significant difference

31. At the very least, what should be the item difficulty of a multiple-choice item with four choices for it to be reasonable?
a. .25
b. .30
c. .20
d. .15
Item analysis - set of methods to evaluate items
Item difficulty - number of people who got the item correct
- Item easiness
- Optimal difficulty: halfway between 100% and the level of success expected by chance alone (item difficulty should be higher than the chance probability)
- .30-.70 to maximize information about the differences among individuals
Item discriminability - determines whether people who have done well on an item have also done well on the whole test (can discriminate high scorers from low scorers)
- Extreme group model: compares those who have done well to those who have done poorly
  - In each item, check which group scored more (high scorers should have more to have good discriminability)
- Point-biserial method: correlation between the performance on the item and on the test
  - One of the variables is dichotomous/categorical and the other is continuous

32. This is developed when the classical test score theory is deemed inadequate in identifying the true ability of the test takers.
a. Domain Sampling Model
b. Item discriminability
c. Item Response Theory
d. Item Analysis
Item response theory
- Focuses on the range of item difficulty that helps assess an individual's ability
- Need an item bank, with each item having its own difficulty
- Item branching - administering items based on response to previous item; can administer harder or easier items to gauge ability of test taker
- Computerized Adaptive Testing (CAT)

33. Test-retest reliability only applies to
a. Overt behaviors
b. Covert traits
c. Stable traits (relatively enduring) - personality = introversion/extroversion, IQ, dominance/assertiveness
d. Dominant traits/recessive (genetic factors)
- Test-retest: consistency of test over time

34. The researchers are in the middle of the experiment seeking to identify if noise affects concentration when suddenly the aircon turned off. If the experiment continues, what could be an extraneous variable in this case?
a. Noise
b. Concentration
c. Volume of the sound
d. Temperature
Extraneous variables - variables that are not part of the experiment, but they do exist and affect the results

35. Owie is a newly-hired psychometrician in a company. Just before her scheduled employee testing, her boss spoke to her and asked if she could finish the assessment, which normally takes 2 hours, within 30 minutes, justifying that the testing procedures are just a formality and the employee would be accepted no matter what. What should Owie do?
a. Finish the assessment within 30 minutes, as apparently the test would not be used as a basis for hiring selection
b. Compromise with the boss to give her at least 30 minutes more
c. Agree, but inform the applicant about the change
d. Do not agree and explain the testing procedures to the boss

36. Establishing this psychometric property requires good logical skills and intuition.
a. Construct validity
b. Content validity
c. Face validity
d. Interrater Reliability

37. He recognized the need for the rapid classification of recruits with respect to general intellectual level during World War I.
a. Alfred Binet
b. Robert Yerkes
c. Karl Pearson
d. Sir Francis Galton
WWI: Army Alpha and Army Beta; developed by Yerkes
- Army Alpha - literates; verbal
- Army Beta - illiterates; nonverbal

38. The more items a test has, the higher the reliability it will possess. The concept behind this is called
a. Domain Sampling Model
b. Item Response Theory
c. Classical Test Score Theory
d. Item Bank Analysis

39. In test-retest, carryover effects do not harm the reliability when
a. The changes in score happened on only a proportion of the test takers
b. The changes in score happened on all of the test takers
c. The changes in score affected only a few test items
d. The changes in score affected all the test items

40. Which of the following is an example of external consistency?
a. Interrater reliability
b. Alternate Forms reliability
c. Split-half reliability
d. None of the above
External consistency - Interrater
Temporal consistency - Test-retest
Form consistency - Alternate forms
Split-half consistency - Split-half

41. You just discovered your partner cheating in your apartment last night. You broke your relationship with him/her, and spent the whole night losing sleep. You have to go to work the next morning for your scheduled sessions with your client. In your current situation:
a. You should go to work and meet your clients tomorrow as it is our duty to provide service unconditionally
b. You should only choose to continue the sessions that are deemed crucial and take the rest of the day off
c. You should reschedule or refer them to other professionals as you are unfit to provide services to the clients
d. You should choose the cases related to your current situation in order to provide more effective service to the clients
You're not in the right/best condition to administer services

42. Your client just disclosed to you that although he has AIDS, he still engages in sexual relations with other people. He added that as today is the anniversary of his marriage with his wife, he also plans to sleep with her tonight. Based on the ethics, you should
a. Talk the client out of doing it, and never let him go for the night
b. Inform the wife and the corresponding institution/authorities to address the situation
c. Adhere to the confidentiality of the information but secretly inform the authorities to handle the matter
d. Never divulge the information, although try your hardest to deter your client from doing it
Duty to protect; harm may come to other people

43. This is established by identifying the total test scores of those who have answered correctly in a particular item of the test.
a. Item analysis
b. Item difficulty
c. Discriminability Analysis
d. Convergent analysis
Item discriminability is also called discriminability analysis

44. Which of the following is true about validity and reliability?
a. A test can be valid but not reliable.
b. A test can only be both valid and reliable.
c. A test can be reliable but not valid
d. A test once reliable is naturally valid

45. Kaye's IQ is two standard deviations above the mean. Her IQ is
a. 110
b. 85
c. 125
d. 130

46. In her study, Katherine seeks to determine whether the level of satisfaction of employees in Company A is higher than that of the employees in Company B. The statistical treatment to use would be
a. ANOVA
b. Spearman rho
c. One-tailed test
d. Two-tailed test
If you specify direction, one-tailed test
If you did not specify direction/only want to know level, two-tailed test

47. Chariz is trying to establish the construct validity of her newly-developed test about Bravery. She learned, based on her literature, that bravery is not related to egotism. A correlation was made between her participants' scores in the bravery test and in the test for egotism. For Chariz to establish validity, what should the result be?
a. There should be a correlation between the two tests
b. There should not be a correlation between the two tests
c. The results would be insufficient to determine the convergent validity of the test
d. The results would be insufficient as criterion validity must first be established
Asking for Divergent Validity

48. Which of the following does not describe Projective tests?
a. Projective instruments are more susceptible to faking
b. Most projective techniques are inadequately standardized with respect to both administration
c. Coefficients of Internal Consistency, when computed, have usually been low
d. Interpretation of scores is often as projective to the examiner as the test stimuli are for the examinee
Projective tests are hard to fake

49. Which is not true about case studies?
a. Low degree of manipulation of antecedent conditions
b. High degree of manipulation of antecedent conditions
c. Low imposition of units
d. High imposition of units
Non-experimental approaches
Phenomenology
- Lived experience
- Low degree of manipulation of antecedent conditions (IVs); low imposition of units (limiting the data you're getting from the subject)
Case studies
- Descriptive record of experiences
- Low manipulation of antecedent conditions; low to high imposition of units
Field studies
- Non-experimental approaches in the field
- Naturalistic observation (you just watch activities of group); participant observer studies (you join activities of group)
Archival studies
- Existing records reexamined for a new purpose
Qualitative research
- Words rather than numbers

50. Changes that cause the performance to improve as the experiment goes on
a. Practice effect
b. Fatigue effect
c. Progressive effect
d. Test-retest effect
Practice effect - performance improves
Fatigue effect - performance declines
Test-retest effect 42 Your client just disclosed to you that although he has AIDS, he still engages in sexual relations with other people. He added that as today is the anniversary of his marriage with his wife, he also plans to sleep with her tonight. Based on the ethics, you should a. Talk the client out of doing it, and never let him go for the night b. Inform the wife and the corresponding institution/authorities to address the situation c. Adhere to the confidentiality of the information but secretly inform the authorities to handle the matter d. Never divulge the information, although try your hardest to deter your client from doing it 43 This is established by identifying the total test scores of those who have answered correctly in a particular item of the test. a. Item analysis b. Item difficulty c. Discriminability Analysis d. Convergent analysis 44 Which of the following is true about validity and reliability? a. A test can be valid but not reliable. b. A test can only be both valid and reliable. c. A test can be reliable but not valid d. A test once reliable is naturally valid 45. Kaye’s 1Q is two standard deviations above the mean. Her IQ is a. 110 b. 85 c. 125 d. 130 46 In his study, Katherine seeks to determine whether the level of satisfaction of employees in Company A is higher than that of the employees in Company B. The statistical treatment to use would be a. ANOVA b. Spearman rho c. One-tailed test d. Two-tailed test 47 Chariz is trying to establish the construct validity of her newly-developed test about Bravery. She learned, based on her literature, that bravery is not related to egotism. A correlation was made between her participant's scores in the bravery test and in the test for egotism. For Chariz to establish validity, what should the result be? a. There should be a correlation between the two tests