Ecological Validity, Rater Bias, and the Worry Questionnaire, Exams of Psychology

The concept of ecological validity in psychological testing, examining how well a test measures what it intends to measure in real-world settings. It delves into the issue of rater bias, specifically leniency errors, and discusses strategies to mitigate such biases. The document also introduces the constructive and unconstructive worry questionnaire, a tool designed to assess individual differences in worry, highlighting its development process and potential applications.

Typology: Exams

2024/2025

Available from 02/18/2025

DrShirleyAurora
DrShirleyAurora 🇺🇸

4.4

(9)

6.2K documents

1 / 30

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
PSYC Chapter 6
Ecological validity refers to a judgement regarding how well a test measures what it purports to
measure
A) but only in a specified environment.
B) but only in a specified environment and within certain frequency limits.
C) at the time and place that the variable being measured is actually emitted.
D) All of the answers are correct. -
C) at the time and place that the variable being measured is actually emitted.
A study of the ecological validity of a test is likely to be conducted
A) by a researcher interested in learning about behavior that occurs at a specific time and place.
B) only during the season that the targeted behavior occurs if the targeted behavior is seasonal in
nature.
C) in an environment that is similar to one in which the targeted behavior will naturally occur.
D) All of the answers are correct. -
C) in an environment that is similar to one in which the targeted behavior will naturally occur.
After a live performance of Justin Bieber, the tweets of his die-hard fans on Twitter can be expected
to reflect _____ error.
A) a leniency
B) a generosity
C) Both leniency and generosity are correct.
D) None of the answers is correct. -
C) Both leniency and generosity are correct.
Gonsalvez and Crowe (2014) concluded that psychotherapy supervisors' judgments of supervisees'
competence are
1 | P a g e
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e

Partial preview of the text

Download Ecological Validity, Rater Bias, and the Worry Questionnaire and more Exams Psychology in PDF only on Docsity!

PSYC Chapter 6

Ecological validity refers to a judgement regarding how well a test measures what it purports to measure A) but only in a specified environment. B) but only in a specified environment and within certain frequency limits. C) at the time and place that the variable being measured is actually emitted. D) All of the answers are correct. - ✅C) at the time and place that the variable being measured is actually emitted. A study of the ecological validity of a test is likely to be conducted A) by a researcher interested in learning about behavior that occurs at a specific time and place. B) only during the season that the targeted behavior occurs if the targeted behavior is seasonal in nature. C) in an environment that is similar to one in which the targeted behavior will naturally occur. D) All of the answers are correct. - ✅C) in an environment that is similar to one in which the targeted behavior will naturally occur. After a live performance of Justin Bieber, the tweets of his die-hard fans on Twitter can be expected to reflect _____ error. A) a leniency B) a generosity C) Both leniency and generosity are correct. D) None of the answers is correct. - ✅C) Both leniency and generosity are correct. Gonsalvez and Crowe (2014) concluded that psychotherapy supervisors' judgments of supervisees' competence are

A) compromised by leniency errors. B) compromised by severity errors. C) reasonably accurate given subsequent ratings. D) unreliable in the light of subsequent ratings. - ✅A) compromised by leniency errors. To improve raters' judgments of competency, Gonsalvez and Crowe (2014) recommended that A) at least three raters be used. B) specific competencies be evaluated. C) all raters be certified as competent themselves. D) All of the answers are correct. - ✅B) specific competencies be evaluated. Prior to the development of the Constructive and Unconstructive Worry Questionnaire, research on worry had shown that the act of worrying can lead to A) positive outcomes. B) negative outcomes. C) Both positive outcomes and negative outcomes are correct. D) None of the answers is correct. - ✅C) Both positive outcomes and negative outcomes are correct. A review of existing measures of individual differences in worry suggested to the authors of the Constructive and Unconstructive Worry Questionnaire that none of the measures were made to distinguish people's tendency to worry A) about things with momentous consequences versus those with trivial consequences. B) about things coming up in the future versus things one had done in the past. C) in an ideal-based fashion from a reality-based fashion. D) constructively from their tendency to worry unconstructively. -

B) 40; unique C) 80; unique D) 40; relatively equal in difficulty - ✅B) 40; unique In the development of the Constructive and Unconstructive Worry Questionnaire, after a review of the preliminary items, a total of _____ items remained in the final form of the test. A) 40 B) 18 C) 16 D) 12 - ✅B) 18 In the development of the Constructive and Unconstructive Worry Questionnaire, the test authors hypothesized that the tendency to worry _____ would be positively related to trait-anxiety. A) excessively B) frequently C) constructively D) unconstructively - ✅A) excessively In the development of the Constructive and Unconstructive Worry Questionnaire, the test authors hypothesized that the tendency to worry _____ would be negatively related to one's tendency to be punctual. A) excessively B) frequently C) constructively D) unconstructively - ✅D) unconstructively

In the development of the Constructive and Unconstructive Worry Questionnaire, the subjects in one of the preliminary studies were A) 98 Korean foreign exchange students studying at New York University. B) 398 convicted felons in the federal prison system. C) 698 residents of a South Florida trailer park during the hurricane season. D) 998 Australian residents of wildfire-prone areas. - ✅D) 998 Australian residents of wildfire-prone areas. In the development of the Constructive and Unconstructive Worry Questionnaire, which research tool was used to assist the test developers in selecting the final form of the test? A) analysis of variance B) regression analysis C) critical incident analysis D) factor analysis - ✅D) factor analysis In the development of the Constructive and Unconstructive Worry Questionnaire, the amount of worry one experiences was captured using A) the Worry Domains Questionnaire. B) the Penn State Worry Questionnaire. C) trained raters marking a 5-point scale. D) Both the Worry Domains Questionnaire and the Penn State Worry Questionnaire are correct. - ✅D) Both the Worry Domains Questionnaire and the Penn State Worry Questionnaire are correct. For future research on the validity of the Constructive and Unconstructive Worry Questionnaire, the developers of this test suggested that studies be conducted using A) a clinical population of pathological worriers.

"It's a measure of validity that is arrived at by a comprehensive analysis of how scores on a test relate to other test scores." This statement refers to A) face validity. B) content validity. C) the trinitarian index. D) construct validity. - ✅D) construct validity. Messick supported a unitary view, while _____ supported the trinitarian approach. A) Cronbach B) Lawshe C) Guion D) Dangerfield - ✅C) Guion In Chapter 6 of your text, Adam Shoemaker, the featured professional in Meet an Assessment Professional, described the use of a test with little criterion validity. Dr. Shoemaker recalled that this test was used for the purpose of A) gauging inter-item consistency of another test. B) gaining "buy-in" from the test users. C) providing a "job preview" of sorts to aspirants. D) hiring candidates for mid-level executive positions. - ✅C) providing a "job preview" of sorts to aspirants. Criterion-related validity is to predictive validity as criterion-related validity is to A) construct validity. B) content validity.

C) concurrent validity. D) test bias. - ✅C) concurrent validity. Test blueprinting is applied in the design of A) an attitude test. B) a personality test. C) an employment test. D) All of the answers are correct. - ✅D) All of the answers are correct. In order to remain consistent with a test's blueprint, a test administered on a regular basis is likely to require A) item-pool management. B) base rate maintenance. C) predictive validity certification. D) None of the answers is correct. - ✅A) item-pool management. The effect of _____ of test scores for remedying adverse impact is to make equivalent all scores that fall within a particular range. A) within-group norming B) differential cutoffs C) preference policies D) banding - ✅D) banding "How can group differences on cognitive ability tests be reduced while retaining existing high levels of reliability and criterion-related validity?" According to Gottfredson, the answer to this question

Relating scores obtained on a test to other test scores or data from other assessment procedures is typically done in an effort to establish the _____ validity of a test. A) content-related B) criterion-related C) face D) about-face - ✅B) criterion-related Face validity refers to A) the most preferred method for determining validity. B) another name for content validity. C) the appearance of relevancy of the test items. D) validity determined by means of face-to-face interviews. - ✅C) the appearance of relevancy of the test items. Face validity A) may influence the way a test taker approaches the situation. B) relates more to what a test appears to measure than what the test may actually measure. C) is given short-shrift as compared to other indices of validity. D) All of the answers are correct. - ✅D) All of the answers are correct. Which assessment technique is the best example of a face-valid method? A) a personality test in which test takers are asked to describe what they see in inkblots B) administering a word processing test to a person applying to a job that requires the use of a word processor C) asking test takers to draw a picture of their family to assess family relationships D) measuring the height of applicants applying for a semi-pro basketball team -

✅B) administering a word processing test to a person applying to a job that requires the use of a word processor In an undergraduate measurement course, an instructor announces that the first examination will cover the topics of reliability and validity. One student in the class, Jamarr, publicly predicts that only questions on reliability will be posed. As it turns out, true to Jamarr's prediction, all of the test questions are only on the topic of reliability. Given this background, which of the following is the most reasonable conclusion that Jamarr's fellow students could draw? A) The first examination lacked concurrent validity. B) The first examination lacked content validity. C) The first examination lacked face validity. D) Jamarr should be consulted prior to the second examination. - ✅B) The first examination lacked content validity. _____ is defined as the degree to which an additional predictor explains something about the criterion measure that is not explained by predictors already in use. A) A false positive rate B) Evidence of construct validity C) Predictive validity D) Incremental validity - ✅D) Incremental validity Before constructing a comprehensive final examination that covers everything you have studied since the first day of your course, your instructor reviews the objectives of the course, the textbook, and all lecture notes. Your instructor is clearly making a diligent effort to maximize the _____ validity of the final examination. A) content B) criterion-related C) predictive D) internal consistency - ✅A) content

B) content validity and predictive validity C) concurrent validity and predictive validity D) concurrent validity and content validity - ✅C) concurrent validity and predictive validity The form of criterion-related validity that reflects the degree to which a test score is correlated with a criterion measure obtained at the same time that the test score was obtained is known as A) predictive validity. B) construct validity. C) concurrent validity. D) content validity. - ✅C) concurrent validity. The form of criterion-related validity that reflects the degree to which a test score correlates with a criterion measure that was obtained subsequent to the test score is known as A) predictive validity. B) construct validity. C) concurrent validity. D) content validity. - ✅A) predictive validity. A key difference between concurrent and predictive validity has to do with A) the time frame during which data on the criterion measure is collected. B) the magnitude of the reliability coefficient considered significant at the .05 level. C) the magnitude of the validity coefficient considered significant at the .05 level. D) Both the magnitude of the reliability coefficient considered significant at the .05 level and the magnitude of the validity coefficient considered significant at the .05 levelare correct. - ✅A) the time frame during which data on the criterion measure is collected.

Which is an example of a criterion? A) achievement test scores B) success in being able to repair a defective toaster C) student ratings of teacher effectiveness D) All of the answers are correct. - ✅D) All of the answers are correct. Criterion contamination occurs when A) the criterion measure is influenced by the predictor measure. B) subjects talk to one another about the test. C) the characteristic being measured occurs with low frequency in the group being studied. D) All of the answers are correct. - ✅A) the criterion measure is influenced by the predictor measure. According to the text, face validity may ultimately be more of an issue of _____ than _____. A) social values; psychometric soundness B) psychometric soundness; public relations C) public relations; psychometric soundness D) social values; public perception - ✅C) public relations; psychometric soundness An investigation of a test's construct validity may yield evidence that A) the test is measuring a single construct. B) the test correlates with another test purporting to measure the same construct. C) test scores increase as a function of age. D) All of the answers are correct. - ✅D) All of the answers are correct.

D) All of the answers are correct. - ✅D) All of the answers are correct. Which magnitude of validity coefficient is typically acceptable to conclude that a test is valid? A) 1. B) 1. C) above 1. D) None of the answers is correct. - ✅D) None of the answers is correct. A coefficient of correlation is calculated between Henry's score on a test of sociopathy and a clinician's rating of Henry on the variable of sociopathy. This coefficient of correlation might also be referred to as A) an index of reliability. B) an index of sociopathy. C) a validity coefficient. D) a content-related validity coefficient. - ✅C) a validity coefficient. Employment test data suggests that an individual applicant is incapable of successfully performing a particular job. However, in reality, this individual would be very successful at the job. Such a scenario exemplifies A) a base rate. B) a false positive. C) a false negative. D) a false expectancy. - ✅C) a false negative. Which is an example of a false positive?

A) A test identifies a client as schizophrenic when the client is not. B) A test correctly identifies a client as schizophrenic. C) A test correctly identifies a client as not having schizophrenia. D) A test indicates that a client is not schizophrenic when in fact the client is. - ✅A) A test identifies a client as schizophrenic when the client is not. If you were a psychologist working in the field of human resource management, which claim for a new personnel selection test by a test publisher would be most compelling and persuasive? A) The test identifies a large number of false positives. B) The test improves the hit rate. C) The test identifies a large base rate. D) The test improves the selection ratio. - ✅B) The test improves the hit rate. A construct is A) unobservable. B) something that describes behavior. C) something that is assumed to exist. D) All of the answers are correct. - ✅D) All of the answers are correct. Which qualifies as a construct? A) depression B) intelligence C) mechanical aptitude D) All of the answers are correct. - ✅D) All of the answers are correct.

C) education. D) All of the answers are correct. - ✅D) All of the answers are correct. If a test is a valid measure of a particular construct, we would expect that A) groups of people who differ with respect to the construct will obtain different test scores. B) groups of people who differ with respect to the construct will obtain similar test scores. C) groups of people who obtain similar scores will have similar personalities. D) None of the answers is correct. - ✅A) groups of people who differ with respect to the construct will obtain different test scores. A significant, positive relationship exists between scores on a new test of intelligence and scores on the fourth edition of the Stanford-Binet intelligence scale. This may be viewed as supportive of which type of validity evidence for the new test? A) criterion-related validity B) content validity C) convergent evidence of construct validity D) discriminant evidence of construct validity - ✅C) convergent evidence of construct validity A statistically insignificant correlation exists between scores on a new test of depression and a well- established measure of satisfaction with life. These data may be construed as which type of validity evidence with regard to the test of depression? A) criterion-related validity B) convergent evidence of construct validity C) discriminant evidence of construct validity D) None of the answers is correct because there was an insignificant relationship. - ✅C) discriminant evidence of construct validity

The names attributed to different factors in a factor analysis are A) dictated by the factors themselves. B) subject to change as new analyses occur. C) thoroughly validated against dictionary definitions. D) typically dependent on an analyst's judgment. - ✅D) typically dependent on an analyst's judgment. In the context of validity, a valid test A) may be used fairly. B) may be used unfairly. C) may be used either fairly or unfairly. D) is only used by biased test users. - ✅C) may be used either fairly or unfairly. A test is considered to be biased if A) 50 percent of the test takers fail the test. B) one group, such as males, consistently performs better than another group, such as females. C) a factor inherent in the test systematically prevents accurate measurement. D) the test developer was found to harbor prejudice against some group. - ✅C) a factor inherent in the test systematically prevents accurate measurement. Which is true regarding a rating? A) It refers only to a numerical judgment that places a person or an attribute along a continuum. B) It refers only to a verbal judgment that places a person or an attribute along a continuum. C) It tends not to involve a judgment. D) It refers to either a numerical or a verbal judgment that places a person or an attribute along a continuum. -