








Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An overview of essay items, their purpose, and guidelines for writing effective essay questions. Essay items are open-ended assessment tools used to evaluate higher-order thinking skills, such as the ability to analyze, evaluate, and synthesize information. They are particularly useful for assessing the relationship between ideas and the ability to express ideas coherently and persuasively in writing. the advantages and disadvantages of essay items, including their flexibility, the importance of planning and outlining, and the challenges of scoring them.
Typology: Study notes
1 / 14
This page cannot be seen from the preview
Don't miss anything!









Difficulty Index ☺☺☺☺ (really pretty easy)
ne of the things that we will stress throughout this part of Tests & Measurement for People Who Hate Tests & Measurement is that the type of item you use as an assess- ment tool is closely tied to the level of information (or level of thinking skills) you want your test taker to show that he or she does (or does not) have. In Chapter 6, we emphasized how short answer items are best used when trying to assess memorization of factual information. In this chapter on essay items, you’ll find other tools that you can use to assess other types of outcomes. Here, we’ll focus on essay items —those items where the test taker is expected to write a coherent and informative response to a ques- tion. Forget about that Friday spelling test or even the SAT—essay tests are the real thing when it comes to seeing how well test takers integrate ideas and how well they can express them in written form.
Essay questions allow for perhaps the most unrestricted type of written assessment item that we will cover in Tests & Measurement for People Who Hate Tests & Measurement. What you want to know is how well the test taker can organize information and express his or her ideas in writing. That’s why the really, really big exams in one’s academic career are usually of the essay type—these types of items just tap more higher-level and complex skills.
Essay questions come in two basic flavors: open-ended (also called unrestricted or extended) questions and closed-ended (also called restricted) questions. An open-ended (or unrestricted response ) essay question is one where there are no restrictions on the response, including the amount of time allowed to finish, the number of pages written, or material included. Now, it is a bit impractical to allow test takers to have 25 hours to answer one essay question or to write hundreds of pages about anything they like. So, of course there are practical limits. It’s just that the limits do not define the scope of the response. For example, here’s an open-ended essay question.
That is a nice, open-ended question where the response is unrestricted. Now, take a look at this closed-ended question where the response is restricted.
These two types of questions reflect different types of experiences. The first question, which is much less restrictive, gives the test taker a lot more flexibility (among other things) and allows for a more creative approach. The more restricted closed-ended question places definite limits on the content as well as the format.
120—— Part III ♦ The Tao and How of Testing
Essay items are the item of choice if you want an unrestricted response and want to access higher-order thinking, such as the relationship between ideas and the pros and cons of a particular argument.
It’s not like this is that poorly designed a question, but it sure does not reflect a clear notion of what was learned or what is being assessed. This is the kind of a topic that some historian could write seven volumes about! Look how much more clear the following question is.
122—— Part III ♦ The Tao and How of Testing
Figure 7.1 Outline
As you can see, there are different levels of headings. By clicking various buttons on the toolbar right above the outline, the material can be rearranged, expanded, or collapsed, and many other operations can be performed. Are outlines for you? Your word processor has the option—try it and see.
This second example just provides additional direction, which may be exactly what the test taker needs to answer the question more completely.
Such a question requires a knowledge of rates of infant mortal- ity but goes far beyond the statement of just facts. In fact, here’s this chapter’s cheat sheet about what kinds of words signal higher-order skills (and these words usually show up in essay items).
Chapter 7 ♦ Essay Items: Hope You Can Write 123
Analyze
Apply
Arrange
Classify
Contrast
Compare
Create
Evaluate
Explain
Generalize
Infer
Integrate
Justify
Organize
Persuade
Predict
Summarize
Synthesize
Chapter 7 ♦ Essay Items: Hope You Can Write 125
Among the many reasons why essays are particularly useful is that they are very flexible in both form (the size and complexity) and purpose (relate simple ideas or elaborate a complex argument).
Essay items are absolutely terrific and almost indispensable for sampling higher-order thinking. I’ve mentioned that several times throughout this chapter. But they are a bear to score—time con- suming and very demanding of all the scorer’s attention. With that in mind, here are a few tips that might make the scor- ing process more efficient and result in a fairer assessment.
126—— Part III ♦ The Tao and How of Testing
Each of these elements, for example, could be scored separately. A model too difficult for you to do? Then at least have the impor- tant points listed on a separate sheet of paper against which you can compare test takers’ responses.
128—— Part III ♦ The Tao and How of Testing
Essay Items and Unreliability
You remember from Chapter 2 that our discussion about reliabil- ity focused in part on the reduction of error variance. The more error variance that is removed from the entire testing situation, the higher the reliability. OK—given that thought, there are a lot of sources of error variance when it comes to essay items, and perhaps the largest source is differences in grading. By its very nature, an essay item has so many indefinite things associated with it (objectivity of the grader, student’s writing skills, etc.) that it is even more important to control what you can. So, do try and ensure anonymity, use a model for scoring, and standardize conditions whenever possible. You’ll have a more reliable test, and the test takers will be treated more fairly.
Essay questions are terrific to get at those more sophisticated thinking skills that we have to assess to find out if an individual understands more complex ideas and how these ideas relate to one another. But although they might be relatively easy to create (they are short, after all, and only a few are needed), they can underrep- resent content, they are tough to score, and there are lots of reasons why they could be considered unreliable. So, use ’em only when they need to be used!
TIME TO PRACTICE
ANSWERS TO PRACTICE QUESTIONS
Discuss the origins of the testing movement in the United States and be sure to identify how the social and political events of the times helped encourage the growth of the testing industry (45 minutes).
Chapter 7 ♦ Essay Items: Hope You Can Write 129
examination were compared with students with the opposite pattern (top third on the multiple-choice questions and bottom third on the essay questions). Part of the findings? Students who were strong in the essay format and weak in the multiple-choice format were as successful in their college courses as students with the opposite pattern. Guess if you know it, you know it.
Chapter 7 ♦ Essay Items: Hope You Can Write 131