A conceptual introduction to psychometrics by G, J., Mellenbergh - a summary
- 4516 reads
Join with a free account for more service, or become a member for full access to exclusives and extra support of WorldSupporter >>
A conceptual introduction to psychometrics
Chapter 3
Typical performance tests
Typical performance tests assess behavior that is typical for the person.
These tests are used to measure attitudes, interests, values, opinions, and personality characteristics.
The test developer has to specify the latent variable of interest that is assumed to effect test takers’ item responses and test scores.
The ususal constructs of interest of typical performance tests are:
The responses to typical performance tests are not evaluated on their correctness, but are considered to typify a person.
At the start of a test development project, the researcher needs information on the construct of interest. This information can be obtained from different sources
A study of the literature on the construct and existing measurement instruments is nearly always needed at the start of a test development project
Different types of research can be done on the construct.
The test developer can use information from different sources to define the construct and, later on the test development process, he or she can use this information for item writing.
Each of these four modes can occur in tow different varieties
The reactive/nonreactive distinction is only used for typical performance measurements, and not for maximum performance measurements.
A maximum performance test asks test takers to do the best they can to perform the task.
Each of the four response modes can occur in two versions
Self-report mode
Test takers are asked to respond to questions or stimuli to assess their attitudes, values, interests, opinions, or personality.
Other-report mode
Uses other people to report on a given person’s typical performance construct.
Somatic indicators mode
Uses somatic signs to assess typical performance constructs.
Physical traces mode
Uses traces that persons left behind to assess their typical performance constructs
The test developer must define the target population and the inclusion and exclusion criteria of persons.
If subpopulations need to be distinguished, the test developer must define these subpopulations and must provide criteria to include persons in these subpopulations.
Tree broad classes of strategies to construct typical performance tests. In reach of these classes, tow specific test development methods are distinguished.
Construct method
Uses a theoretical framework.
The construct is defined, and it is embedded in a network of other constructs. The theory and its network are used to write items.
The facet design method
Generates items from a conceptual analysis of the construct that has to be measured by the test.
Starts with an inventory of the observable behavior that applies to the construct.
This behavior is classified according to a number of aspects, which are called facets. Each of these facets contains a number of facet elements.
Important facets for the construction of typical performance tests are behavioral and situational facets.
The facets are crossed, and items are written for each of the combinations of the different facet elements.
A typical performance item consists of a question or statement, and the test taker is asked to answer the question or to react to the statement.
A number of distinctions are made, and these are used to classify the response modes of typical performance items.
The response scales of closed-ended items are divided into:
Endorsement scales are subdivided into:
The intensity endorsement scales are subdivided in
Unipolar scale: a response scale can go from a zero point to one direction only.
Bipolar scale: a response scale can go from a negative pole to a positive pole
Dichotomous scale: a scale with only two categories
Ordinalpolytomous: a scale that has more than two ordered categories
Bounded-continuous scale: a continuous scale that is bounded, for example, with two endpoints
The main modes to administer typical performance tests are:
Pilot studies are necessary to assess the quality of concept items.
Usually, a large number of concept items has to be revised or has to be removed from the pool of concept items.
Three types of pilot studies:
Expert’s pilots
Concept items have to be reviewed by experts.
Three types of expertise are needed:
Test takers’ pilots
The concept items are administered to a small group test takers from the target population. Each of the test takers is interviewed about their thinking while answering the items.
Responses to typical performance items may be affected by response tendencies.
Response tendency: the differential application of the response scale.
Response style: the differential use of the item response scale by different persons.
A response style varies between responses, but it is relatively constant across measurements of different constructs and across measurements of the same construct on different constructs and across measurements of the same construct on different occasions.
It is a person-specific property.
Important response styles are:
Response set: the differential use of the item response scale by different persons and different constructs.
The response may differ between persons and between constructs, and is only relatively stable across measurements of the same construct on different occasions. It is a person/construct-specific property.
Response sets:
Social desirability is a person-specific property because it varies between persons.
It is also construct-specific because it may vary between constructs.
The best strategy is to assess social desirability with specific measurement instruments.
Acquiescence and dissentience can only occur with endorsement items and not inn frequency items.
The extremity and midpoint response styles can occur in both.
Acquiescence, dissentience, the extemity and midpoint styles can occur in both the reactive self-report and the reactive other-report measurement modes.
Social desirability can only occur in the reactive self-report mode.
Acquiesce and dissentience can be detected by including both indicative and contra-indicative items into the questionnaire.
The extremity and midpoint response styles are hard to detect.
The concept items that survived the pilot studies are used to compile the first draft of the test and instructions for test takers are added.
Usually, the instruction contains some example items to guarantee that test takers understand the test items.
Balanced test: consists of about 50% indicative and 50% contra-indicative items.
Social desirability items can also be added to the test.
Usually, indicative, contra-indicative, and social desirability items are arbitrarily mixed in the test.
The concept test is submitted to a group of experts. This group can be the same as the group that was used in the experts’ pilot study on item quality.
The group needs to have expertise about both the construct and test construction. The experts evaluate whether the test instruction is sufficiently clear for the population of test takers.
They study the content validation (whether the test adequately coves all aspects of the construct)
The comments of the experts are used to compile the first draft of the test.
The first draft is administered in a try-out to at least 200 test takers from the target population.
The try-out data are analyzed using methods of classical and modern test theory. s
This is a summary of the book A conceptual introduction to psychometrics by G, J., Mellenbergh. The summary contains chapter 1 to 6, and focusus on developing psychological tests.
The first chapter of this summary is for free, but to support worldsupporter and Joho,
...There are several ways to navigate the large amount of summaries, study notes en practice exams on JoHo WorldSupporter.
Do you want to share your summaries with JoHo WorldSupporter and its visitors?
Field of study
JoHo can really use your help! Check out the various student jobs here that match your studies, improve your competencies, strengthen your CV and contribute to a more tolerant world
4227 |
Add new contribution