WSRt, critical thinking - a summary of all the articles needed in the third block of the second year of psychology at the UvA
Validity: the extent to which test results can be interpreted in terms of the construct the test is intended to measure.
The nomological network: the system of hypothetical relations around the construct.
This can be part of the theory.
Forms of validity:
Impression validity: a subjective judgment of the usability of a measurement instrument, based on the directly observable properties of the test material.
Content validity: the judgment of how representative the observations, tasks, and questions are for a certain purpose.
Criterion validity: the (cor)relation between the test score and a psychological or social criterion.
Process validity: the manner in which the response comes about.
Construct validity: the degree of correspondence between the strictly formulated, hypothetical relations between the measured construct and other constructs, and the empirically demonstrated relations between the instruments that are supposed to measure those constructs.
Internal-consistency reliability: the mutual cohesion of the items that form a scale or subtest.
Test-retest reliability: repeated measurements with the same instrument.
Local reliability: an impression of the reliability of the measurement within a certain range of scores.
Homogeneity or consistency reliability: the cohesion between the different items of a scale. In psychological measurement, the items are assumed to be repeated, independent measurements of a trait.
The reliability of the prediction: the repeatability of the prediction at a certain point in time.
Stability of the prediction: the repeatability of the prediction over time.
Base rate: the proportion of people in the population that possesses a particular trait, behaviour, characteristic, or attribute.
Criterion group: a group, representative for the intended use of the test, of which all members show the same criterion behaviour and of which all criterion scores are known.
Hits
Hit: a correct classification
Hit rate: the proportion of people that an assessment tool accurately identifies as possessing or exhibiting a particular trait, ability, behaviour, or attribute
Misses
Miss: an incorrect classification
Miss rate: the proportion of people that an assessment tool inaccurately identifies as possessing or exhibiting a particular trait, ability, behaviour, or attribute
Return on investment: the ratio of benefits to costs.
Prediction error or classification error: the percentage of cases wrongly classified by the test.
False positive: a specific type of miss whereby an assessment tool falsely indicates that the test-taker possesses or exhibits a particular trait, ability, behaviour, or attribute
False negative: a specific type of miss whereby an assessment tool falsely indicates that the test-taker does not possess or exhibit a particular trait, ability, behaviour, or attribute
Sensitivity and specificity
Sensitivity or predictive accuracy: the percentage of cases correctly identified by the test as having the trait (hits).
Specificity: the percentage of cases correctly identified by the test as not having the trait.
Predictive values
Positive predictive value (PPV): of all the people the test says have the trait, the percentage that actually has it.
Negative predictive value (NPV): of all the people the test says do not have the trait, the percentage that actually does not have it.
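These quantities all fall out of a simple 2x2 classification table. A minimal sketch (the counts below are invented purely for illustration):

```python
# Invented counts for a hypothetical screening test:
# rows = actual status, columns = test outcome.
tp, fn = 40, 10    # people who have the trait: test positive / test negative
fp, tn = 30, 120   # people who lack the trait: test positive / test negative

total = tp + fn + fp + tn
base_rate = (tp + fn) / total     # prevalence of the trait in this sample
hit_rate = (tp + tn) / total      # proportion of correct classifications
sensitivity = tp / (tp + fn)      # hits among those who have the trait
specificity = tn / (tn + fp)      # correct rejections among those who lack it
ppv = tp / (tp + fp)              # of all positive test results, how many are right
npv = tn / (tn + fn)              # of all negative test results, how many are right

print(f"base rate {base_rate:.2f}, sensitivity {sensitivity:.2f}, "
      f"specificity {specificity:.2f}, PPV {ppv:.2f}, NPV {npv:.2f}")
```

Note how PPV (about 0.57 here) can sit far below sensitivity (0.80) when false positives are common.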
Measurement model: concerns what the constructor wants to measure
Structural model: concerns what the constructor wants to predict
Prediction: the test score is established first, then the criterion score
Postdiction: the criterion score is established first, then the test score
The reflective interpretation: the measured attribute is conceptualized as the common cause of the observables
Formative interpretation: the measured attribute is seen as the common effect of the observables.
Utility of an instrument: the usefulness of an instrument as it appears from a cost-benefit analysis.
Utility analysis: a family of techniques that entail a cost-benefit analysis designed to yield information relevant to a decision about the usefulness and/or practical value of a tool of assessment.
Utility gain: an estimate of the benefit of using a particular test or selection method.
Norm-group: the group of people that forms the norm
Taylor-Russell tables: provide an estimate of the increase in the base rate of successful performance that is associated with a particular level of criterion-related validity.
Naylor-Shine tables: tell us the likely average increase in criterion performance as a result of using a particular test or intervention; they also provide the selection ratio needed to achieve a particular increase in criterion performance.
The expectancy table or chart: tells us the likelihood that individuals who score within a given range on the predictor will perform successfully on the criterion.
A domain: a wide area of more or less coherent properties.
Difficulty: the attribute of not being easily accomplished, solved, or comprehended.
Discrimination: the degree to which an item differentiates among people with higher or lower levels of the trait, ability, or whatever else is being measured.
Local dependence: items are all dependent on some factor that is different from what the test as a whole is measuring. Items are locally dependent if they are more related to each other than to the other items on the test.
Dichotomous or polytomous
Dichotomous test items: test items or questions that can be answered with only two alternative responses.
Polytomous test items: test items or questions with three or more alternative responses, where only one is scored as being consistent with a targeted trait or other construct.
Inter and intra individual differences
Inter-individual differences: differences between people
Intra-individual differences: differences within people
Cross-validation: a check on the instability of outcomes
Method variance: systematic variance resulting from the measurement procedure with which the trait is measured.
Rasch model: an IRT model with very specific assumptions about the underlying distribution.
Information in IRT: the precision of measurement.
Item characteristic curve (ICC), an item response curve, a category response curve, or an item trace line: the expression in graphic form of the probabilistic relationship between a test-taker’s response to a test item and that test-taker’s level on the latent construct being measured.
The unidimensionality assumption: the set of items measures a single continuous latent construct. This construct is referred to by the Greek letter theta (θ).
The assumption of local independence: a) there is a systematic relationship between all of the test items and b) that relationship has to do with the theta level of the test-taker. When the assumption is met, it means that differences in responses to items are reflective of differences in the underlying trait or ability.
The assumption of monotonicity: the probability of endorsing or selecting an item response indicative of higher levels of theta should increase as the underlying level of theta increases.
This magazine contains all the summaries you need for the course WSRt in the second year of psychology at the UvA.
Critical thinking
Article: Stouthard, M. E. A.
Validity
Validity: if test-results can be interpreted in terms of the construct the test tries to measure.
A test is taken to make an inference about a construct that lies outside the measurement instrument itself and that the instrument is supposed to measure.
The meaning of the test results lies in the extent to which they are an indication of the construct.
Validity is an overarching concept. It is a term for a number of possible properties of a test.
Often, multiple sorts of empirical knowledge are needed to establish the validity of a test.
Which sources of empirical knowledge are important for a test depends on the intended use of the test.
Two sorts of validity:
The difference between these two isn’t absolute.
When a test is meant to predict behaviour outside the test situation, it is relevant to ask whether the instrument is a good predictor of that behaviour.
The better the test predicts the variation in the criterion, the higher the validity of the test.
The criterium
Like a test, a criterion is an operationalization of an underlying concept.
Multiple criteria are possible.
There are different methods to distinguish between criteria.
Kinds of criteria
A distinction between:
A distinction in time
A distinction of criteria in the future
Relation test and criterium
The relation between a test and a criterion is mostly expressed as a correlation between the two.
There is an association, but not necessarily causality.
A condition for interpreting a relation between a test and a criterion as support for criterion-oriented validity is that there is at least one
Critical thinking
Article: Oosterveld & Vorst (2010)
Psychological measurement-instruments
The construction of measurement instruments is an important subject.
Measurement aims of an instrument: the goal of a measurement instrument.
This concerns a more or less hypothetical property.
The domain of human behaviour
The instrument is usually focused on measuring a property in a global domain of human behaviour.
A domain: a wide area of more or less coherent properties.
Observation methods
Every measurement-instrument uses one or more observation methods. For different properties of different domains, usually different observation methods are used.
When properties are measured with different observation methods, it is logical that with different methods, different domains of the traits or categories are measured.
Instruments based on one observation method seem to form a common method-factor, which usually is stronger than the common trait-factor of equal traits measured with different observation methods.
Theory
The development of an instrument is usually based on an elaborated theory, on insights from empirical research, or on ideas based on informal knowledge.
Instruments developed on the basis of formal knowledge and an elaborated theory are of better quality than instruments based on informal knowledge and a poorly formulated theory.
Construct
An instrument is the elaboration of a construct that refers to a combination of properties.
Measurement instruments for specific (latent) traits are of better quality than instruments for global or composite traits.
Structure
The structure of a test depends on the properties it measures.
Unstructured observation methods are those whose measurement conditions are not standardized; as a result, their results are difficult to compare across persons and situations, and objective scores are difficult to obtain.
Application possibilities
The application possibilities a researcher wants to achieve with a measurement instrument can relate to theoretical or descriptive research.
This involves the analysis of a great number of observations.
For individual applications, high requirements are placed on the realised measurement aims.
Costs
An often decisive element in the description of the measurement aims of a measurement instrument is the cost of that instrument.
Dimensionality
An instrument consists of one or more measurement scales or subtests.
Multiple scales refer to multiple dimensions of the construct and to a subdivision into multiple latent traits or latent categories.
An instrument that is based on a specific latent trait must be unidimensional.
Reliability
Three kinds of reliability:
Critical thinking
Article: Kan, K., & van der Maas, H. (2010)
Intelligentie versus cognitie: tijd voor een (goede) relatie (Intelligence versus cognition: time for a (good) relationship)
Inter-individual differences: differences between people
Intra-individual differences: differences within people
There are many different views regarding intelligence. This makes it difficult to pin down what people in psychology call intelligence.
Alternative theories
In some cases, mutual interactions between populations lead to a situation in which the parties benefit from each other.
The growth of one population causes the other population to grow, and vice versa.
This dynamical interaction is called mutualism.
As a result of individual differences in limited capacities, and as a result of mutualistic interactions between cognitive processes, cognitive processes become correlated in the course of development.
The functionally independent cognitive functions within each individual become positively correlated.
The functionally independent cognitive functions become statistically dependent across groups of people.
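The mutualism idea can be illustrated with a toy simulation (the growth equation, coupling strength, and capacity ranges below are illustrative assumptions, not taken from the article): each person has two cognitive abilities that grow logistically toward independently drawn limited capacities, with a positive cross term, and across people the abilities end up positively correlated even though the capacities are uncorrelated.

```python
import random

def pearson(xs, ys):
    """Plain Pearson correlation, to avoid external dependencies."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5

random.seed(1)
n_people, steps, dt, coupling = 500, 200, 0.05, 0.2
final_a, final_b = [], []
for _ in range(n_people):
    # limited capacities drawn independently: no built-in correlation
    cap_a, cap_b = random.uniform(1, 3), random.uniform(1, 3)
    a = b = 0.1
    for _ in range(steps):
        # logistic growth plus a mutualistic (positive) interaction term
        da = a * (1 - a / cap_a) + coupling * a * b
        db = b * (1 - b / cap_b) + coupling * a * b
        a, b = a + dt * da, b + dt * db
    final_a.append(a)
    final_b.append(b)

print(f"correlation between abilities: {pearson(final_a, final_b):.2f}")
```

The correlation comes out clearly positive, even though nothing shared was put into the individuals at the start.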
Implications
It is possible that the positive association between cognitive abilities is caused by mutualistic interactions during cognitive development and by measurement problems.
It can't be ruled out that some influences affect the development of all (or multiple) cognitive abilities.
Intelligence can best be compared to an index of general health; it isn't a real property the way cognitive processes are.
Critical thinking
Article: Schmittmann, V. D., Cramer, A. O. J., Waldorp, L. J., Epskamp, S., Kievit, R. A., & Borsboom, D. (2011)
Deconstructing the construct: A network perspective on psychological phenomena
In psychological measurement, three interpretations of measurement systems have been developed:
In reflective models, observed indicators (item or subject scores) are modelled as a function of a common latent (unobserved) variable and item-specific error variance.
Commonly presented as ‘measurement models’.
In these models, a latent variable is introduced to account for the covariance between indicators.
In formative models, (possibly latent) composite variables are modelled as a function of indicators.
Without residual variance on the composite, models like principal components analysis and clustering techniques serve to construct an optimal composite out of observed indicators.
But, one can turn the composite into a latent composite if one introduces residual variance on it.
This happens, for instance, if model parameters are chosen in a way that optimizes a criterion variable.
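The contrast can be made concrete with a toy simulation (the variable names and numbers are invented, purely for illustration): in the reflective case one latent variable causes several indicators, so the indicators correlate; in the formative case the composite is computed from indicators that need not correlate at all.

```python
import random

def pearson(xs, ys):
    """Plain Pearson correlation, to avoid external dependencies."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / (sum((x - mx) ** 2 for x in xs)
                  * sum((y - my) ** 2 for y in ys)) ** 0.5

random.seed(0)
n = 2000

# Reflective: a common latent variable is the common CAUSE of the indicators.
latent = [random.gauss(0, 1) for _ in range(n)]
item1 = [t + random.gauss(0, 0.5) for t in latent]
item2 = [t + random.gauss(0, 0.5) for t in latent]

# Formative: the composite is the common EFFECT of its indicators
# (think of a socio-economic index computed from income and education).
income = [random.gauss(0, 1) for _ in range(n)]
education = [random.gauss(0, 1) for _ in range(n)]
ses = [0.5 * i + 0.5 * e for i, e in zip(income, education)]

print(f"reflective indicators correlate: r = {pearson(item1, item2):.2f}")
print(f"formative indicators need not:   r = {pearson(income, education):.2f}")
```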
Formative models differ from reflective models in many aspects
The role of time
In most conceptions of causality, causes are required to precede their effects in time.
But, in psychometric models like the reflective and formative models, time is generally not explicitly represented.
The dynamics of the system are not explicated.
This puts the causal interpretation of latent
Critical thinking
Article: Cohen
Item response theory (IRT)
The procedures of item response theory provide a way to model the probability that a person with X ability will be able to perform at a level of Y.
Because so often the psychological or educational construct being measured is physically unobservable (latent), and because the construct being measured may be a trait, a synonym for IRT is latent-trait theory.
IRT is not a term used to refer to a single theory or method.
It refers to a family of theories and methods, and quite a large family at that, with many other names used to distinguish specific approaches.
Difficulty: the attribute of not being easily accomplished, solved, or comprehended.
Discrimination: the degree to which an item differentiates among people with higher or lower levels of the trait, ability, or whatever else is being measured.
A number of different IRT models exists to handle data resulting from the administration of tests with various characteristics and in various formats.
Other IRT models exist to handle other types of data.
In general, latent-trait models differ in some important ways from CTT.
Such assumptions are inherent in latent-trait models.
Rasch model: an IRT model with very specific assumptions about the underlying distribution.
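A sketch of what those specific assumptions amount to in practice: in the common one-parameter logistic form (the formula is standard but not given in this summary), the probability of a keyed response depends only on the difference between the test-taker's theta level and the item's difficulty.

```python
import math

def rasch_prob(theta, difficulty):
    """Rasch (one-parameter logistic) item response function:
    P(keyed response) depends only on theta minus item difficulty."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))

# Tracing this over theta gives the item characteristic curve (ICC);
# monotonicity means the probability rises as theta rises.
for theta in (-2, -1, 0, 1, 2):
    print(theta, round(rasch_prob(theta, difficulty=0.0), 3))
```

A person whose theta equals the item's difficulty has exactly a 0.5 probability of the keyed response.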
Three assumptions regarding data to be analysed within an IRT framework.
Unidimensionality
The unidimensionality assumption: the set of items measures a single continuous latent construct.
This construct is referred to by the Greek letter theta (θ).
It is a person’s theta level that gives rise to a response to the items in the scale.
Theta level: a reference to the degree of the underlying ability or trait that the test-taker is presumed to bring to the test.
The assumption of unidimensionality does not preclude that the set of items may have a number of minor dimensions (which, in turn, may be measured by subscales).
It does assume that one dominant dimension explains the underlying structure.
Local independence
Local dependence: items are all dependent on some factor that is different from what the test as a whole is measuring. Items are locally dependent if they are more related to each other than to the other items on the test.
Critical thinking
Article: Oosterveld & Vorst (2010)
Testconstructie en testonderzoek (Test construction and test research)
There are problematic theories about validity
Examples of viewpoints
Borsboom (2003)
According to Borsboom, it is plausible that the mercury thermometer is a valid measure of the temperature of objects, because differences in the real temperature cause differences in the readings of the measurement instrument.
If the causal chain is described exactly, and this is a plausible representation of reality, then the instrument is valid in reality.
Real validity remains unknown as long as not all relevant knowledge is available.
Because it is in principle unknown to what extent relevant knowledge is available, validity is hypothetically uncertain.
Even if the causal chain between true variation in the trait and the measured variation is well known, knowledge about causal chains can change with new knowledge. This is why real validity is hypothetical.
However, people can form a judgment about the validity of measurement instruments. This validity judgment need not have anything to do with the real validity.
In psychology, establishing true causal chains is (as yet) impossible.
That is why psychology temporarily works with hypothetical validity judgments, pending more precise and true causal chains between true trait variation and measurement variation.
The quality of measurement, not the validity, must be demonstrated through psychometric analysis (reliability, unidimensionality, representative content of the measurement instrument, relations with external criteria, support for theoretically expected relations).
Science-philosophical viewpoint
Description of validity
Derived statements
Research into measurement quality/validity
Critical thinking
Article: Cohen
Utility Analysis
Utility analysis: a family of techniques that entail a cost-benefit analysis designed to yield information relevant to a decision about the usefulness and/or practical value of a tool of assessment.
It is not one specific technique used for one specific objective. It is an umbrella term covering various possible methods, each requiring various kinds of data to be inputted and yielding various kinds of output.
In a most general sense, a utility analysis may be undertaken for the purpose of evaluating whether the benefits of using a test outweigh the costs.
If undertaken to evaluate a test, the utility analysis will help make decisions whether:
If undertaken for the purpose of evaluating a training program or intervention, the utility analysis will help make decisions regarding whether:
The endpoint of a utility analysis is typically an educated decision about which of many possible courses of action is optimal.
The specific objective of a utility analysis will dictate what sort of information will be required as well as the specific methods to be used.
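One widely taught way to put numbers on such a decision is the Brogden-Cronbach-Gleser utility estimate. The sketch below uses that textbook formula; all the figures plugged in are invented, purely for illustration.

```python
def utility_gain(n_hired, tenure_years, validity, sd_y, mean_z_selected,
                 n_applicants, cost_per_applicant):
    """Brogden-Cronbach-Gleser style utility estimate:
    gain = (hires x tenure x validity x SD of job performance in money terms
            x mean standardized test score of those hired)
           minus the total cost of testing."""
    benefit = n_hired * tenure_years * validity * sd_y * mean_z_selected
    cost = n_applicants * cost_per_applicant
    return benefit - cost

# Invented example: 10 hires from 100 applicants, 2-year tenure, validity .40,
# SD of performance in money terms 10,000, mean z of hires 1.0, test cost 50 each.
gain = utility_gain(10, 2, 0.40, 10_000, 1.0, 100, 50)
print(f"estimated utility gain: {gain:.0f}")  # 80,000 benefit minus 5,000 cost
```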
Expectancy data
Some utility analyses will require little more than converting a scatterplot of test data to an expectancy table.
An expectancy table can provide an indication of the likelihood that a test-taker will score within some interval of scores on a criterion measure.
Taylor-Russell tables: provide an estimate of the increase in the base rate of successful performance that is associated with a particular level of criterion-related validity.
The value assigned for the test’s validity: the computed validity coefficient.
But, the relationship must be linear.
Naylor-Shine tables: tell us the likely average increase in criterion performance as a result of using a particular test or intervention.
Critical thinking
Article: Oosterveld & Vorst (2010)
Voorspellen van een criteriumwaarde (Predicting a criterion value)
Test scores and criterion values can lie on an (almost) continuous scale or have a dichotomous character.
Usually, criterion values are established by judgements of experts.
Commonly, a criterion value is provisional: it holds true for the time being.
The test score and criterion value can be established simultaneously, or with a short or long period in between.
This affects the interpretation of the table.
With a long time in between, the prediction becomes less stable.
Usually, criterion values are placed on the vertical axis and test scores on the horizontal axis.
Not everyone uses this system
Base rate or prevalence: the percentage occurrence of the trait in the population.
With a low prevalence, finding the trait is difficult.
The use of a test must lead to a higher percentage of correctly detected cases (hits) than the prevalence alone; otherwise, using the test is pointless.
Sensitivity and specificity are direct clues to the predictive value of the test.
PPV and NPV are direct clues to the predictive value of the test.
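The dependence of the PPV on prevalence follows directly from Bayes' rule; a short sketch with invented test properties:

```python
def ppv(sensitivity, specificity, base_rate):
    """Positive predictive value from sensitivity, specificity, and prevalence."""
    true_pos = sensitivity * base_rate
    false_pos = (1 - specificity) * (1 - base_rate)
    return true_pos / (true_pos + false_pos)

# The same test (90% sensitive, 90% specific) at two prevalences:
print(round(ppv(0.9, 0.9, 0.50), 3))  # high prevalence: most positives are real
print(round(ppv(0.9, 0.9, 0.01), 3))  # low prevalence: most positives are false
```

At 50% prevalence the PPV is 0.9, but at 1% prevalence it drops to about 0.083: with a rare trait, a positive result is usually a false positive, which is why a useful test must beat the base rate.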
Reliability of the prediction
The reliability of the prediction: the repeatability of the prediction at a certain point in time.
The reliability of the prediction can be established with cross-validation.
Stability of the prediction
Stability of the prediction: the repeatability of the prediction over time.
Especially important when the predictions are about a period of time.
The stability of
Critical thinking
Article: Dawes, R. M., Faust, D., & Meehl, P. E. (1989)
Clinical versus actuarial judgment
In the clinical method, the decision maker combines or processes information in his or her head.
In the actuarial or statistical method, conclusions rest solely on empirically established relations between data and the condition or event of interest.
The actuarial method should not be equated with automated decision rules alone.
To be truly actuarial, interpretations must be both automatic (pre-specified or routinised) and based on empirically established relations.
Virtually any type of data is amenable to actuarial interpretation.
The combination of clinical and actuarial methods offers a third potential judgment strategy, one for which certain viable approaches have been proposed.
But, most proposals for clinical-actuarial combination presume that the two judgment methods work together harmoniously and overlook the many situations that require dichotomous choices.
Conditions for a fair comparison of the two methods:
Actuarial methods seem to have advantages over the clinical method.
Although most comparative research in medicine favours the actuarial method overall, the studies that suggest a slight clinical advantage seem to involve circumstances in which judgments rest on firm theoretical grounds.
Consideration of utilities. Depending on the task, certain judgment errors may be more serious than others.
The adjustment of decision rules or cutting scores to reduce either false-negative or false-positive errors can decrease the procedure's overall accuracy but may still be justified if the consequences of these opposing forms of error are unequal.
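The trade-off can be seen in a tiny example (the score distributions are invented): lowering the cutting score reduces false negatives at the cost of more false positives, and raising it does the reverse.

```python
# Invented test scores for two groups whose true status is known.
with_condition = [6, 7, 7, 8, 9, 9, 10, 11]
without_condition = [2, 3, 4, 4, 5, 6, 6, 7]

def error_counts(cutoff):
    """Classify 'has the condition' as score >= cutoff; count both error types."""
    false_negatives = sum(s < cutoff for s in with_condition)
    false_positives = sum(s >= cutoff for s in without_condition)
    return false_negatives, false_positives

for cutoff in (5, 7, 9):
    fn, fp = error_counts(cutoff)
    print(f"cutoff {cutoff}: {fn} false negatives, {fp} false positives")
```

No cutoff eliminates both error types at once; which one to minimise depends on the utilities of the two errors.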
The clinician’s potential capacity to capitalize on configural patterns or relations among predictive cues raises two related but separable issues:
A unique capacity to observe is not the same as a unique capacity to predict on the basis of an integration of observations.
Greater accuracy may be achieved if the skilled observer performs this function and then steps aside.