| Publication Date | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 18 |
| Descriptor | Count |
| --- | --- |
| Psychological Testing | 18 |
| Test Items | 18 |
| Item Response Theory | 9 |
| Educational Testing | 6 |
| Goodness of Fit | 6 |
| Test Construction | 5 |
| Scores | 4 |
| Test Validity | 4 |
| Accuracy | 3 |
| Educational Assessment | 3 |
| Error of Measurement | 3 |
| Author | Count |
| --- | --- |
| Sinharay, Sandip | 2 |
| Beck, Klaus | 1 |
| Bichi, Ado Abdu | 1 |
| Brauer, Kay | 1 |
| Cui, Ying | 1 |
| Debelak, Rudolf | 1 |
| Dimitrov, Dimiter M. | 1 |
| Fager, Meghan L. | 1 |
| Feng, Mingyu, Ed. | 1 |
| Fährmann, Katharina | 1 |
| Gorney, Kylie | 1 |
| Publication Type | Count |
| --- | --- |
| Journal Articles | 10 |
| Reports - Research | 9 |
| Dissertations/Theses -… | 5 |
| Collected Works - Proceedings | 1 |
| Information Analyses | 1 |
| Numerical/Quantitative Data | 1 |
| Reports - Descriptive | 1 |
| Reports - Evaluative | 1 |
| Education Level | Count |
| --- | --- |
| Secondary Education | 3 |
| Higher Education | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Postsecondary Education | 1 |
| Assessments and Surveys | Count |
| --- | --- |
| Program for International… | 1 |
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
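The classical-test-theory criterion attributed to Haberman above boils down to a simple comparison: a subscore has added value only if it predicts the examinee's true subscore better than the total score does. A minimal sketch of that check, with illustrative function names and summary-statistic inputs (not the authors' code):

```python
# Hedged sketch of Haberman's added-value check for subscores: compare the
# PRMSE (proportional reduction in mean squared error) of the observed
# subscore against that of the total score as predictors of the true subscore.

def prmse_subscore(subscore_reliability: float) -> float:
    # The observed subscore's PRMSE for its own true score is its reliability.
    return subscore_reliability

def prmse_total(corr_total_subscore: float, subscore_reliability: float) -> float:
    # Squared correlation between the total score and the true subscore,
    # disattenuating the observed total-subscore correlation.
    return corr_total_subscore ** 2 / subscore_reliability

def subscore_has_added_value(subscore_reliability: float,
                             corr_total_subscore: float) -> bool:
    return prmse_subscore(subscore_reliability) > prmse_total(
        corr_total_subscore, subscore_reliability)
```

For example, a subscore with reliability 0.85 that correlates 0.70 with the total score would be worth reporting (0.85 > 0.49 / 0.85 ≈ 0.58), while one with reliability 0.60 and a 0.75 correlation would not.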
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
Paige Haley – ProQuest LLC, 2023
As the research on feigning has grown, the number and quality of performance validity tests (PVTs) has increased as well. However, while several PVTs have been developed from assessments commonly used as part of neuropsychological batteries, there has been less exploration for PVTs scored from items in cognitive screeners. The Montreal Cognitive…
Descriptors: Cognitive Measurement, Performance, Test Validity, Psychological Testing
Ranger, Jochen; Brauer, Kay – Journal of Educational and Behavioral Statistics, 2022
The generalized S-X² test is a test of item fit for items with a polytomous response format. The test is based on a comparison of the observed and expected number of responses in strata defined by the test score. In this article, we make four contributions. We demonstrate that the performance of the generalized S-X² test…
Descriptors: Goodness of Fit, Test Items, Statistical Analysis, Item Response Theory
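The stratum-wise comparison underlying the S-X² family can be sketched for the simpler dichotomous case (the strata counts and probabilities below are toy assumptions; the generalized polytomous version discussed in the article is not implemented here):

```python
# Illustrative sketch of an S-X2-style item-fit statistic: examinees are
# stratified by test score, and the observed count of correct responses in
# each stratum is compared with the model-implied count via a Pearson
# chi-square sum over the correct and incorrect cells.

def s_x2(strata):
    """strata: iterable of (n, observed_correct, expected_prob) per stratum."""
    stat = 0.0
    for n, observed, p in strata:
        expected = n * p
        # Pearson contribution from the "correct" cell...
        stat += (observed - expected) ** 2 / expected
        # ...and from the "incorrect" cell.
        stat += ((n - observed) - (n - expected)) ** 2 / (n - expected)
    return stat
```

A stratum whose observed proportion correct matches the model prediction contributes nothing; large values across strata signal item misfit.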
Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…
Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance
Gorney, Kylie – ProQuest LLC, 2023
Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…
Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias
Nana Kim – ProQuest LLC, 2022
In educational and psychological assessments, attending to item response processes can be useful in understanding and improving the validity of measurement. This dissertation consists of three studies, each of which proposes and applies item response theory (IRT) methods for modeling and understanding cognitive/psychological response process in…
Descriptors: Psychometrics, Item Response Theory, Test Items, Cognitive Tests
Fährmann, Katharina; Köhler, Carmen; Hartig, Johannes; Heine, Jörg-Henrik – Large-scale Assessments in Education, 2022
When scaling psychological tests with methods of item response theory it is necessary to investigate to what extent the responses correspond to the model predictions. In addition to the statistical evaluation of item misfit, the question arises as to its practical significance. Although item removal is undesirable for several reasons, its…
Descriptors: Psychological Testing, Scaling, Test Items, Item Response Theory
Zebing Wu – ProQuest LLC, 2024
Response style, one common aberrancy in non-cognitive assessments in psychological fields, is problematic in terms of inaccurate estimation of item and person parameters, which leads to serious reliability, validity, and fairness issues (Baumgartner & Steenkamp, 2001; Bolt & Johnson, 2009; Bolt & Newton, 2011). Response style refers to…
Descriptors: Response Style (Tests), Accuracy, Preferences, Psychological Testing
Sinharay, Sandip; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020
Response time models (RTMs) are of increasing interest in educational and psychological testing. This article focuses on the lognormal model for response times, which is one of the most popular RTMs. Several existing statistics for testing normality and the fit of factor analysis models are repurposed for testing the fit of the lognormal model. A…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Factor Analysis
Sinharay, Sandip; van Rijn, Peter – Grantee Submission, 2020
Response-time models are of increasing interest in educational and psychological testing. This paper focuses on the lognormal model for response times (van der Linden, 2006), which is one of the most popular response-time models. Several existing statistics for testing normality and the fit of factor-analysis models are repurposed for testing the…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Factor Analysis
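The core idea behind testing the lognormal response-time model above is that, under van der Linden's (2006) model, the log response times for an item are normally distributed, so departures from normality in the log times signal misfit. A minimal sketch using sample skewness as the misfit signal (the cutoff is an illustrative assumption, not one of the repurposed statistics evaluated in the articles):

```python
# Minimal sketch: under the lognormal RT model, log times should be normal,
# so marked skewness in the log response times is evidence of model misfit.

import math

def log_time_skewness(times):
    # Sample skewness (biased moment estimator) of the log response times.
    logs = [math.log(t) for t in times]
    n = len(logs)
    mean = sum(logs) / n
    m2 = sum((x - mean) ** 2 for x in logs) / n
    m3 = sum((x - mean) ** 3 for x in logs) / n
    return m3 / m2 ** 1.5

def flags_misfit(times, threshold=1.0):
    # Illustrative rule of thumb, not a calibrated significance test.
    return abs(log_time_skewness(times)) > threshold
```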
Beck, Klaus – Frontline Learning Research, 2020
Many test developers try to ensure the content validity of their tests by having external experts review the items, e.g., in terms of relevance, difficulty, or clarity. Although this approach is widely accepted, a closer look reveals several pitfalls that need to be avoided if experts' advice is to be truly helpful. The purpose of this paper is to…
Descriptors: Content Validity, Psychological Testing, Educational Testing, Student Evaluation
Mousavi, Amin; Cui, Ying – Education Sciences, 2020
Often, important decisions regarding accountability and the placement of students in performance categories are made on the basis of test scores; it is therefore important to evaluate the validity of the inferences derived from test results. One of the threats to the validity of such inferences is aberrant responding. Several…
Descriptors: Student Evaluation, Educational Testing, Psychological Testing, Item Response Theory
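Many of the person-fit statistics surveyed in this line of work share one core idea, sketched here in the spirit of the standardized log-likelihood statistic l_z (the item probabilities are assumed known; real applications estimate them from an IRT model):

```python
# Toy sketch of an l_z-style person-fit statistic: the log-likelihood of a
# response pattern is centered and scaled by its model-implied mean and
# variance; large negative values flag aberrant responding.

import math

def lz(responses, probs):
    loglik = sum(u * math.log(p) + (1 - u) * math.log(1.0 - p)
                 for u, p in zip(responses, probs))
    expected = sum(p * math.log(p) + (1 - p) * math.log(1.0 - p) for p in probs)
    variance = sum(p * (1 - p) * math.log(p / (1.0 - p)) ** 2 for p in probs)
    return (loglik - expected) / math.sqrt(variance)
```

An examinee who answers easy items correctly gets an l_z near or above zero; one who misses all of the same easy items gets a strongly negative l_z, the classic aberrance signal.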
Fager, Meghan L. – ProQuest LLC, 2019
Recent research in multidimensional item response theory has introduced within-item interaction effects between latent dimensions in the prediction of item responses. The objective of this study was to extend this research to bifactor models to include an interaction effect between the general and specific latent variables measured by an item.…
Descriptors: Test Items, Item Response Theory, Factor Analysis, Simulation
Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2020
This study presents new models for item response functions (IRFs) in the framework of the D-scoring method (DSM), which is gaining attention in the field of educational and psychological measurement and large-scale assessments. In a previous work on DSM, the IRFs of binary items were estimated using a logistic regression model (LRM). However, the LRM…
Descriptors: Item Response Theory, Scoring, True Scores, Scaling
