ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	10
Since 2007 (last 20 years)	15

Descriptor

Educational Testing	32
Psychological Testing	32
Goodness of Fit	7
Item Response Theory	7
Models	6
Higher Education	5
Reaction Time	5
Simulation	5
Testing Problems	5
Computer Assisted Testing	4
Elementary Secondary Education	4
Personality Measures	4
Scores	4
Test Construction	4
Test Items	4
Test Use	4
Test Validity	4
Comparative Analysis	3
Decision Making	3
Educational Diagnosis	3
Factor Analysis	3
Psychometrics	3
Research Reports	3
Student Evaluation	3
Test Bias	3
More ▼

Source

Journal of Educational…	4
Educational and Psychological…	2
Grantee Submission	2
Psychometrika	2
ACT, Inc.	1
AERA Online Paper Repository	1
Applied Measurement in…	1
Behavior in Our Schools	1
Education Sciences	1
European Journal of…	1
Journal of Educational and…	1
Journal of Experimental…	1
Language Testing in Asia	1
Reading Improvement	1
Reading Psychology	1
Soviet Education	1
More ▼

Publication Type

Reports - Research	32
Journal Articles	18
Collected Works - Proceedings	4
Opinion Papers	4
Speeches/Meeting Papers	3
Information Analyses	1
Reports - Evaluative	1

Education Level

Early Childhood Education	1
Higher Education	1
Postsecondary Education	1
Preschool Education	1

Audience

Location

Australia	1
Canada	1
USSR	1

Laws, Policies, & Programs

Assessments and Surveys

Wechsler Intelligence Scale…

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Reviewing the Test Reviews: Quality Judgments and Reviewer Agreements in the Mental Measurements Yearbook

Peer reviewed

Direct link

Hogan, Thomas; DeStefano, Marissa; Gilby, Caitlin; Kosman, Dana; Peri, Joshua – Applied Measurement in Education, 2021

Buros' "Mental Measurements Yearbook (MMY)" has provided professional reviews of commercially published psychological and educational tests for over 80 years. It serves as a kind of conscience for the testing industry. For a random sample of 50 entries in the "19th MMY" (a total of 100 separate reviews) this study determined…

Descriptors: Test Reviews, Interrater Reliability, Psychological Testing, Educational Testing

Establishing a Fair Cut Score for an In-House English Test: A Case Study on Integrating Two Standard-Setting Methods

Peer reviewed

Direct link

Suthathip Thirakunkovit – Language Testing in Asia, 2025

Establishing a cut score is a crucial aspect of the test development process since the selected cut score has the potential to impact students' performance outcomes and shape instructional strategies within the classroom. Therefore, it is vital for those involved in test development to set a cut score that is both fair and justifiable. This cut…

Descriptors: Cutting Scores, Culture Fair Tests, Language Tests, Test Construction

Performance of Person-Fit Statistics under Model Misspecification

Peer reviewed

Direct link

Hong, Seong Eun; Monroe, Scott; Falk, Carl F. – Journal of Educational Measurement, 2020

In educational and psychological measurement, a person-fit statistic (PFS) is designed to identify aberrant response patterns. For parametric PFSs, valid inference depends on several assumptions, one of which is that the item response theory (IRT) model is correctly specified. Previous studies have used empirical data sets to explore the effects…

Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Error of Measurement

Assessing Fit of the Lognormal Model for Response Times

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020

Response time models (RTMs) are of increasing interest in educational and psychological testing. This article focuses on the lognormal model for response times, which is one of the most popular RTMs. Several existing statistics for testing normality and the fit of factor analysis models are repurposed for testing the fit of the lognormal model. A…

Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Factor Analysis

Assessing Fit of the Lognormal Model for Response Times

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; van Rijn, Peter – Grantee Submission, 2020

Response-time models are of increasing interest in educational and psychological testing. This paper focuses on the lognormal model for response times (van der Linden, 2006), which is one of the most popular response-time models. Several existing statistics for testing normality and the fit of factor-analysis models are repurposed for testing the…

Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Factor Analysis

The Effect of Person Misfit on Item Parameter Estimation and Classification Accuracy: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Mousavi, Amin; Cui, Ying – Education Sciences, 2020

Often, important decisions regarding accountability and placement of students in performance categories are made on the basis of test scores generated from tests, therefore, it is important to evaluate the validity of the inferences derived from test results. One of the threats to the validity of such inferences is aberrant responding. Several…

Descriptors: Student Evaluation, Educational Testing, Psychological Testing, Item Response Theory

A New Person-Fit Statistic for the Lognormal Model for Response Times

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2018

Response-time models are of increasing interest in educational and psychological testing. This article focuses on the lognormal model for response times, which is one of the most popular response-time models, and suggests a simple person-fit statistic for the model. The distribution of the statistic under the null hypothesis of no misfit is proved…

Descriptors: Reaction Time, Educational Testing, Psychological Testing, Models

A New Person-Fit Statistic for the Lognormal Model for Response Times

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Response-time models are of increasing interest in educational and psychological testing. This paper focuses on the lognormal model for response times (van der Linden, 2006), which is one of the most popular response-time models, and suggests a simple person-fit statistic for the model. The distribution of the statistic under the null hypothesis…

Descriptors: Reaction Time, Educational Testing, Psychological Testing, Models

Neglected Validities: Evaluating Preschool Assessment against the Standards

Peer reviewed

Direct link

Buek, Katharine; Barghaus, Katherine; Fantuzzo, John – AERA Online Paper Repository, 2017

Quality assessment is an essential component of preschool education. The Standards for Educational and Psychological Testing provide benchmarks for evaluating the validity of inferences made from assessment or test results (AERA, APA & NCME, 2014). According to the Standards, test developers should investigate and document information related…

Descriptors: Preschool Education, Test Validity, Preschool Children, Standards

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

Item Selection in Multidimensional Computerized Adaptive Testing--Gaining Information from Different Angles

Peer reviewed

Direct link

Wang, Chun; Chang, Hua-Hua – Psychometrika, 2011

Over the past thirty years, obtaining diagnostic information from examinees' item responses has become an increasingly important feature of educational and psychological testing. The objective can be achieved by sequentially selecting multidimensional items to fit the class of latent traits being assessed, and therefore Multidimensional…

Descriptors: Psychological Testing, Adaptive Testing, Scientific Concepts, Item Analysis

Performance of the Generalized S-X[Superscript 2] Item Fit Index for Polytomous IRT Models

Peer reviewed

Direct link

Kang, Taehoon; Chen, Troy T. – Journal of Educational Measurement, 2008

Orlando and Thissen's S-X[superscript 2] item fit index has performed better than traditional item fit statistics such as Yen' s Q[subscript 1] and McKinley and Mill' s G[superscript 2] for dichotomous item response theory (IRT) models. This study extends the utility of S-X[superscript 2] to polytomous IRT models, including the generalized partial…

Descriptors: Item Response Theory, Models, Rating Scales, Generalization

Graded Response Model Based on the Logistic Positive Exponent Family of Models for Dichotomous Responses

Peer reviewed

Direct link

Samejima, Fumiko – Psychometrika, 2008

Samejima ("Psychometrika "65:319--335, 2000) proposed the logistic positive exponent family of models (LPEF) for dichotomous responses in the unidimensional latent space. The objective of the present paper is to propose and discuss a graded response model that is expanded from the LPEF, in the context of item response theory (IRT). This…

Descriptors: Psychological Testing, Item Response Theory, Psychometrics, Educational Testing

An Investigation of the Performance of the Generalized S-X[superscript 2] Item-Fit Index for Polytomous IRT Models. ACT Research Report Series, 2007-1

Download full text

Kang, Taehoon; Chen, Troy T. – ACT, Inc., 2007

Orlando and Thissen (2000, 2003) proposed an item-fit index, S-X[superscript 2], for dichotomous item response theory (IRT) models, which has performed better than traditional item-fit statistics such as Yen's (1981) Q[subscript 1] and McKinley and Mill's (1985) G[superscript 2]. This study extends the utility of S-X[superscript 2] to polytomous…

Descriptors: Item Response Theory, Models, Computer Software, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3

Sinharay, Sandip	4
Chen, Troy T.	2
Kang, Taehoon	2
Barghaus, Katherine	1
Bartram, Dave	1
Ben-Yashar, Ruth	1
Boben, Dusica	1
Bowen, Charles E.	1
Buek, Katharine	1
Chang, Hua-Hua	1
Cooksey, Ray W.	1
Cui, Ying	1
Dalton, Tom	1
DeStefano, Marissa	1
Evers, Arne	1
Falk, Carl F.	1
Fantuzzo, John	1
Fernandez-Hermida, Jose R.	1
Forness, Steven R.	1
Freebody, Peter	1
Gauthier, Yvon	1
Gilby, Caitlin	1
Glabeke, Kathia	1
Harvey, Anne L.	1
More ▼