Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Dorans, Neil J.; Liu, Jinghua; Hammond, Shelby – Applied Psychological Measurement, 2008
This exploratory study was built on research spanning three decades. Petersen, Marco, and Stewart (1982) conducted a major empirical investigation of the efficacy of different equating methods. The studies reported in Dorans (1990) examined how different equating methods performed across samples selected in different ways. Recent population…
Descriptors: Test Format, Equated Scores, Sampling, Evaluation Methods
Wang, Tianyou; Hanson, Bradley A.; Harris, Deborah J. – 1998
Equating a test form to itself through a chain of equatings, commonly referred to as circular equating, has been widely used as a criterion to evaluate the adequacy of equating. This paper uses both analytical methods and simulation methods to show that this criterion is in general invalid in serving this purpose. For the random groups design done…
Descriptors: Equated Scores, Evaluation Methods, Heuristics, Sampling
Sireci, Stephen G.; Bastari, B. – 1998
In many cross-cultural research studies, assessment instruments are translated or adapted for use in multiple languages. However, it cannot be assumed that different language versions of an assessment are equivalent across languages. A fundamental issue to be addressed is the comparability or equivalence of the construct measured by each language…
Descriptors: Construct Validity, Cross Cultural Studies, Evaluation Methods, Multidimensional Scaling
Wang, Wen-Chung; Wilson, Mark – Educational and Psychological Measurement, 2005
This study presents a procedure for detecting differential item functioning (DIF) for dichotomous and polytomous items in testlet-based tests, whereby DIF is taken into account by adding DIF parameters into the Rasch testlet model. Simulations were conducted to assess recovery of the DIF and other parameters. Two independent variables, test type…
Descriptors: Test Format, Test Bias, Item Response Theory, Item Analysis
Baker, Herbert George; And Others – 1990
Tailored Response Testing (TRT) is a new type of test that has demonstrated its applicability to the evaluation of human performance in a wide variety of occupations and work settings. The Navy is using TRT to measure the technical proficiency of job incumbents in three of its jobs. The methodology holds great promise for testing aboard ships as…
Descriptors: Adaptive Testing, Evaluation Methods, Fire Fighters, Job Performance
Finch, Fredrick; Foertsch, Mary – 1993
Performance assessment is reviewed as an emerging form of alternative assessment, focusing on how it has been defined in the research literature, the criteria for evaluating its authenticity, the measurement of process and product, and the link between assessment and instruction. Three important dimensions that must be considered in describing…
Descriptors: Alternative Assessment, Educational Assessment, Elementary Secondary Education, Evaluation Methods

