Showing 61 to 75 of 172 results
Peer reviewed
Koziol, Natalie A. – Applied Measurement in Education, 2016
Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite these advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…
Descriptors: Classification, Accuracy, Comparative Analysis, Models
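A minimal simulation sketch (not Koziol's code) of the issue the abstract describes: under a Rasch-type testlet model, each person receives an extra random effect for every testlet, so responses to items in the same testlet remain correlated even after conditioning on the general trait. All parameter values below are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_persons, n_testlets, items_per_testlet = 2000, 5, 4

theta = rng.normal(0.0, 1.0, n_persons)                     # general ability
gamma = rng.normal(0.0, 0.7, (n_persons, n_testlets))       # person-by-testlet effects
b = rng.normal(0.0, 1.0, (n_testlets, items_per_testlet))   # item difficulties

# P(X = 1) = logistic(theta + gamma_testlet - b_item)
logits = theta[:, None, None] + gamma[:, :, None] - b[None, :, :]
p = 1.0 / (1.0 + np.exp(-logits))
x = (rng.uniform(size=p.shape) < p).astype(int)

# Residuals after removing the part explained by theta alone: within-testlet item
# pairs show a positive residual association, between-testlet pairs roughly none.
resid = x - 1.0 / (1.0 + np.exp(-(theta[:, None, None] - b[None, :, :])))
within = np.corrcoef(resid[:, 0, 0], resid[:, 0, 1])[0, 1]
between = np.corrcoef(resid[:, 0, 0], resid[:, 1, 0])[0, 1]
print(f"within-testlet residual r = {within:.3f}, between-testlet r = {between:.3f}")
```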
Peer reviewed
Kim, Do-Hong; Lambert, Richard G.; Durham, Sean; Burts, Diane C. – Early Education and Development, 2018
Research Findings: This study builds on prior work related to the assessment of young dual language learners (DLLs). The purposes of the study were to (a) determine whether latent subgroups of preschool DLLs would replicate those found previously and (b) examine the validity of GOLD® by Teaching Strategies with empirically derived subgroups.…
Descriptors: Preschool Education, Teaching Methods, Bilingualism, Bilingual Education
Peer reviewed
DeMars, Christine E. – Applied Psychological Measurement, 2012
A testlet is a cluster of items that share a common passage, scenario, or other context. These items might measure something in common beyond the trait measured by the test as a whole; if so, the model for the item responses should allow for this testlet trait. But modeling testlet effects that are negligible makes the model unnecessarily…
Descriptors: Test Items, Item Response Theory, Comparative Analysis, Models
Peer reviewed
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational and Behavioral Statistics, 2012
This study demonstrates how the stability of Mantel-Haenszel (MH) DIF (differential item functioning) methods can be improved by integrating information across multiple test administrations using Bayesian updating (BU). The authors conducted a simulation that showed that this approach, which is based on earlier work by Zwick, Thayer, and Lewis,…
Descriptors: Test Bias, Computation, Statistical Analysis, Bayesian Statistics
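A hedged sketch of the classical Mantel-Haenszel DIF statistic this work builds on: the MH common odds ratio across matched score strata, reported on the ETS delta scale as MH D-DIF = -2.35 ln(alpha_MH). The Bayesian-updating step that pools information across administrations is only gestured at with a simple precision-style weighted combination; it is not Zwick, Ye, and Isham's exact procedure, and all counts are invented.

```python
import numpy as np

def mh_d_dif(ref_correct, ref_wrong, foc_correct, foc_wrong):
    """MH common odds ratio across score strata, reported on the ETS delta scale."""
    ref_correct, ref_wrong = np.asarray(ref_correct, float), np.asarray(ref_wrong, float)
    foc_correct, foc_wrong = np.asarray(foc_correct, float), np.asarray(foc_wrong, float)
    n = ref_correct + ref_wrong + foc_correct + foc_wrong        # stratum sizes
    alpha_mh = np.sum(ref_correct * foc_wrong / n) / np.sum(ref_wrong * foc_correct / n)
    return -2.35 * np.log(alpha_mh)                              # MH D-DIF

# Toy data: three score strata for one item.
current = mh_d_dif([40, 60, 80], [20, 15, 10], [30, 50, 70], [25, 20, 15])

# Illustrative pooling of the current estimate with a prior from earlier administrations.
prior_est, prior_weight, current_weight = -0.3, 0.4, 0.6
pooled = prior_weight * prior_est + current_weight * current
print(f"current MH D-DIF = {current:.2f}, pooled estimate = {pooled:.2f}")
```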
Peer reviewed
Grasshoff, Ulrike; Holling, Heinz; Schwabe, Rainer – Psychometrika, 2012
In this paper, optimal designs will be derived for estimating the ability parameters of the Rasch model when difficulty parameters are known. It is well established that a design is locally D-optimal if the ability and difficulty coincide. But locally optimal designs require knowledge of the very ability parameters that are to be estimated. To attenuate this…
Descriptors: Item Response Theory, Test Items, Psychometrics, Statistical Analysis
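A brief numeric illustration of the fact the abstract relies on (not the paper's design algorithm): under the Rasch model an item's Fisher information is p(1 - p) with p = logistic(theta - b), which is maximal when the item difficulty b equals the ability theta.

```python
import numpy as np

def rasch_information(theta, b):
    p = 1.0 / (1.0 + np.exp(-(theta - b)))
    return p * (1.0 - p)

theta = 0.5
difficulties = np.linspace(-3, 3, 121)
info = rasch_information(theta, difficulties)
print(f"information is maximal at b = {difficulties[np.argmax(info)]:.2f} "
      f"(theta = {theta}), I_max = {info.max():.3f}")
```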
MacDonald, George T. – ProQuest LLC, 2014
A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…
Descriptors: Simulation, Item Response Theory, Models, Test Items
Peer reviewed
Wang, Chun; Fan, Zhewen; Chang, Hua-Hua; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2013
The item response times (RTs) collected from computerized testing represent an underutilized type of information about items and examinees. In addition to knowing the examinees' responses to each item, we can investigate the amount of time examinees spend on each item. Current approaches to modeling RTs focus mainly on parametric models, which have the…
Descriptors: Reaction Time, Computer Assisted Testing, Test Items, Accuracy
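A sketch of the kind of parametric RT model the abstract refers to: a lognormal response-time model in which log time has mean (item time intensity minus person speed) and item-specific precision. This is only a baseline illustration with invented parameter values; the article's own, less restrictive approach is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(2)
n_persons, n_items = 1000, 20
tau = rng.normal(0.0, 0.4, n_persons)        # person speed (higher = faster)
beta = rng.normal(4.0, 0.3, n_items)         # item time intensity (log-seconds)
alpha = rng.uniform(1.5, 2.5, n_items)       # item time discrimination (precision)

# log T_ij = beta_i - tau_j + e_ij / alpha_i, with e_ij standard normal
log_t = beta[None, :] - tau[:, None] + rng.normal(0.0, 1.0, (n_persons, n_items)) / alpha[None, :]
times = np.exp(log_t)                        # simulated response times in seconds
print(f"median RT on item 0: {np.median(times[:, 0]):.1f} s")
```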
Peer reviewed
Sun, Shan-Shan; Tao, Jian; Chang, Hua-Hua; Shi, Ning-Zhong – Applied Psychological Measurement, 2012
For mixed-type tests composed of dichotomous and polytomous items, polytomous items often yield more information than dichotomous items. To reflect the difference between the two types of items and to improve the precision of ability estimation, an adaptive weighted maximum-a-posteriori (WMAP) estimation is proposed. To evaluate the performance of…
Descriptors: Monte Carlo Methods, Computation, Item Response Theory, Weighted Scores
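A minimal sketch of an ordinary (unweighted) MAP ability estimate for dichotomous 2PL items, as a baseline for the weighted variant proposed in the article; the adaptive weights and the polytomous (GPCM) part are not reproduced. The grid evaluation uses a standard normal prior, and all item parameters below are invented.

```python
import numpy as np

def map_theta(responses, a, b, grid=np.linspace(-4, 4, 401)):
    log_post = -0.5 * grid**2                 # log of a standard normal prior (up to a constant)
    for x, ai, bi in zip(responses, a, b):
        p = 1.0 / (1.0 + np.exp(-ai * (grid - bi)))
        log_post += x * np.log(p) + (1 - x) * np.log(1.0 - p)
    return grid[np.argmax(log_post)]          # posterior mode on the grid

# Toy example: five 2PL items answered 1, 1, 0, 1, 0.
a = np.array([1.2, 0.8, 1.5, 1.0, 0.9])
b = np.array([-0.5, 0.0, 0.5, 1.0, -1.0])
print(f"MAP estimate of theta: {map_theta([1, 1, 0, 1, 0], a, b):.2f}")
```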
Peer reviewed
Pohl, Steffi; Gräfe, Linda; Rose, Norman – Educational and Psychological Measurement, 2014
Data from competence tests usually show a number of missing responses on test items due to both omitted and not-reached items. Different approaches for dealing with missing responses exist, and there are no clear guidelines on which of those to use. While classical approaches rely on an ignorable missing data mechanism, the most recently developed…
Descriptors: Test Items, Achievement Tests, Item Response Theory, Models
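A small sketch of two classical treatments of missing responses that model-based approaches are typically compared against: scoring omitted or not-reached items as wrong versus ignoring them (treating them as not administered). The response vector below is invented purely to show how the two choices shift a simple proportion-correct score.

```python
import numpy as np

rng = np.random.default_rng(1)
n_items = 40
responses = rng.integers(0, 2, n_items).astype(float)
responses[rng.choice(n_items, size=8, replace=False)] = np.nan   # missing responses

as_incorrect = np.nan_to_num(responses, nan=0.0).mean()          # missing scored as 0
ignored = np.nanmean(responses)                                  # missing dropped
print(f"proportion correct: missing-as-incorrect = {as_incorrect:.2f}, "
      f"missing-ignored = {ignored:.2f}")
```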
Peer reviewed
Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012
In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
Peer reviewed
Doebler, Anna – Applied Psychological Measurement, 2012
It is shown that deviations of estimated from true values of item difficulty parameters, caused for example by item calibration errors, the neglect of randomness of item difficulty parameters, testlet effects, or rule-based item generation, can lead to systematic bias in point estimation of person parameters in the context of adaptive testing.…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Item Response Theory
Peer reviewed
He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014
For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…
Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing
Peer reviewed
Stiller, Jurik; Hartmann, Stefan; Mathesius, Sabrina; Straube, Philipp; Tiemann, Rüdiger; Nordmeier, Volkhard; Krüger, Dirk; Upmeier zu Belzen, Annette – Assessment & Evaluation in Higher Education, 2016
The aim of this study was to improve the criterion-related test score interpretation of a text-based assessment of scientific reasoning competencies in higher education by evaluating factors which systematically affect item difficulty. To provide evidence about the specific demands which test items of various difficulty make on pre-service…
Descriptors: Logical Thinking, Scientific Concepts, Difficulty Level, Test Items
Peer reviewed
Whittaker, Tiffany A.; Chang, Wanchen; Dodd, Barbara G. – Applied Psychological Measurement, 2012
When tests consist of multiple-choice and constructed-response items, researchers are confronted with the question of which item response theory (IRT) model combination will appropriately represent the data collected from these mixed-format tests. This simulation study examined the performance of six model selection criteria, including the…
Descriptors: Item Response Theory, Models, Selection, Criteria
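A hedged sketch of two widely used model selection criteria of the kind examined in such comparisons (the study's full set of six criteria is not reproduced here): AIC = -2 logL + 2k and BIC = -2 logL + k ln(N), where smaller values indicate the preferred model. The log-likelihoods and parameter counts below are invented.

```python
import math

def aic(log_lik, n_params):
    return -2.0 * log_lik + 2.0 * n_params

def bic(log_lik, n_params, n_obs):
    return -2.0 * log_lik + n_params * math.log(n_obs)

# Illustrative (invented) fits of two candidate mixed-format IRT model combinations.
candidates = {"3PL + GPCM": (-10450.2, 180), "3PL + GRM": (-10441.7, 180)}
n_examinees = 2000
for name, (ll, k) in candidates.items():
    print(f"{name}: AIC = {aic(ll, k):.1f}, BIC = {bic(ll, k, n_examinees):.1f}")
```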
Peer reviewed
Fukuhara, Hirotaka; Kamata, Akihito – Applied Psychological Measurement, 2011
A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…
Descriptors: Item Response Theory, Test Bias, Test Items, Bayesian Statistics