Showing all 13 results
Peer reviewed
Direct link
Bulut, Okan; Guo, Qi; Gierl, Mark J. – Large-scale Assessments in Education, 2017
Position effects may occur in both paper-pencil tests and computerized assessments when examinees respond to the same test items located in different positions on the test. To examine position effects in large-scale assessments, previous studies often used multilevel item response models within the generalized linear mixed modeling framework.…
Descriptors: Structural Equation Models, Educational Assessment, Measurement, Test Items
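A hedged illustration of the idea behind modeling item position effects (the entry above uses a structural equation modeling approach; the notation here is generic and is not taken from the paper): a Rasch-type model with a linear position effect can be written as

    \operatorname{logit} P(X_{pi} = 1 \mid \theta_p) = \theta_p - b_i + \gamma\,\mathrm{pos}_{pi}

where \theta_p is the ability of person p, b_i the difficulty of item i, \mathrm{pos}_{pi} the position at which person p encounters item i, and \gamma the average change in the log-odds of a correct response per position shift.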
Peer reviewed
Direct link
Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017
This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…
Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics
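To make the contrast between sum scores and model-based ability estimates concrete, the sketch below computes a maximum likelihood ability estimate under a two-parameter logistic (2PL) model for one response pattern and prints it next to the plain sum score. The item parameters and responses are invented for illustration; this is not the authors' analysis.

    import numpy as np
    from scipy.optimize import minimize_scalar

    # Invented 2PL item parameters: a = discrimination, b = difficulty.
    a = np.array([1.2, 0.8, 1.5, 1.0, 0.9])
    b = np.array([-1.0, -0.3, 0.0, 0.6, 1.2])
    x = np.array([1, 1, 0, 1, 0])   # one examinee's binary responses

    def neg_log_lik(theta):
        p = 1.0 / (1.0 + np.exp(-a * (theta - b)))   # 2PL probability of a correct response
        return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

    res = minimize_scalar(neg_log_lik, bounds=(-4, 4), method="bounded")
    print("sum score:", x.sum(), " ML theta estimate:", round(res.x, 3))

Unlike the sum score, the ML estimate weights items by how discriminating they are, which is the kind of difference in efficiency the article quantifies.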
Peer reviewed
PDF on ERIC
Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016
The aim of this study is to determine the difference in variance between the maximum likelihood and expected a posteriori estimation methods as a function of the number of test items on an aptitude test. The variance reflects the accuracy produced by both the maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…
Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items
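A minimal sketch of the two estimators being compared, assuming a 2PL model, a standard normal prior for the expected a posteriori (EAP) estimate, and invented item parameters (none of this is taken from the study):

    import numpy as np

    # Invented 2PL item parameters and one response pattern (not from the study).
    a = np.array([1.0, 1.3, 0.7, 1.1])
    b = np.array([-0.8, 0.0, 0.5, 1.0])
    x = np.array([1, 0, 1, 0])

    grid = np.linspace(-4, 4, 161)                          # theta quadrature points
    p = 1.0 / (1.0 + np.exp(-a * (grid[:, None] - b)))      # P(correct), shape (points, items)
    lik = np.prod(p ** x * (1 - p) ** (1 - x), axis=1)      # likelihood of the pattern at each point
    prior = np.exp(-grid ** 2 / 2)                          # standard normal prior (unnormalized)
    post = lik * prior

    eap = np.sum(grid * post) / np.sum(post)                # EAP = posterior mean
    mle = grid[np.argmax(lik)]                              # crude grid-search ML estimate
    print("EAP:", round(eap, 3), " ML (grid):", round(mle, 3))

With short tests the EAP estimate is pulled toward the prior mean, which is why its variance across examinees tends to be smaller than that of the maximum likelihood estimate.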
Peer reviewed
PDF on ERIC
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation
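The exact form of Yao's rater model is not given in the abstract; as a generic, hedged sketch of how rater severity enters an IRT model, a many-facet Rasch-style formulation for a dichotomous rating is

    \operatorname{logit} P(X_{pir} = 1 \mid \theta_p) = \theta_p - b_i - c_r

where c_r is the severity of rater r: a harsher rater shifts the whole item characteristic curve, lowering the probability of a positive rating at every ability level.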
Peer reviewed
Direct link
Suh, Youngsuk; Bolt, Daniel M. – Psychometrika, 2010
Nested logit item response models for multiple-choice data are presented. Relative to previous models, the new models are suggested to provide a better approximation to multiple-choice items where the application of a solution strategy precedes consideration of response options. In practice, the models also accommodate collapsibility across all…
Descriptors: Computation, Simulation, Psychometrics, Models
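A hedged sketch of the structure such nested logit models take (symbols illustrative, not copied from the article): correctness is modeled first, and distractor choice is modeled only conditionally on an incorrect response,

    P(X_i = 1 \mid \theta) = \frac{\exp\{a_i(\theta - b_i)\}}{1 + \exp\{a_i(\theta - b_i)\}},
    \qquad
    P(D_i = k \mid X_i = 0, \theta) = \frac{\exp(\zeta_{ik} + \lambda_{ik}\theta)}{\sum_{m} \exp(\zeta_{im} + \lambda_{im}\theta)}

so the solution step (first factor) precedes consideration of the response options (second factor), matching the behavioral account in the abstract.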
Thissen, David; Steinberg, Lynne – 1983
An extension of the Bock-Samejima model for multiple choice items is introduced. The model provides for varying probabilities of the response alternatives when the examinee guesses. A marginal maximum likelihood method is devised for estimating the item parameters, and likelihood ratio tests for comparing more and less constrained forms of the…
Descriptors: Ability, Estimation (Mathematics), Guessing (Tests), Latent Trait Theory
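For orientation only (this is the commonly cited form of the resulting multiple-choice model, not a quotation from the report), the extension adds a latent "don't know" category 0 whose probability mass is redistributed over the observed options:

    P(X = k \mid \theta) = \frac{\exp(a_k\theta + c_k) + d_k \exp(a_0\theta + c_0)}{\sum_{h=0}^{m} \exp(a_h\theta + c_h)},
    \qquad \sum_{k=1}^{m} d_k = 1

where d_k is the probability that an examinee who is effectively guessing selects alternative k, which is the "varying probabilities of the response alternatives" mentioned in the entry above.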
Bradshaw, Charles W., Jr. – 1968
A method for determining invariant item parameters is presented, along with a scheme for obtaining test scores which are interpretable in terms of a common metric. The method assumes a unidimensional latent trait and uses a three parameter normal ogive model. The assumptions of the model are explored, and the methods for calculating the proposed…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Mathematical Models
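The three-parameter normal ogive model referred to above has the standard form

    P_i(\theta) = c_i + (1 - c_i)\,\Phi\bigl(a_i(\theta - b_i)\bigr)

where \Phi is the standard normal distribution function, a_i the item discrimination, b_i the item difficulty, and c_i the lower asymptote (pseudo-guessing) parameter.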
Samejima, Fumiko – 1981
This is a continuation of a previous study in which a new method of estimating the operating characteristics of discrete item responses based upon an Old Test, which has a non-constant test information function, was tested upon each of two subtests of the original Old Test, Subtests 1 and 2. The results turned out to be quite successful. In the…
Descriptors: Academic Ability, Computer Assisted Testing, Estimation (Mathematics), Latent Trait Theory
Peer reviewed
Direct link
van Barneveld, Christina – Alberta Journal of Educational Research, 2003
The purpose of this study was to examine the potential effect of false assumptions regarding the motivation of examinees on item calibration and test construction. A simulation study was conducted using data generated by means of several models of examinee item response behaviors (the three-parameter logistic model alone and in combination with…
Descriptors: Simulation, Motivation, Computation, Test Construction
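A minimal sketch of generating item responses under the three-parameter logistic model, the kind of data-generation step such simulation studies start from; all parameter values below are invented, and the examinee-motivation component of the study is not reproduced.

    import numpy as np

    rng = np.random.default_rng(0)

    # Invented 3PL item and person parameters (not taken from the study).
    n_items, n_persons = 20, 500
    a = rng.uniform(0.7, 2.0, n_items)      # discrimination
    b = rng.normal(0.0, 1.0, n_items)       # difficulty
    c = rng.uniform(0.10, 0.25, n_items)    # lower asymptote (pseudo-guessing)
    theta = rng.normal(0.0, 1.0, n_persons) # abilities

    # P(correct) under the 3PL; rows are persons, columns are items.
    p = c + (1 - c) / (1 + np.exp(-a * (theta[:, None] - b)))
    responses = (rng.random((n_persons, n_items)) < p).astype(int)
    print(responses.shape, round(responses.mean(), 3))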
Samejima, Fumiko – 1984
In order to evaluate our methods and approaches of estimating the operating characteristics of discrete item responses, it is necessary to try other comparable methods on similar sets of data. LOGIST 5 was taken up for this reason, and was tried upon the hypothetical test items, which follow the normal ogive model and were used frequently in…
Descriptors: Computer Simulation, Computer Software, Estimation (Mathematics), Item Analysis
Levine, Michael V. – 1984
Formula score theory (FST) associates each multiple choice test with a linear operator and expresses all of the real functions of item response theory as linear combinations of the operator's eigenfunctions. Hard measurement problems can then often be reformulated as easier, standard mathematical problems. For example, the problem of estimating…
Descriptors: Cognitive Ability, Estimation (Mathematics), Latent Trait Theory, Maximum Likelihood Statistics
Waller, Michael I. – 1986
This study compares the fit of the 3-parameter model to the Ability Removing Random Guessing (ARRG) model on data from a wide range of tests of cognitive ability in three representative samples. When the guessing parameters under the 3-parameter model are estimated individually for each item, the 3-parameter model yields the better fit to…
Descriptors: Cognitive Tests, Cohort Analysis, Elementary Secondary Education, Equations (Mathematics)
Peer reviewed
Cohen, Allan S.; And Others – Journal of Educational Measurement, 1991
Detecting differential item functioning (DIF) on test items constructed to favor one group over another was investigated on parameter estimates from two item response theory-based computer programs, BILOG and LOGIST, using data for 1,000 White and 1,000 Black college students. Use of prior distributions and marginal maximum a posteriori estimation is…
Descriptors: Black Students, College Students, Computer Assisted Testing, Equations (Mathematics)
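As a hedged illustration of one common way to quantify DIF from two sets of calibrated item parameters (the unsigned area between group-specific item characteristic curves; not necessarily the procedure used in this study), with invented parameter estimates:

    import numpy as np

    def icc_3pl(theta, a, b, c):
        # Three-parameter logistic item characteristic curve.
        return c + (1 - c) / (1 + np.exp(-a * (theta - b)))

    # Invented parameter estimates for the same item in two groups.
    ref = dict(a=1.2, b=0.0, c=0.2)   # reference group
    foc = dict(a=1.2, b=0.5, c=0.2)   # focal group

    theta = np.linspace(-4, 4, 801)
    gap = np.abs(icc_3pl(theta, **ref) - icc_3pl(theta, **foc))
    area = np.sum(gap) * (theta[1] - theta[0])   # approximate unsigned area between the curves
    print("unsigned area between ICCs:", round(area, 3))

A larger area indicates a larger difference in the probability of a correct response between groups at the same ability level, i.e., more DIF for that item.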