Showing all 12 results
Peer reviewed
Direct link
Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023
Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…
Descriptors: Scores, Test Validity, Test Items, Prior Learning
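The abstract above is truncated, so the following is only a loose illustration of the compromised-item (CI) detection side of the problem, not the authors' procedure: screen for items on which a suspected group outperforms the remaining examinees with a two-proportion z-test. All names and numbers are invented.

```python
# Hypothetical CI screen: flag items where a suspected group's
# proportion-correct is unusually high relative to everyone else.
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
n_items, n_honest, n_suspect = 40, 900, 100
p_item = rng.uniform(0.3, 0.8, n_items)              # baseline item difficulty
honest = rng.random((n_honest, n_items)) < p_item
suspect = rng.random((n_suspect, n_items)) < p_item
suspect[:, :5] = rng.random((n_suspect, 5)) < 0.95   # items 0-4 leaked to this group

for j in range(n_items):
    p1, p2 = suspect[:, j].mean(), honest[:, j].mean()
    pooled = (suspect[:, j].sum() + honest[:, j].sum()) / (n_suspect + n_honest)
    se = np.sqrt(pooled * (1 - pooled) * (1 / n_suspect + 1 / n_honest))
    z = (p1 - p2) / se
    if norm.sf(z) < 0.01 / n_items:                  # Bonferroni-style cutoff
        print(f"item {j}: z = {z:.2f} -> possibly compromised")
```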
Peer reviewed
Direct link
Bogaert, Jasper; Loh, Wen Wei; Rosseel, Yves – Educational and Psychological Measurement, 2023
Factor score regression (FSR) is widely used as a convenient alternative to traditional structural equation modeling (SEM) for assessing structural relations between latent variables. But when latent variables are simply replaced by factor scores, biases in the structural parameter estimates often have to be corrected, due to the measurement error…
Descriptors: Factor Analysis, Regression (Statistics), Structural Equation Models, Error of Measurement
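A minimal sketch of the bias this entry refers to, assuming a one-factor model on each side with known loadings: regression-method factor scores shrink toward the mean, so naively regressing one set of scores on the other attenuates the structural slope. This illustrates the problem, not the authors' correction.

```python
# Naive factor score regression (FSR): the estimated slope is attenuated
# relative to the true structural coefficient beta.
import numpy as np

rng = np.random.default_rng(1)
n, lam, beta = 5000, 0.7, 0.5
xi = rng.normal(size=n)                                   # exogenous latent variable
eta = beta * xi + rng.normal(scale=np.sqrt(1 - beta**2), size=n)
X = lam * xi[:, None] + rng.normal(scale=np.sqrt(1 - lam**2), size=(n, 3))
Y = lam * eta[:, None] + rng.normal(scale=np.sqrt(1 - lam**2), size=(n, 3))

def reg_scores(data, lam, k=3):
    """Regression-method factor scores for a known one-factor model."""
    Sigma = lam**2 * np.ones((k, k)) + (1 - lam**2) * np.eye(k)   # cov(X)
    w = np.linalg.solve(Sigma, lam * np.ones(k))                  # cov(X)^-1 cov(X, f)
    return data @ w

b_naive = np.polyfit(reg_scores(X, lam), reg_scores(Y, lam), 1)[0]
print(f"true beta = {beta}, naive FSR estimate = {b_naive:.3f}")  # attenuated
```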
Peer reviewed
PDF on ERIC (full text available)
Mansolf, Maxwell; Jorgensen, Terrence D.; Enders, Craig K. – Grantee Submission, 2020
Structural equation modeling (SEM) applications routinely employ a trilogy of significance tests that includes the likelihood ratio test, Wald test, and score test or modification index. Researchers use these tests to assess global model fit, evaluate whether individual estimates differ from zero, and identify potential sources of local misfit,…
Descriptors: Structural Equation Models, Computation, Scores, Simulation
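To make the "trilogy" concrete outside the SEM setting, here is a toy case where all three statistics have closed forms: testing H0: p = p0 in a binomial model. Each statistic is asymptotically chi-square with 1 df; this is a generic textbook illustration, not the article's SEM machinery.

```python
# Likelihood ratio, Wald, and score (Lagrange multiplier) tests for
# H0: p = p0 with x successes in n binomial trials.
import numpy as np
from scipy.stats import chi2

def trilogy(x, n, p0):
    p_hat = x / n
    ll = lambda p: x * np.log(p) + (n - x) * np.log(1 - p)
    lrt   = 2 * (ll(p_hat) - ll(p0))                        # likelihood ratio
    wald  = (p_hat - p0) ** 2 * n / (p_hat * (1 - p_hat))   # Wald
    score = (p_hat - p0) ** 2 * n / (p0 * (1 - p0))         # score / LM
    return {k: (v, chi2.sf(v, df=1)) for k, v in
            dict(LRT=lrt, Wald=wald, Score=score).items()}

for name, (stat, pval) in trilogy(x=58, n=100, p0=0.5).items():
    print(f"{name}: stat = {stat:.3f}, p = {pval:.4f}")
```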
Peer reviewed
PDF on ERIC (full text available)
Li, Zhen; Cai, Li – Grantee Submission, 2017
In standard item response theory (IRT) applications, the latent variable is typically assumed to be normally distributed. If the normality assumption is violated, the item parameter estimates can become biased. Summed score likelihood-based statistics may be useful for testing latent variable distribution fit. We develop Satorra-Bentler type…
Descriptors: Scores, Goodness of Fit, Statistical Distributions, Item Response Theory
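The summed-score machinery behind such fit statistics can be sketched with the Lord-Wingersky recursion, which yields the model-implied distribution of the summed score under an IRT model; that distribution can then be compared with observed score frequencies. The 2PL parameters below are assumed values, and this is not the Satorra-Bentler-type correction itself.

```python
# Lord-Wingersky recursion: model-implied summed-score distribution
# under a 2PL model, marginalized over a standard normal latent trait.
import numpy as np

a = np.array([1.0, 1.5, 0.8, 1.2])    # discriminations (assumed)
b = np.array([-0.5, 0.0, 0.5, 1.0])   # difficulties (assumed)

def p_correct(theta):
    return 1 / (1 + np.exp(-a * (theta - b)))

def summed_score_dist(theta):
    """P(summed score = s | theta) via the Lord-Wingersky recursion."""
    dist = np.array([1.0])
    for p in p_correct(theta):
        new = np.zeros(len(dist) + 1)
        new[:-1] += dist * (1 - p)    # item answered incorrectly
        new[1:]  += dist * p          # item answered correctly
        dist = new
    return dist

# Gauss-Hermite quadrature over N(0, 1)
nodes, weights = np.polynomial.hermite_e.hermegauss(21)
weights = weights / weights.sum()
marginal = sum(w * summed_score_dist(t) for t, w in zip(nodes, weights))
print("model-implied P(score = s):", np.round(marginal, 4))
```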
Peer reviewed
Direct link
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
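As a point of reference for the quantity at issue, the marginal ("empirical") reliability of IRT ability estimates is commonly computed as the estimated true-score variance over total variance. The sketch below uses simulated placeholder inputs and a bootstrap standard error as a stand-in for the analytic standard errors derived in the article.

```python
# Empirical marginal reliability of ability estimates, with a bootstrap
# SE to quantify the variability the abstract says is usually ignored.
import numpy as np

rng = np.random.default_rng(2)
n = 2000
theta_hat = rng.normal(0.0, 0.9, n)       # stand-in ability estimates
se = rng.uniform(0.3, 0.5, n)             # their conditional standard errors

def reliability(th, s):
    # signal variance over (signal + mean squared error) variance
    return th.var(ddof=1) / (th.var(ddof=1) + np.mean(s**2))

boot = []
for _ in range(500):                      # resample examinees
    idx = rng.integers(0, n, n)
    boot.append(reliability(theta_hat[idx], se[idx]))
print(f"reliability = {reliability(theta_hat, se):.3f}, "
      f"bootstrap SE = {np.std(boot):.4f}")
```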
Peer reviewed
Direct link
Wothke, Werner; Burket, George; Chen, Li-Sue; Gao, Furong; Shu, Lianghua; Chia, Mike – Journal of Educational and Behavioral Statistics, 2011
It has been known for some time that item response theory (IRT) models may yield a likelihood function for a respondent's ability that has multiple modes, flat modes, or both. These conditions, often associated with guessing on multiple-choice (MC) questions, can introduce uncertainty and bias to ability estimation by maximum likelihood…
Descriptors: Educational Assessment, Item Response Theory, Computation, Maximum Likelihood Statistics
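The multimodality phenomenon is easy to probe numerically: evaluate the 3PL ability likelihood on a grid and report every local maximum. Depending on the response pattern and the (here invented) item parameters, more than one mode can appear; a hill-climbing ML routine started badly can then stop at the wrong one.

```python
# Grid scan of the 3PL log-likelihood in theta for a single response
# pattern, reporting all interior local maxima found.
import numpy as np

a = np.array([1.8, 1.5, 2.0, 1.6, 1.9])   # discriminations
b = np.array([-1.0, 0.0, 0.5, 1.0, 1.5])  # difficulties
c = np.array([0.2, 0.25, 0.2, 0.25, 0.2]) # guessing parameters
u = np.array([0, 1, 0, 1, 1])             # pattern: misses some easy items

def loglik(theta):
    p = c + (1 - c) / (1 + np.exp(-a * (theta - b)))
    return np.sum(u * np.log(p) + (1 - u) * np.log(1 - p))

grid = np.linspace(-4, 4, 801)
ll = np.array([loglik(t) for t in grid])
modes = [grid[i] for i in range(1, len(grid) - 1)
         if ll[i] > ll[i - 1] and ll[i] > ll[i + 1]]
print("local maxima of the ability likelihood near theta =", np.round(modes, 2))
```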
Peer reviewed
Direct link
Furgol, Katherine E.; Ho, Andrew D.; Zimmerman, Dale L. – Educational and Psychological Measurement, 2010
Under the No Child Left Behind Act, large-scale test score trend analyses are widespread. These analyses often gloss over interesting changes in test score distributions and involve unrealistic assumptions. Further complications arise from analyses of unanchored, censored assessment data, or proportions of students lying within performance levels…
Descriptors: Trend Analysis, Sample Size, Federal Legislation, Simulation
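One flavor of the censored-data problem alluded to here: states often report only the proportions of students below each performance-level cut score. If the score distribution is assumed normal, a mean and SD can still be recovered, since the probit of each proportion is linear in the cut score. The numbers below are entirely illustrative.

```python
# Recover (mu, sigma) from performance-level proportions, assuming
# normality: Phi^-1(p_j) = (c_j - mu) / sigma is linear in c_j.
import numpy as np
from scipy.stats import norm

cuts = np.array([400.0, 450.0, 500.0])    # performance-level cut scores
p_below = np.array([0.18, 0.46, 0.77])    # reported proportions below each cut

z = norm.ppf(p_below)                     # probit-transformed proportions
slope, intercept = np.polyfit(cuts, z, 1) # z = c/sigma - mu/sigma
sigma, mu = 1 / slope, -intercept / slope
print(f"recovered mean = {mu:.1f}, SD = {sigma:.1f}")
```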
Peer reviewed
Jansen, Margo G. H. – Applied Psychological Measurement, 1995
The Rasch Poisson counts model is a latent trait model for the situation in which "K" tests are administered to "N" examinees and the test score is a count (the number of repetitions of some event). A mixed model is presented that applies the EM algorithm and that can allow for missing data. (SLD)
Descriptors: Estimation (Mathematics), Item Response Theory, Maximum Likelihood Statistics, Scores
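The model's likelihood structure is simple to sketch: the count for examinee i on test k is Poisson with rate theta_i * eps_k (ability times test "easiness"). The alternating maximum-likelihood updates below are a complete-data illustration of that structure, not the article's EM algorithm for missing data.

```python
# Rasch Poisson counts model: X[i, k] ~ Poisson(theta_i * eps_k),
# fit by alternating closed-form ML updates on complete data.
import numpy as np

rng = np.random.default_rng(3)
N, K = 200, 4
theta = rng.gamma(2.0, 1.0, N)            # true abilities
eps = np.array([0.5, 1.0, 1.5, 2.0])      # true test easiness
X = rng.poisson(np.outer(theta, eps))     # N x K matrix of counts

t_hat, e_hat = np.ones(N), np.ones(K)
for _ in range(50):                       # alternating ML updates
    t_hat = X.sum(axis=1) / e_hat.sum()   # theta_i = x_i+ / sum_k eps_k
    e_hat = X.sum(axis=0) / t_hat.sum()   # eps_k  = x_+k / sum_i theta_i

# theta_i * eps_k is identified only up to scale, so compare ratios
print("estimated:", np.round(e_hat / e_hat[0], 2), " true:", eps / eps[0])
```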
Peer reviewed
Direct link
Galindo-Garre, Francisca; Vermunt, Jeroen K. – Psychometrika, 2004
This paper presents a row-column (RC) association model in which the estimated row and column scores are forced to be in agreement with a priori specified ordering. Two efficient algorithms for finding the order-restricted maximum likelihood (ML) estimates are proposed and their reliability under different degrees of association is investigated by…
Descriptors: Mathematics, Test Reliability, Computation, Testing
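A generic ingredient in order-restricted ML problems of this kind is an isotonic projection: the pool-adjacent-violators algorithm (PAVA) maps unconstrained estimates to the closest non-decreasing sequence. The standalone sketch below shows PAVA only; the article's two algorithms for the RC association model are more involved.

```python
# Pool-adjacent-violators: least-squares non-decreasing fit (equal weights).
def pava(y):
    vals, counts = [], []
    for v in y:
        vals.append(float(v)); counts.append(1)
        # pool adjacent blocks while the order constraint is violated
        while len(vals) > 1 and vals[-2] > vals[-1]:
            v2, c2 = vals.pop(), counts.pop()
            v1, c1 = vals.pop(), counts.pop()
            vals.append((c1 * v1 + c2 * v2) / (c1 + c2))
            counts.append(c1 + c2)
    fitted = []
    for v, c in zip(vals, counts):
        fitted.extend([v] * c)
    return fitted

print(pava([1.0, 3.0, 2.0, 2.5, 5.0]))  # -> [1.0, 2.5, 2.5, 2.5, 5.0]
```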
Peer reviewed
Direct link
Meijer, Rob R. – Journal of Educational Measurement, 2004
Two new methods have been proposed to determine unexpected sum scores on subtests (testlets), for both paper-and-pencil tests and computerized adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…
Descriptors: Probability, Adaptive Testing, Item Response Theory, Scores
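The hypergeometric idea can be sketched directly: condition on an examinee's total number-correct score. If the r correct responses were exchangeable across all n items, the number correct within an m-item testlet would follow a hypergeometric distribution, so a very low testlet score has a small tail probability. Illustrative numbers only; the article's bound is developed more carefully.

```python
# Tail probability of an unexpectedly low testlet score, conditioning
# on the total number-correct score.
from scipy.stats import hypergeom

n, r, m = 60, 45, 10                  # test length, total correct, testlet length
s = 3                                 # observed correct within the testlet
p_tail = hypergeom.cdf(s, n, r, m)    # P(testlet score <= s | total = r)
print(f"P(testlet score <= {s} | total = {r}) = {p_tail:.4f}")
```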
Spray, Judith A.; Welch, Catherine J. – 1986
The purpose of this study was to examine the effect that large within-examinee item difficulty variability had on estimates of the proportion of consistent classification of examinees into mastery categories over two test administrations. The classification consistency estimate was based on a single test administration from an estimation procedure…
Descriptors: Adults, Difficulty Level, Estimation (Mathematics), Mathematical Models
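In the same spirit as the single-administration estimate described here, a simplified Subkoviak-style sketch: under a binomial error model, each examinee's chance of passing a hypothetical retest follows from an estimated true proportion-correct, and agreement across two independent administrations is p^2 + (1 - p)^2. This is not the study's exact procedure.

```python
# Single-administration classification consistency under a binomial
# error model with a naive true-score estimate.
import numpy as np
from scipy.stats import binom

rng = np.random.default_rng(4)
n_items, cut = 40, 28                     # mastery cut: 28 of 40 correct
true_p = rng.beta(8, 4, 500)              # simulated true proportions
scores = rng.binomial(n_items, true_p)    # one observed administration

p_true_hat = scores / n_items             # naive true-score estimate
p_pass = binom.sf(cut - 1, n_items, p_true_hat)        # P(score >= cut)
consistency = np.mean(p_pass**2 + (1 - p_pass)**2)     # agree on pass or fail
print(f"estimated classification consistency = {consistency:.3f}")
```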
Peer reviewed
Levine, Michael V.; Rubin, Donald B. – Journal of Educational Statistics, 1979
A student may be so unlike other students that his/her aptitude test score fails to be a completely appropriate measure. We consider the problem of using the student's pattern of multiple-choice aptitude test answers to decide whether his/her score is an appropriate ability measure. (Author/CTM)
Descriptors: Answer Sheets, College Entrance Examinations, Guessing (Tests), Latent Trait Theory
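A later descendant of this appropriateness-measurement line of work is the standardized log-likelihood person-fit statistic l_z, sketched below under a 2PL model with invented parameters (not Levine and Rubin's original indices): unusually negative l_z flags response patterns, such as missing easy items while answering hard ones, that fit the model badly for that examinee.

```python
# Standardized log-likelihood person-fit statistic l_z under a 2PL model.
import numpy as np

a = np.array([1.0, 1.2, 0.8, 1.5, 1.1, 0.9])   # discriminations
b = np.array([-1.5, -0.5, 0.0, 0.5, 1.0, 2.0]) # difficulties
u = np.array([0, 0, 1, 0, 1, 1])               # misses easy, hits hard items
theta = 0.0                                    # ability estimate (assumed)

p = 1 / (1 + np.exp(-a * (theta - b)))         # response probabilities
l0 = np.sum(u * np.log(p) + (1 - u) * np.log(1 - p))    # observed loglik
e = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))     # its expectation
v = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)      # its variance
lz = (l0 - e) / np.sqrt(v)
print(f"l_z = {lz:.2f}  (large negative values suggest misfit)")
```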