Showing all 13 results
Peer reviewed
Ilhan, Mustafa – International Journal of Assessment Tools in Education, 2019
This study investigated the effectiveness of statistical adjustments applied to rater bias in many-facet Rasch analysis. Changes were first made to a dataset free of "rater × examinee" bias so that it would contain such bias. Bias adjustment was then applied to the rater bias included in the data file,…
Descriptors: Statistical Analysis, Item Response Theory, Evaluators, Bias
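The abstract truncates before the adjustment procedure itself. For orientation, a dichotomous many-facet Rasch model puts examinee ability, item difficulty, and rater severity on one logit scale, and a "rater × examinee" bias shows up as a pair-specific shift. A minimal sketch in Python; the function name and the additive bias term are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def mfrm_prob(theta, delta, severity, bias=0.0):
    """P(success) under a dichotomous many-facet Rasch model.

    logit = examinee ability - item difficulty - rater severity,
    plus an optional "rater x examinee" interaction term (the kind
    of bias this study injects and then adjusts for).
    """
    return 1.0 / (1.0 + np.exp(-(theta - delta - severity + bias)))

# Same examinee, item, and rater, with and without a pair-specific bias:
fair = mfrm_prob(theta=0.5, delta=0.0, severity=0.2)
harsh = mfrm_prob(theta=0.5, delta=0.0, severity=0.2, bias=-0.8)
print(f"unbiased={fair:.3f}  biased={harsh:.3f}")
```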
Peer reviewed
Rijmen, Frank; Jeon, Minjeong; von Davier, Matthias; Rabe-Hesketh, Sophia – Journal of Educational and Behavioral Statistics, 2014
Second-order item response theory models have been used for assessments consisting of several domains, such as content areas. We extend the second-order model to a third-order model for assessments that include subdomains nested in domains. Using a graphical model framework, it is shown how the model does not suffer from the curse of…
Descriptors: Item Response Theory, Models, Educational Assessment, Computation
Peer reviewed
Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015
Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to the wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…
Descriptors: Item Response Theory, Test Format, Language Usage, Test Items
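The conventional fix the authors question is simple reverse recoding of negatively worded items. A one-line Python version of that recoding (the function name is illustrative), which the study argues may not make such items behave like positively worded ones:

```python
def reverse_code(responses, low=1, high=5):
    """Reverse-score a negatively worded Likert item on a low-high scale,
    so that 5 -> 1, 4 -> 2, and so on for a 1-5 scale."""
    return [low + high - r for r in responses]

print(reverse_code([1, 2, 5]))  # [5, 4, 1]
```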
Peer reviewed
Michaelides, Michalis P.; Haertel, Edward H. – Applied Measurement in Education, 2014
The standard error of equating quantifies the variability in the estimation of an equating function. Because common items for deriving equated scores are treated as fixed, the only source of variability typically considered arises from the estimation of common-item parameters from responses of samples of examinees. Use of alternative, equally…
Descriptors: Equated Scores, Test Items, Sampling, Statistical Inference
Peer reviewed
Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015
This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using classical test theory (CTT) versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…
Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory
Peer reviewed
Bartolucci, Francesco; Pennoni, Fulvia; Vittadini, Giorgio – Journal of Educational and Behavioral Statistics, 2011
An extension of the latent Markov Rasch model is described for the analysis of binary longitudinal data with covariates when subjects are collected in clusters, such as students clustered in classes. For each subject, a latent process is used to represent the characteristic of interest (e.g., ability) conditional on the effect of the cluster to…
Descriptors: Markov Processes, Data Analysis, Maximum Likelihood Statistics, Computation
Peer reviewed
de la Torre, Jimmy – Applied Psychological Measurement, 2009
Various sources of information that are usually available in testing situations, namely ancillary variables and the correlational structure of the latent abilities, are often ignored in ability estimation. A general model that incorporates these sources of information is proposed in this article. The model has a general…
Descriptors: Scoring, Multivariate Analysis, Ability, Computation
von Davier, Matthias; Sinharay, Sandip – Educational Testing Service, 2009
This paper presents an application of a stochastic approximation EM-algorithm using a Metropolis-Hastings sampler to estimate the parameters of an item response latent regression model. Latent regression models are extensions of item response theory (IRT) to a 2-level latent variable model in which covariates serve as predictors of the…
Descriptors: Item Response Theory, Regression (Statistics), Models, Methods
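As a rough sketch of the Metropolis-Hastings ingredient: each examinee's latent ability can be updated with a random-walk proposal accepted against an IRT likelihood times the normal prior implied by the latent regression. Everything below, from the Rasch measurement model to the function names, is an assumption made for illustration; the report's stochastic approximation EM wraps many such draws inside its parameter updates.

```python
import numpy as np

rng = np.random.default_rng(0)

def log_post(theta, y, b, mu, sigma):
    """Rasch log-likelihood of 0/1 responses y at item difficulties b,
    plus a N(mu, sigma^2) prior supplied by the latent regression."""
    p = 1.0 / (1.0 + np.exp(-(theta - b)))
    return np.sum(y * np.log(p) + (1 - y) * np.log(1 - p)) \
        - 0.5 * ((theta - mu) / sigma) ** 2

def mh_step(theta, y, b, mu, sigma, step=0.5):
    """One random-walk Metropolis-Hastings update of one examinee's theta."""
    prop = theta + rng.normal(0.0, step)
    log_ratio = log_post(prop, y, b, mu, sigma) - log_post(theta, y, b, mu, sigma)
    return prop if np.log(rng.uniform()) < log_ratio else theta

y = np.array([1, 0, 1, 1])           # toy response pattern
b = np.array([-0.5, 0.0, 0.5, 1.0])  # toy item difficulties
theta = 0.0
for _ in range(100):
    theta = mh_step(theta, y, b, mu=0.3, sigma=1.0)
print(round(theta, 2))
```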
Peer reviewed
Li, Deping; Oranje, Andreas; Jiang, Yanlin – Journal of Educational and Behavioral Statistics, 2009
To find population proficiency distributions, a two-level hierarchical linear model may be applied to large-scale survey assessments such as the National Assessment of Educational Progress (NAEP). The model and its parameter estimation are developed, and a simulation is carried out to evaluate parameter recovery. Subsequently, both a hierarchical and…
Descriptors: Computation, National Competency Tests, Measurement, Regression (Statistics)
Liu, Kimy; Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Tindal, Gerald – Behavioral Research and Teaching, 2008
BRT Math Screening Measures assess students' mathematics performance on grade-level standards in grades 1-8. A total of 24 test forms are available, three per grade, corresponding to fall, winter, and spring testing periods. Each form contains computation problems and application problems. BRT Math Screening Measures…
Descriptors: Test Items, Test Format, Test Construction, Item Response Theory
Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to document the instrument development of maze measures for grades 3-8. Each maze passage contained twelve omitted words that students filled in by choosing the best-fit word from among the provided options. In this technical report, we describe the process of creating, reviewing, and pilot testing the maze measures.…
Descriptors: Test Construction, Cloze Procedure, Multiple Choice Tests, Reading Tests
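The report documents instrument development rather than an algorithm, but the maze format itself is mechanical enough to sketch: omit words from a passage and pair each gap with the correct word plus distractors. A toy Python version, assuming random gap placement and distractor sampling; the actual BRT development process selected words and answer choices with far more care:

```python
import random

def make_maze(text, n_gaps=12, n_distractors=2, seed=0):
    """Build a toy maze task: blank out n_gaps words and pair each gap
    with the correct word plus distractors drawn from the passage."""
    rng = random.Random(seed)
    words = text.split()
    gaps = set(rng.sample(range(len(words)), n_gaps))
    items = []
    for i in sorted(gaps):
        pool = [w for j, w in enumerate(words) if j != i and w != words[i]]
        choices = rng.sample(pool, n_distractors) + [words[i]]
        rng.shuffle(choices)
        items.append((i, words[i], choices))
    passage = " ".join("____" if j in gaps else w for j, w in enumerate(words))
    return passage, items
```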
Peer reviewed
Lynch, Collin F., Ed.; Merceron, Agathe, Ed.; Desmarais, Michel, Ed.; Nkambou, Roger, Ed. – International Educational Data Mining Society, 2019
The 12th iteration of the International Conference on Educational Data Mining (EDM 2019) is organized under the auspices of the International Educational Data Mining Society in Montreal, Canada. The theme of this year's conference is EDM in Open-Ended Domains. As EDM has matured, it has increasingly been applied to open-ended and ill-defined tasks…
Descriptors: Data Collection, Data Analysis, Information Retrieval, Content Analysis
Peer reviewed
Johnson, Matthew S.; Jenkins, Frank – ETS Research Report Series, 2005
Large-scale educational assessments such as the National Assessment of Educational Progress (NAEP) sample examinees to whom an exam will be administered. In most situations the sampling design is not a simple random sample and must be accounted for in the estimating model. After reviewing the current operational estimation procedure for NAEP, this…
Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, National Competency Tests, Sampling