ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	17

Descriptor

Comparative Analysis	20
Computation	20
Item Response Theory	9
Error of Measurement	6
Maximum Likelihood Statistics	6
National Competency Tests	6
Sampling	6
Test Items	5
Ability	4
Grade 8	4
Monte Carlo Methods	4
Probability	4
Reading Tests	4
Regression (Statistics)	4
Simulation	4
Statistical Analysis	4
Statistical Bias	4
Accuracy	3
Hierarchical Linear Modeling	3
Markov Processes	3
Models	3
Scores	3
Statistical Distributions	3
Test Bias	3
Adaptive Testing	2
More ▼

Source

ETS Research Report Series

Publication Type

Journal Articles	20
Reports - Research	20
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	6
Grade 8	4
Junior High Schools	4
Middle Schools	4
Secondary Education	4
Early Childhood Education	2
Grade 4	2
Intermediate Grades	2
Primary Education	2
Grade 1	1
Grade 3	1
Kindergarten	1
More ▼

Audience

Location

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	6
Early Childhood Longitudinal…	2

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

Applying the Hájek Approach in Formula-Based Variance Estimation. Research Report. ETS RR-17-24

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe – ETS Research Report Series, 2017

The variance formula derived for a two-stage sampling design without replacement employs the joint inclusion probabilities in the first-stage selection of clusters. One of the difficulties encountered in data analysis is the lack of information about such joint inclusion probabilities. One way to solve this issue is by applying Hájek's…

Descriptors: Mathematical Formulas, Computation, Sampling, Research Design

An Empirical Investigation of the Potential Impact of Item Misfit on Test Scores. Research Report. ETS RR-17-60

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Robin, Frederic – ETS Research Report Series, 2017

In this study, we examined the potential impact of item misfit on the reported scores of an admission test from the subpopulation invariance perspective. The target population of the test consisted of 3 major subgroups with different geographic regions. We used the logistic regression function to estimate item parameters of the operational items…

Descriptors: Scores, Test Items, Test Bias, International Assessment

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Weighting Test Samples in IRT Linking and Equating: Toward an Improved Sampling Design for Complex Equating. Research Report. ETS RR-13-39

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013

Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…

Descriptors: Item Response Theory, Test Items, Sampling, True Scores

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed
PDF on ERIC

Download full text

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

Continuous Exponential Families: An Equating Tool. Research Report. ETS RR-08-05

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

Continuous exponential families may be employed to find continuous distributions with the same initial moments as the discrete distributions encountered in typical applications of classical equating. These continuous distributions provide distribution functions and quantile functions that may be employed in equating. To illustrate, an application…

Descriptors: Equated Scores, Statistical Distributions, Probability, Computation

Linking with Continuous Exponential Families: Single-Group Designs. Research Report. ETS RR-08-61

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

Continuous exponential families are applied to linking forms via a single-group design. In this application, a distribution from the continuous bivariate exponential family is used that has selected moments that match those of the bivariate distribution of scores on the forms to be linked. The selected continuous bivariate distribution then yields…

Descriptors: Equated Scores, Probability, Statistical Distributions, Models

Comparing Multiple-Group Multinomial Log-Linear Models for Multidimensional Skill Distributions in the General Diagnostic Model. Research Report. ETS RR-08-35

Peer reviewed
PDF on ERIC

Download full text

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008

The general diagnostic model (GDM) utilizes located latent classes for modeling a multidimensional proficiency variable. In this paper, the GDM is extended by employing a log-linear model for multiple populations that assumes constraints on parameters across multiple groups. This constrained model is compared to log-linear models that assume…

Descriptors: Comparative Analysis, Models, Computation, National Competency Tests

Parameter Recovery and Subpopulation Proficiency Estimation in Hierarchical Latent Regression Models. Research Report. ETS RR-07-27

Peer reviewed
PDF on ERIC

Download full text

Li, Deping; Oranje, Andreas; Jiang, Yanlin – ETS Research Report Series, 2007

The hierarchical latent regression model (HLRM) is a flexible framework for estimating group-level proficiency while taking into account the complex sample designs often found in large-scale educational surveys. A complex assessment design in which information is collected at different levels (such as student, school, and district), the model also…

Descriptors: Hierarchical Linear Modeling, Regression (Statistics), Computation, Comparative Analysis

Comparing Different Approaches of Bias Correction for Ability Estimation in IRT Models. Research Report. ETS RR-08-13

Peer reviewed
PDF on ERIC

Download full text

Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2008

The method of maximum-likelihood is typically applied to item response theory (IRT) models when the ability parameter is estimated while conditioning on the true item parameters. In practice, the item parameters are unknown and need to be estimated first from a calibration sample. Lewis (1985) and Zhang and Lu (2007) proposed the expected response…

Descriptors: Item Response Theory, Comparative Analysis, Computation, Ability

Robustness of Value-Added Analysis of School Effectiveness. Research Report. ETS RR-08-22

Peer reviewed
PDF on ERIC

Download full text

Braun, Henry; Qu, Yanxuan – ETS Research Report Series, 2008

This paper reports on a study conducted to investigate the consistency of the results between 2 approaches to estimating school effectiveness through value-added modeling. Estimates of school effects from the layered model employing item response theory (IRT) scaled data are compared to estimates derived from a discrete growth model based on the…

Descriptors: Value Added Models, School Effectiveness, Robustness (Statistics), Computation

Small-Sample DIF Estimation Using Log-Linear Smoothing: A SIBTEST Application. Research Report. ETS RR-07-10

Peer reviewed
PDF on ERIC

Download full text

Puhan, Gautam; Moses, Tim P.; Yu, Lei; Dorans, Neil J. – ETS Research Report Series, 2007

The purpose of the current study was to examine whether log-linear smoothing of observed score distributions in small samples results in more accurate differential item functioning (DIF) estimates under the simultaneous item bias test (SIBTEST) framework. Data from a teacher certification test were analyzed using White candidates in the reference…

Descriptors: Test Bias, Computation, Sample Size, Accuracy

Refinement of a Bias-Correction Procedure for the Weighted Likelihood Estimator of Ability. Research Report. ETS RR-07-23

Peer reviewed
PDF on ERIC

Download full text

Zhang, Jinming; Lu, Ting – ETS Research Report Series, 2007

In practical applications of item response theory (IRT), item parameters are usually estimated first from a calibration sample. After treating these estimates as fixed and known, ability parameters are then estimated. However, the statistical inferences based on the estimated abilities can be misleading if the uncertainty of the item parameter…

Descriptors: Item Response Theory, Ability, Error of Measurement, Maximum Likelihood Statistics

Mapping State Standards to the NAEP Scale. Research Report. ETS RR-08-57

Peer reviewed
PDF on ERIC

Download full text

Braun, Henry; Qian, Jiahe – ETS Research Report Series, 2008

This report describes the derivation and evaluation of a method for comparing the performance standards for public school students set by different states. It is based on an approach proposed by McLaughlin and associates, which constituted an innovative attempt to resolve the confusion and concern that occurs when very different proportions of…

Descriptors: State Standards, Comparative Analysis, Public Schools, National Competency Tests

Confidence Intervals for Proportion Estimates in Complex Samples. Research Report. ETS RR-06-21

Peer reviewed
PDF on ERIC

Download full text

Oranje, Andreas – ETS Research Report Series, 2006

Confidence intervals are an important tool to indicate uncertainty of estimates and to give an idea of probable values of an estimate if a different sample from the population was drawn or a different sample of measures was used. Standard symmetric confidence intervals for proportion estimates based on a normal approximation can yield bounds…

Descriptors: Computation, Statistical Analysis, National Competency Tests, Comparative Analysis

Previous Page | Next Page »

Pages: 1 | 2

Zhang, Jinming	4
Oranje, Andreas	3
Qian, Jiahe	3
Braun, Henry	2
Haberman, Shelby J.	2
Kim, Sooyeon	2
Deping, Li	1
Dorans, Neil J.	1
Jenkins, Frank	1
Jiang, Yanlin	1
Jiang, Yanming	1
Johnson, Matthew S.	1
Lee, Yi-Hsuan	1
Li, Deping	1
Lu, Ting	1
Moses, Tim	1
Moses, Tim P.	1
Puhan, Gautam	1
Qu, Yanxuan	1
Robin, Frederic	1
Rock, Donald A.	1
Wang, Zhen	1
Xu, Xueli	1
Yao, Lihua	1
Yoo, Hanwook Henry	1
More ▼