Showing all 11 results
Peer reviewed | PDF on ERIC
Kim, Sooyeon; Robin, Frederic – ETS Research Report Series, 2017
In this study, we examined the potential impact of item misfit on the reported scores of an admission test from the subpopulation invariance perspective. The target population of the test consisted of 3 major subgroups from different geographic regions. We used the logistic regression function to estimate item parameters of the operational items…
Descriptors: Scores, Test Items, Test Bias, International Assessment
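The excerpt above mentions estimating item parameters with a logistic regression function and checking whether those parameters hold up across geographic subgroups. The sketch below is a minimal, hypothetical illustration of that general idea, not the authors' procedure: a logistic item response curve is fitted for one item separately within each subgroup, and the fitted slope and implied difficulty are compared. All data, group names, and parameter values are invented.

```python
# Hypothetical sketch: fit a logistic item-response curve for one item
# within each subgroup and compare parameters (subpopulation invariance idea).
# Not the authors' procedure; data are simulated for illustration.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def simulate_group(n, a, b):
    """Simulate (ability, correct/incorrect) pairs for a 2PL-like item."""
    theta = rng.normal(size=n)                      # examinee ability
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))      # item response curve
    return theta, rng.binomial(1, p)

# Three geographic subgroups; the last uses shifted parameters (misfit).
groups = {"region_A": (1.2, 0.0), "region_B": (1.2, 0.0), "region_C": (0.7, 0.5)}

for name, (a, b) in groups.items():
    theta, y = simulate_group(2000, a, b)
    model = LogisticRegression().fit(theta.reshape(-1, 1), y)
    slope = model.coef_[0, 0]                       # ~ discrimination
    intercept = model.intercept_[0]                 # ~ -discrimination * difficulty
    print(f"{name}: slope={slope:.2f}, difficulty={-intercept / slope:.2f}")

# Large between-group differences in these estimates flag items whose
# parameters are not invariant across subpopulations.
```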
Peer reviewed | PDF on ERIC
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
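The entry above describes evaluating alternative scoring rules in terms of test reliability. As a purely illustrative sketch of that kind of comparison (the data and the two scoring rules below are invented, not the authors'), one can score the same simulated responses under two keys and compare coefficient alpha.

```python
# Illustrative only: compare two scoring rules by internal-consistency
# reliability (coefficient alpha). Data and scoring rules are invented.
import numpy as np

def cronbach_alpha(scores):
    """Coefficient alpha for an examinees-by-items score matrix."""
    k = scores.shape[1]
    item_var = scores.var(axis=0, ddof=1).sum()
    total_var = scores.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1.0 - item_var / total_var)

rng = np.random.default_rng(1)
n_examinees, n_items = 500, 20
ability = rng.normal(size=(n_examinees, 1))
responses = (ability + rng.normal(size=(n_examinees, n_items))) > 0  # keyed picks

# Scoring rule 1: full credit for the keyed response only.
scores_strict = responses.astype(float)
# Scoring rule 2: hypothetical partial credit for a "near-key" option.
scores_partial = scores_strict + 0.5 * (~responses) * (rng.random((n_examinees, n_items)) < 0.3)

print("alpha, strict key:  ", round(cronbach_alpha(scores_strict), 3))
print("alpha, partial key: ", round(cronbach_alpha(scores_partial), 3))
```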
Peer reviewed | PDF on ERIC
Fife, James H.; James, Kofi; Peters, Stephanie – ETS Research Report Series, 2020
The concept of variability is central to statistics. In this research report, we review mathematics education research on variability and, based on that review and on feedback from an expert panel, propose a learning progression (LP) for variability. The structure of the proposed LP consists of 5 levels of sophistication in understanding…
Descriptors: Mathematics Education, Statistics Education, Feedback (Response), Research Reports
Peer reviewed | PDF on ERIC
Fu, Jianbin; Wise, Maxwell – ETS Research Report Series, 2012
In the Cognitively Based Assessment of, for, and as Learning ("CBAL"™) research initiative, innovative K-12 prototype tests based on cognitive competency models are developed. This report presents the statistical results of the 2 CBAL Grade 8 writing tests and 2 Grade 7 reading tests administered to students in 20 states in spring 2011.…
Descriptors: Cognitive Ability, Grade 8, Writing Tests, Grade 7
Peer reviewed | PDF on ERIC
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
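Agreement between automated and human essay scores, one of the evaluation dimensions mentioned in the entry above, is often summarized with quadratically weighted kappa. The generic implementation below is offered only as an illustration of that statistic; the report's actual agreement measures and data are not shown here, and the example ratings are invented.

```python
# Generic quadratically weighted kappa between two raters (e.g., human vs.
# automated essay scores). Illustrative; not tied to the report's data.
import numpy as np

def quadratic_weighted_kappa(rater_a, rater_b, min_score, max_score):
    k = max_score - min_score + 1
    observed = np.zeros((k, k))
    for a, b in zip(rater_a, rater_b):
        observed[a - min_score, b - min_score] += 1
    observed /= observed.sum()
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0))
    i, j = np.meshgrid(np.arange(k), np.arange(k), indexing="ij")
    weights = ((i - j) ** 2) / ((k - 1) ** 2)        # quadratic disagreement
    return 1.0 - (weights * observed).sum() / (weights * expected).sum()

human = [4, 3, 5, 2, 4, 3, 4, 5, 3, 2]
machine = [4, 3, 4, 2, 5, 3, 4, 5, 2, 2]
print(round(quadratic_weighted_kappa(human, machine, 1, 6), 3))
```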
Peer reviewed | PDF on ERIC
Liao, Chi-Wen; Livingston, Samuel A. – ETS Research Report Series, 2008
Randomly equivalent forms (REFs) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…
Descriptors: Equated Scores, Item Analysis, Test Items, Difficulty Level
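The procedure described in the entry above, building randomly equivalent forms by stratified random assignment of items, can be illustrated with a small sketch. The item pool, strata, and field names below are hypothetical; only the general mechanism of shuffling within content-by-difficulty strata and alternating assignment to two forms is shown.

```python
# Illustrative sketch of stratified random assignment of items to two forms,
# stratifying on content area and predicted difficulty. Item pool is invented.
import random
from collections import defaultdict

random.seed(42)

# Hypothetical item pool: (item_id, content_area, predicted_difficulty_band)
pool = [(i, random.choice(["listening", "reading"]),
         random.choice(["easy", "medium", "hard"])) for i in range(60)]

# Group items into strata defined by content x predicted difficulty.
strata = defaultdict(list)
for item in pool:
    strata[(item[1], item[2])].append(item)

form_1, form_2 = [], []
for stratum_items in strata.values():
    random.shuffle(stratum_items)                        # random order within stratum
    for idx, item in enumerate(stratum_items):
        (form_1 if idx % 2 == 0 else form_2).append(item)  # alternate assignment

print(len(form_1), "items in form 1;", len(form_2), "items in form 2")
# Repeating this whole assignment many times (the report uses 50 replications)
# yields pairs of randomly equivalent forms whose equivalence can then be checked.
```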
Peer reviewed | PDF on ERIC
Li, Deping; Oranje, Andreas – ETS Research Report Series, 2007
Two versions of a general method for approximating the standard error of regression effect estimates within an IRT-based latent regression model are compared. The general method is based on Binder's (1983) approach, accounting for complex samples and finite populations by Taylor series linearization. In contrast, the current National Assessment of…
Descriptors: Error of Measurement, Regression (Statistics), Trend Analysis, National Competency Tests
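Taylor series linearization for regression standard errors under a complex sample, referenced in the entry above, can be sketched generically: compute each observation's contribution to the coefficient estimates, total those contributions within primary sampling units, and take the between-PSU variance within strata. The code below is a textbook-style sketch under simplifying assumptions (ordinary least squares, equal weights, no finite population correction, simulated data), not the authors' implementation or the NAEP procedure.

```python
# Generic Taylor-linearization variance for OLS coefficients with
# stratified, clustered data. Simplified illustration; unweighted.
import numpy as np

rng = np.random.default_rng(3)
n, n_strata, psus_per_stratum = 400, 4, 5
stratum = rng.integers(0, n_strata, size=n)
psu = rng.integers(0, psus_per_stratum, size=n)
x = rng.normal(size=n)
y = 1.0 + 0.5 * x + rng.normal(size=n)

X = np.column_stack([np.ones(n), x])
XtX_inv = np.linalg.inv(X.T @ X)
beta = XtX_inv @ X.T @ y
resid = y - X @ beta

# Influence (score) contribution of each observation to beta.
z = (X * resid[:, None]) @ XtX_inv.T

# Between-PSU variance of summed contributions, pooled over strata.
V = np.zeros((2, 2))
for h in range(n_strata):
    totals = np.array([z[(stratum == h) & (psu == j)].sum(axis=0)
                       for j in range(psus_per_stratum)])
    n_h = totals.shape[0]
    dev = totals - totals.mean(axis=0)
    V += (n_h / (n_h - 1)) * dev.T @ dev

print("beta:", np.round(beta, 3))
print("linearized SEs:", np.round(np.sqrt(np.diag(V)), 3))
```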
Peer reviewed | PDF on ERIC
Sinharay, Sandip; Johnson, Matthew – ETS Research Report Series, 2005
"Item models" (LaDuca, Staples, Templeton, & Holzman, 1986) are classes from which it is possible to generate/produce items that are equivalent/isomorphic to other items from the same model (e.g., Bejar, 1996; Bejar, 2002). They have the potential to produce large number of high-quality items at reduced cost. This paper introduces…
Descriptors: Item Analysis, Test Items, Scoring, Psychometrics
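An item model in the sense cited above is essentially a template from which many interchangeable items can be generated. The toy generator below is purely illustrative (an invented template, not one from the paper) and only conveys the general mechanism of instantiating a model with different surface values.

```python
# Toy illustration of an "item model": a template whose slots are filled
# to generate isomorphic items. The template and number ranges are invented.
import random

random.seed(7)

TEMPLATE = ("A train travels {d} miles in {t} hours at a constant speed. "
            "How many miles does it travel per hour?")

def generate_item():
    t = random.randint(2, 9)
    speed = random.randint(20, 80)
    d = speed * t                      # keep the keyed answer a whole number
    return {"stem": TEMPLATE.format(d=d, t=t), "key": speed}

# Items generated from the same model are intended to be interchangeable
# (isomorphic) in difficulty and in what they measure.
for item in (generate_item() for _ in range(3)):
    print(item["stem"], "->", item["key"])
```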
Peer reviewed | PDF on ERIC
Puhan, Gautam; Boughton, Keith A.; Kim, Sooyeon – ETS Research Report Series, 2005
The study evaluated the comparability of two versions of a teacher certification test: a paper-and-pencil test (PPT) and a computer-based test (CBT). Standardized mean difference (SMD) and differential item functioning (DIF) analyses were used as measures of comparability at the test and item levels, respectively. Results indicated that effect sizes…
Descriptors: Comparative Analysis, Test Items, Statistical Analysis, Teacher Certification
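The test-level comparison described above rests on a standardized mean difference between the two administration modes. The sketch below shows one common way to compute an SMD with a pooled standard deviation; it is an assumption for illustration, not necessarily the exact SMD variant used in the report, and the score distributions are simulated.

```python
# Illustrative computation of a standardized mean difference (SMD) between
# scores from two test modes. Data simulated; not the report's SMD variant.
import numpy as np

rng = np.random.default_rng(11)
ppt_scores = rng.normal(loc=150.0, scale=20.0, size=800)   # paper-and-pencil
cbt_scores = rng.normal(loc=151.5, scale=20.0, size=800)   # computer-based

def standardized_mean_difference(a, b):
    """(mean_a - mean_b) / pooled standard deviation."""
    pooled_var = ((len(a) - 1) * a.var(ddof=1) + (len(b) - 1) * b.var(ddof=1)) \
                 / (len(a) + len(b) - 2)
    return (a.mean() - b.mean()) / np.sqrt(pooled_var)

smd = standardized_mean_difference(cbt_scores, ppt_scores)
print(f"test-level SMD (CBT vs. PPT): {smd:.3f}")
# Small |SMD| values are typically read as evidence of score comparability;
# item-level comparability is examined separately with DIF methods.
```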
Peer reviewed | PDF on ERIC
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2006
More than a dozen statistical models have been developed for the purpose of cognitive diagnosis. These models are intended to extract a much finer level of information from item responses than traditional unidimensional item response models. In this paper, a general diagnostic model (GDM) was used to analyze a set of simulated sparse data and real…
Descriptors: Statistical Analysis, National Competency Tests, Diagnostic Tests, Item Response Theory
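The general diagnostic model analyzed in the report above is considerably more flexible than the sketch that follows, which uses the much simpler DINA model only to illustrate what finer-grained diagnostic information looks like: each item requires a set of skills, and the probability of a correct response depends on whether the examinee masters all of them. The Q-matrix and parameters below are invented and are not drawn from the report.

```python
# Simple DINA-style cognitive diagnosis illustration (NOT the general
# diagnostic model from the report): the correct-response probability
# depends on whether an examinee masters every skill an item requires.
import numpy as np

# Q-matrix: rows are items, columns are skills (1 = item requires the skill).
Q = np.array([[1, 0, 0],
              [1, 1, 0],
              [0, 1, 1]])
slip, guess = 0.1, 0.2          # invented slip and guessing parameters

def p_correct(skill_profile):
    """P(correct) per item for a binary skill-mastery profile."""
    has_all_required = np.all(skill_profile >= Q, axis=1)
    return np.where(has_all_required, 1 - slip, guess)

for profile in ([0, 0, 0], [1, 1, 0], [1, 1, 1]):
    print(profile, "->", p_correct(np.array(profile)))
```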
Peer reviewed | PDF on ERIC
Lee, Yong-Won; Breland, Hunter; Muraki, Eiji – ETS Research Report Series, 2004
This study investigated the comparability of computer-based testing (CBT) writing prompts in the Test of English as a Foreign Language™ (TOEFL®) for examinees of different native language backgrounds. A total of 81 writing prompts introduced from July 1998 through August 2000 were examined using a three-step logistic regression procedure for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
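The three-step logistic regression procedure mentioned in the entry above is commonly carried out by comparing nested models: ability only, ability plus group membership (uniform DIF), and ability plus group plus their interaction (nonuniform DIF). The sketch below shows that generic comparison on simulated data; the variable names, data, and significance thresholds are assumptions for illustration, not the report's prompts, examinees, or exact decision rules.

```python
# Generic sketch of a three-step logistic regression DIF comparison:
# nested models with ability, ability + group, and ability + group +
# interaction. Simulated data; not the report's examinees or prompts.
import numpy as np
import statsmodels.api as sm
from scipy.stats import chi2

rng = np.random.default_rng(5)
n = 2000
group = rng.integers(0, 2, size=n)                  # e.g., two L1 backgrounds
ability = rng.normal(size=n)
# Simulate a mildly group-dependent prompt/item (uniform DIF).
logit = -0.2 + 1.0 * ability + 0.4 * group
y = rng.binomial(1, 1 / (1 + np.exp(-logit)))

def fit(columns):
    X = sm.add_constant(np.column_stack(columns))
    return sm.Logit(y, X).fit(disp=0)

m1 = fit([ability])                                 # step 1: ability only
m2 = fit([ability, group])                          # step 2: + group (uniform DIF)
m3 = fit([ability, group, ability * group])         # step 3: + interaction (nonuniform DIF)

lr_uniform = 2 * (m2.llf - m1.llf)
lr_nonuniform = 2 * (m3.llf - m2.llf)
print("uniform DIF:    LR chi2 =", round(lr_uniform, 2),
      " p =", round(chi2.sf(lr_uniform, df=1), 4))
print("nonuniform DIF: LR chi2 =", round(lr_nonuniform, 2),
      " p =", round(chi2.sf(lr_nonuniform, df=1), 4))
```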