ERIC - Search Results

Showing all 8 results

Investigating Robustness of Item Response Theory Proficiency Estimators to Atypical Response Behaviors under Two-Stage Multistage Testing. ETS GRE® Board Research Report. ETS GRE®-16-03. ETS Research Report No. RR-16-22

Peer reviewed

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2016

The purpose of this study is to evaluate the extent to which item response theory (IRT) proficiency estimation methods are robust to the presence of aberrant responses under the "GRE"® General Test multistage adaptive testing (MST) design. To that end, a wide range of atypical response behaviors affecting as much as 10% of the test items…

Descriptors: Item Response Theory, Computation, Robustness (Statistics), Response Style (Tests)

Evaluating Equity at the Local Level Using Bootstrap Tests. Research Report 2016-4

Kim, YoungKoung; DeCarlo, Lawrence T. – College Board, 2016

Because of concerns about test security, different test forms are typically used across different testing occasions. As a result, equating is necessary in order to get scores from the different test forms that can be used interchangeably. In order to assure the quality of equating, multiple equating methods are often examined. Various equity…

Descriptors: Equated Scores, Evaluation Methods, Sampling, Statistical Inference

Modeling Nonignorable Missing Data with Item Response Theory (IRT). Research Report. ETS RR-10-11

Rose, Norman; von Davier, Matthias; Xu, Xueli – Educational Testing Service, 2010

Large-scale educational surveys are low-stakes assessments of educational outcomes conducted using nationally representative samples. In these surveys, students do not receive individual scores, and the outcome of the assessment is inconsequential for respondents. The low-stakes nature of these surveys, as well as variations in average performance…

Descriptors: Item Response Theory, Educational Assessment, Data Analysis, Case Studies

Examining Item Functioning of Math Screening Measures for Grades 1-8 Students. Technical Report Number 08-04

Liu, Kimy; Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Tindal, Gerald – Behavioral Research and Teaching, 2008

BRT Math Screening Measures focus on students' mathematics performance in grade-level standards for students in grades 1-8. A total of 24 test forms are available with three test forms per grade corresponding to fall, winter, and spring testing periods. Each form contains computation problems and application problems. BRT Math Screening Measures…

Descriptors: Test Items, Test Format, Test Construction, Item Response Theory

Instrument Development Procedures for Maze Measures. Technical Report # 08-06

Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to document the instrument development of maze measures for grades 3-8. Each maze passage contained twelve omitted words that students filled in by choosing the best-fit word from among the provided options. In this technical report, we describe the process of creating, reviewing, and pilot testing the maze measures.…

Descriptors: Test Construction, Cloze Procedure, Multiple Choice Tests, Reading Tests

Bias Correction for the Maximum Likelihood Estimate of Ability. Research Report. ETS RR-05-15

Peer reviewed

Zhang, Jinming – ETS Research Report Series, 2005

Lord's bias function and the weighted likelihood estimation method are effective in reducing the bias of the maximum likelihood estimate of an examinee's ability under the assumption that the true item parameters are known. This paper presents simulation studies to determine the effectiveness of these two methods in reducing the bias when the item…

Descriptors: Statistical Bias, Maximum Likelihood Statistics, Computation, Ability

New Mexico Standards-Based Assessment Technical Report: Spring 2007 Administration

New Mexico Public Education Department, 2007

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

New Mexico Standards Based Assessment (NMSBA) Technical Report: 2006 Spring Administration

Griph, Gerald W. – New Mexico Public Education Department, 2006

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2006 NMSBA. The 2006 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Calibration, scaling, and equating procedures; (4) Standard setting;…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring