ERIC - Search Results

Showing all 12 results

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed

Guo, Hongwen; Johnson, Matthew S.; McCaffrey, Daniel F.; Gu, Lixiong – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

Evaluations of Automated Scoring Systems in Practice. Research Report. ETS RR-20-10

Peer reviewed

Rotou, Ourania; Rupp, André A. – ETS Research Report Series, 2020

This research report provides a description of the processes of evaluating the "deployability" of automated scoring (AS) systems from the perspective of large-scale educational assessments in operational settings. It discusses a comprehensive psychometric evaluation that entails analyses that take into consideration the specific purpose…

Descriptors: Computer Assisted Testing, Scoring, Educational Assessment, Psychometrics

Taming Log Files from Game/Simulation-Based Assessments: Data Models and Data Analysis Tools. Research Report. ETS RR-16-10

Peer reviewed

Hao, Jiangang; Smith, Lawrence; Mislevy, Robert; von Davier, Alina; Bauer, Malcolm – ETS Research Report Series, 2016

Extracting information efficiently from game/simulation-based assessment (G/SBA) logs requires two things: a well-structured log file and a set of analysis methods. In this report, we propose a generic data model specified as an extensible markup language (XML) schema for the log files of G/SBAs. We also propose a set of analysis methods for…

Descriptors: Evaluation Methods, Games, Computer Assisted Testing, Data Collection

Analysis of Keystroke Sequences in Writing Logs. Research Report. ETS RR-19-11

Peer reviewed

Zhu, Mengxiao; Zhang, Mo; Deane, Paul – ETS Research Report Series, 2019

The research on using event logs and item response time to study test-taking processes is rapidly growing in the field of educational measurement. In this study, we analyzed the keystroke logs collected from 761 middle school students in the United States as they completed a persuasive writing task. Seven variables were extracted from the…

Descriptors: Keyboarding (Data Entry), Data Collection, Data Analysis, Writing Processes

Exploring Online Learning Data Using Fractal Dimensions. Research Report. ETS RR-17-15

Peer reviewed

Guo, Hongwen – ETS Research Report Series, 2017

Data collected from online learning and tutoring systems for individual students showed strong autocorrelation or dependence because of content connection, knowledge-based dependency, or persistence of learning behavior. When the response data show little dependence or negative autocorrelations for individual students, it is suspected that…

Descriptors: Data Collection, Electronic Learning, Intelligent Tutoring Systems, Information Utilization

Statistical Methods for Assessments in Simulations and Serious Games. Research Report. ETS RR-14-12

Peer reviewed

Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014

Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…

Descriptors: Simulation, Evaluation Methods, Games, Data Collection

Evaluating the "TOEFL Junior"® Standard Test as a Measure of Progress for Young English Language Learners. Research Report. ETS RR-15-22

Peer reviewed

Gu, Lin; Lockwood, John; Powers, Donald E. – ETS Research Report Series, 2015

Standardized tests are often designed to provide only a snapshot of test takers' knowledge, skills, or abilities at a single point in time. Sometimes, however, they are expected to serve more demanding functions, one of which is assessing change in knowledge, skills, or ability over time because of learning effects. The latter is the case for the…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Standardized Tests

Collaborative Problem Solving and the Assessment of Cognitive Skills: Psychometric Considerations. Research Report. ETS RR-13-41

Peer reviewed

von Davier, Alina A.; Halpin, Peter F. – ETS Research Report Series, 2013

Collaboration is generally recognized as a core competency of today's knowledge economy and has taken a central role in recent theoretical and technological developments in education research. Yet, the methodology for assessing the learning benefits of collaboration continues to rely on educational tests designed for isolated individuals. Thus,…

Descriptors: Cooperative Learning, Problem Solving, Research, Psychometrics

An Alternative Data Collection Design for Equating with Very Small Samples. Research Report. ETS RR-08-11

Peer reviewed

Puhan, Gautam; Moses, Tim; Grant, Mary; McHale, Fred – ETS Research Report Series, 2008

A single group (SG) equating design with nearly equivalent test forms (SiGNET) design was developed by Grant (2006) to equate small volume tests. The basis of this design is that examinees take two largely overlapping test forms within a single administration. The scored items for the operational form are divided into mini-tests called testlets.…

Descriptors: Data Collection, Equated Scores, Item Sampling, Sample Size

Disclosure Risk in Educational Surveys: An Application to the National Assessment of Educational Progress. Research Report. ETS RR-07-24

Peer reviewed

Oranje, Andreas; Freund, David; Lin, Mei-jang; Tang, Yuxin – ETS Research Report Series, 2007

In this paper, a data perturbation method for minimizing the possibility of disclosure of participants' identities on a survey is described in the context of the National Assessment of Educational Progress (NAEP). The method distinguishes itself from most approaches because of the presence of cognitive tasks. Hence, a data edit should have minimal…

Descriptors: Student Surveys, Risk, National Competency Tests, Data Analysis

Linking Competencies in Educational Settings and Measuring Growth. Research Report. ETS RR-06-12

Peer reviewed

von Davier, Alina A.; Carstensen, Claus H.; von Davier, Matthias – ETS Research Report Series, 2006

Measuring and linking competencies require special instruments, special data collection designs, and special statistical models. The measurement instruments are tests or test forms, which can be used in the following situations: The same test can be given repeatedly; two or more parallel test forms (i.e., forms intended to be similar in…

Descriptors: Scores, Measurement Techniques, Competence, Comparative Analysis

Population Invariance of Test Equating and Linking: Theory Extension and Applications across Exams. Research Report. ETS RR-06-31

Peer reviewed

von Davier, Alina A., Ed.; Liu, Mei, Ed. – ETS Research Report Series, 2006

This report builds on and extends existing research on population invariance to new tests and issues. The authors lay the foundation for a deeper understanding of the use of population invariance measures in a wide variety of practical contexts. The invariance of linear, equipercentile, and IRT equating methods is examined using data from five…

Descriptors: Equated Scores, Statistical Analysis, Data Collection, Test Format
