Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 10 |
Descriptor
| Data Collection | 12 |
| Scores | 6 |
| Data Analysis | 4 |
| Item Response Theory | 4 |
| Computation | 3 |
| Computer Assisted Testing | 3 |
| Equated Scores | 3 |
| Evaluation Methods | 3 |
| Scaling | 3 |
| Test Construction | 3 |
| Error of Measurement | 2 |
Source
| ETS Research Report Series | 12 |
Author
| von Davier, Alina A. | 2 |
| Bauer, Malcolm | 1 |
| Carstensen, Claus H. | 1 |
| McCaffrey, Daniel F. | 1 |
| Deane, Paul | 1 |
| Freund, David | 1 |
| Fu, Jianbin | 1 |
| Grant, Mary | 1 |
| Gu, Lin | 1 |
| Guo, Hongwen | 1 |
| Halpin, Peter F. | 1 |
Publication Type
| Journal Articles | 12 |
| Reports - Research | 11 |
| Collected Works - General | 1 |
| Information Analyses | 1 |
| Reports - Descriptive | 1 |
Education Level
| Junior High Schools | 2 |
| Middle Schools | 2 |
| Secondary Education | 2 |
| Elementary Education | 1 |
| Grade 4 | 1 |
| Grade 8 | 1 |
| Higher Education | 1 |
| Intermediate Grades | 1 |
| Postsecondary Education | 1 |
Location
| California | 1 |
| Nevada | 1 |
| New Jersey | 1 |
Assessments and Surveys
| ACT Assessment | 1 |
| Advanced Placement… | 1 |
| College Level Examination… | 1 |
| Law School Admission Test | 1 |
| National Assessment of… | 1 |
| SAT (College Admission Test) | 1 |
| Test of English as a Foreign… | 1 |
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixiong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling
Rotou, Ourania; Rupp, André A. – ETS Research Report Series, 2020
This research report provides a description of the processes of evaluating the "deployability" of automated scoring (AS) systems from the perspective of large-scale educational assessments in operational settings. It discusses a comprehensive psychometric evaluation that entails analyses that take into consideration the specific purpose…
Descriptors: Computer Assisted Testing, Scoring, Educational Assessment, Psychometrics
Hao, Jiangang; Smith, Lawrence; Mislevy, Robert; von Davier, Alina; Bauer, Malcolm – ETS Research Report Series, 2016
Extracting information efficiently from game/simulation-based assessment (G/SBA) logs requires two things: a well-structured log file and a set of analysis methods. In this report, we propose a generic data model specified as an extensible markup language (XML) schema for the log files of G/SBAs. We also propose a set of analysis methods for…
Descriptors: Evaluation Methods, Games, Computer Assisted Testing, Data Collection
Zhu, Mengxiao; Zhang, Mo; Deane, Paul – ETS Research Report Series, 2019
The research on using event logs and item response time to study test-taking processes is rapidly growing in the field of educational measurement. In this study, we analyzed the keystroke logs collected from 761 middle school students in the United States as they completed a persuasive writing task. Seven variables were extracted from the…
Descriptors: Keyboarding (Data Entry), Data Collection, Data Analysis, Writing Processes
Guo, Hongwen – ETS Research Report Series, 2017
Data collected from online learning and tutoring systems for individual students showed strong autocorrelation or dependence because of content connection, knowledge-based dependency, or persistence of learning behavior. When the response data show little dependence or negative autocorrelations for individual students, it is suspected that…
Descriptors: Data Collection, Electronic Learning, Intelligent Tutoring Systems, Information Utilization
Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014
Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…
Descriptors: Simulation, Evaluation Methods, Games, Data Collection
Gu, Lin; Lockwood, John; Powers, Donald E. – ETS Research Report Series, 2015
Standardized tests are often designed to provide only a snapshot of test takers' knowledge, skills, or abilities at a single point in time. Sometimes, however, they are expected to serve more demanding functions, one of which is assessing change in knowledge, skills, or ability over time because of learning effects. The latter is the case for the…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Standardized Tests
von Davier, Alina A.; Halpin, Peter F. – ETS Research Report Series, 2013
Collaboration is generally recognized as a core competency of today's knowledge economy and has taken a central role in recent theoretical and technological developments in education research. Yet, the methodology for assessing the learning benefits of collaboration continues to rely on educational tests designed for isolated individuals. Thus,…
Descriptors: Cooperative Learning, Problem Solving, Research, Psychometrics
Puhan, Gautam; Moses, Tim; Grant, Mary; McHale, Fred – ETS Research Report Series, 2008
A single group (SG) equating with nearly equivalent test forms (SiGNET) design was developed by Grant (2006) to equate small-volume tests. The basis of this design is that examinees take two largely overlapping test forms within a single administration. The scored items for the operational form are divided into mini-tests called testlets.…
Descriptors: Data Collection, Equated Scores, Item Sampling, Sample Size
Oranje, Andreas; Freund, David; Lin, Mei-jang; Tang, Yuxin – ETS Research Report Series, 2007
In this paper, a data perturbation method for minimizing the possibility of disclosure of participants' identities on a survey is described in the context of the National Assessment of Educational Progress (NAEP). The method distinguishes itself from most approaches because of the presence of cognitive tasks. Hence, a data edit should have minimal…
Descriptors: Student Surveys, Risk, National Competency Tests, Data Analysis
von Davier, Alina A.; Carstensen, Claus H.; von Davier, Matthias – ETS Research Report Series, 2006
Measuring and linking competencies require special instruments, special data collection designs, and special statistical models. The measurement instruments are tests or test forms, which can be used in the following situations: The same test can be given repeatedly; two or more parallel test forms (i.e., forms intended to be similar in…
Descriptors: Scores, Measurement Techniques, Competence, Comparative Analysis
von Davier, Alina A., Ed.; Liu, Mei, Ed. – ETS Research Report Series, 2006
This report builds on and extends existing research on population invariance to new tests and issues. The authors lay the foundation for a deeper understanding of the use of population invariance measures in a wide variety of practical contexts. The invariance of linear, equipercentile, and IRT equating methods is examined using data from five…
Descriptors: Equated Scores, Statistical Analysis, Data Collection, Test Format

