Showing 31 to 45 of 81 results
Peer reviewed
Wang, Jue; Engelhard, George, Jr. – Educational Measurement: Issues and Practice, 2019
In this digital ITEMS module, Dr. Jue Wang and Dr. George Engelhard Jr. describe the Rasch measurement framework for the construction and evaluation of new measures and scales. From a theoretical perspective, they discuss the historical and philosophical perspectives on measurement with a focus on Rasch's concept of specific objectivity and…
Descriptors: Item Response Theory, Evaluation Methods, Measurement, Goodness of Fit
Peer reviewed
Wright, Daniel B. – Educational Measurement: Issues and Practice, 2019
There is much discussion about, and there are many policies to address, achievement gaps among groups of students. The focus here is on a different gap, and it is argued that it, too, should be of concern. Speed gaps are differences in how quickly different groups of students answer the questions on academic assessments. To investigate some speed…
Descriptors: Academic Achievement, Achievement Gap, Reaction Time, Educational Testing
Peer reviewed
Zhang, Mo; Bennett, Randy E.; Deane, Paul; van Rijn, Peter W. – Educational Measurement: Issues and Practice, 2019
This study compared gender groups on the processes used in writing essays in an online assessment. Middle-school students from four grades responded to essays in two persuasive subgenres, argumentation and policy recommendation. Writing processes were inferred from four indicators extracted from students' keystroke logs. In comparison to males, on…
Descriptors: Gender Differences, Essays, Computer Assisted Testing, Persuasive Discourse
Peer reviewed
Soland, James – Educational Measurement: Issues and Practice, 2019
As computer-based tests become more common, there is a growing wealth of metadata related to examinees' response processes, which include solution strategies, concentration, and operating speed. One common type of metadata is item response time. While response times have been used extensively to improve estimates of achievement, little work…
Descriptors: Test Items, Item Response Theory, Metadata, Self Efficacy
Peer reviewed
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Peer reviewed
Wise, Steven L.; Kingsbury, G. Gage; Webb, Norman L. – Educational Measurement: Issues and Practice, 2015
The alignment between a test and the content domain it measures represents key evidence for the validation of test score inferences. Although procedures have been developed for evaluating the content alignment of linear tests, these procedures are not readily applicable to computerized adaptive tests (CATs), which require large item pools and do…
Descriptors: Computer Assisted Testing, Adaptive Testing, Alignment (Education), Test Content
Peer reviewed
Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020
Research indicates that the performance gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance gap, but many accommodation studies have not used a randomized design and are based on relatively small…
Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards
Peer reviewed
Davis, Laurie; Morrison, Kristin; Kong, Xiaojing; McBride, Yuanyuan – Educational Measurement: Issues and Practice, 2017
The use of tablets for large-scale testing programs has transitioned from concept to reality for many state testing programs. This study extended previous research on score comparability between tablets and computers with high school students to compare score distributions across devices for reading, math, and science and to evaluate device…
Descriptors: Computer Assisted Testing, Handheld Devices, Telecommunications, Scoring
Peer reviewed
Qian, Hong; Staniewska, Dorota; Reckase, Mark; Woo, Ada – Educational Measurement: Issues and Practice, 2016
This article addresses the issue of how to detect item preknowledge using item response time data in two computer-based large-scale licensure examinations. Item preknowledge is indicated by an unexpected short response time and a correct response. Two samples were used for detecting item preknowledge for each examination. The first sample was from…
Descriptors: Reaction Time, Licensing Examinations (Professions), Computer Assisted Testing, Prior Learning
Peer reviewed
Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016
Testing organizations need large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, require a process that is both time-consuming and expensive because each item is written,…
Descriptors: Test Items, Test Construction, Psychometrics, Models
Peer reviewed
Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2013
Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…
Descriptors: Educational Assessment, Test Items, Automation, Computer Assisted Testing
Peer reviewed
Kosh, Audra E.; Greene, Jeffrey A.; Murphy, P. Karen; Burdick, Hal; Firetto, Carla M.; Elmore, Jeff – Educational Measurement: Issues and Practice, 2018
We explored the feasibility of using automated scoring to assess upper-elementary students' reading ability through analysis of transcripts of students' small-group discussions about texts. Participants included 35 fourth-grade students across two classrooms who engaged in a literacy intervention called Quality Talk. During the course of one…
Descriptors: Computer Assisted Testing, Small Group Instruction, Group Discussion, Student Evaluation
Peer reviewed
Randall, Jennifer; Sireci, Stephen; Li, Xueming; Kaira, Leah – Educational Measurement: Issues and Practice, 2012
As access and reliance on technology continue to increase, so does the use of computerized testing for admissions, licensure/certification, and accountability exams. Nonetheless, full computer-based test (CBT) implementation can be difficult due to limited resources. As a result, some testing programs offer both CBT and paper-based test (PBT)…
Descriptors: Science Tests, Computer Assisted Testing, Scores, Test Bias
Peer reviewed
Pommerich, Mary – Educational Measurement: Issues and Practice, 2012
Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…
Descriptors: Testing, Scores, Measurement, Test Construction
Peer reviewed
Kingston, Neal; Nash, Brooke – Educational Measurement: Issues and Practice, 2011
An effect size of about 0.70 (or a range of 0.40-0.70) is often claimed for the efficacy of formative assessment, but this claim is not supported by the existing research base. More than 300 studies that appeared to address the efficacy of formative assessment in grades K-12 were reviewed. Many of the studies had severely flawed research designs yielding…
Descriptors: Elementary Secondary Education, Formative Evaluation, Program Effectiveness, Effect Size