Showing all 12 results
Organisciak, Peter; Acar, Selcuk; Dumas, Denis; Berthiaume, Kelly – Grantee Submission, 2023
Automated scoring for divergent thinking (DT) seeks to overcome a key obstacle to creativity measurement: the effort, cost, and reliability of scoring open-ended tests. For a common test of DT, the Alternate Uses Task (AUT), the primary automated approach casts the problem as a semantic distance between a prompt and the resulting idea in a text…
Descriptors: Automation, Computer Assisted Testing, Scoring, Creative Thinking
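As a rough illustration of the semantic-distance idea this abstract describes, a response to an Alternate Uses prompt can be embedded alongside the prompt, with a larger embedding distance read as a more original idea. The library, model checkpoint, and example inputs below are assumptions for illustration, not the tooling used in the cited work.

```python
# Minimal sketch: semantic-distance scoring of an Alternate Uses Task response.
# The sentence-transformers library and the "all-MiniLM-L6-v2" checkpoint are
# assumptions for illustration, not the model used in the cited study.
from sentence_transformers import SentenceTransformer
from scipy.spatial.distance import cosine

model = SentenceTransformer("all-MiniLM-L6-v2")

def semantic_distance(prompt: str, response: str) -> float:
    """Cosine distance between prompt and response embeddings.

    Larger distances are read as more remote, hence more original, ideas.
    """
    prompt_vec, response_vec = model.encode([prompt, response])
    return float(cosine(prompt_vec, response_vec))

if __name__ == "__main__":
    print(semantic_distance("brick", "grind it up to make pigment for paint"))
```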
Peer reviewed
Direct link
Egberink, Iris J. L.; Meijer, Rob R.; Tendeiro, Jorge N. – Educational and Psychological Measurement, 2015
A popular method to assess measurement invariance of a particular item is based on likelihood ratio tests with all other items as anchor items. The results of this method are often only reported in terms of statistical significance, and researchers have proposed different methods to empirically select anchor items. It is unclear, however, how many…
Descriptors: Personality Measures, Computer Assisted Testing, Measurement, Test Items
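The anchor-item approach mentioned here boils down to a likelihood-ratio comparison: fit the model once with the studied item's parameters constrained equal across groups and once with them free, then test the improvement in fit. A minimal sketch of that final step, with placeholder log-likelihoods rather than values from the cited study:

```python
# Minimal sketch of the likelihood-ratio step behind anchor-item invariance
# testing. The log-likelihoods and degrees of freedom are placeholders.
from scipy.stats import chi2

def lr_test(loglik_constrained: float, loglik_free: float, df: int) -> float:
    """p-value for G^2 = 2 * (LL_free - LL_constrained) against a chi-square(df)."""
    g2 = 2.0 * (loglik_free - loglik_constrained)
    return float(chi2.sf(g2, df))

if __name__ == "__main__":
    # e.g., a 2PL item with both parameters freed across two groups -> df = 2
    print(lr_test(loglik_constrained=-5132.4, loglik_free=-5127.9, df=2))
```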
Liu, Junhui; Brown, Terran; Chen, Jianshen; Ali, Usama; Hou, Likun; Costanzo, Kate – Partnership for Assessment of Readiness for College and Careers, 2016
The Partnership for Assessment of Readiness for College and Careers (PARCC) is a state-led consortium working to develop next-generation assessments that measure student progress toward college and career readiness more accurately than previous assessments. The PARCC assessments include both English Language Arts/Literacy (ELA/L) and…
Descriptors: Testing, Achievement Tests, Test Items, Test Bias
Rogers, Angela – Mathematics Education Research Group of Australasia, 2013
As we move into the 21st century, educationalists are exploring the myriad possibilities associated with Computer-Based Assessment (CBA). At first glance this mode of assessment seems to provide many exciting opportunities in the mathematics domain, yet one must question the validity of CBA and whether our school systems, students and teachers…
Descriptors: Mathematics Tests, Student Evaluation, Computer Assisted Testing, Test Validity
Peer reviewed
PDF on ERIC
Colwell, Nicole Makas – Journal of Education and Training Studies, 2013
This paper highlights the current findings and issues regarding the role of computer-adaptive testing in test anxiety. The computer-adaptive test (CAT) proposed by one of the Common Core consortia brings these issues to the forefront. Research has long indicated that test anxiety impairs student performance. More recent research indicates that…
Descriptors: Test Anxiety, Computer Assisted Testing, Evaluation Methods, Standardized Tests
Peer reviewed
Direct link
Flowers, Claudia; Kim, Do-Hong; Lewis, Preston; Davis, Violeta Carmen – Journal of Special Education Technology, 2011
This study examined the academic performance and preference of students with disabilities for two types of test administration conditions, computer-based testing (CBT) and pencil-and-paper testing (PPT). Data from a large-scale assessment program were used to examine differences between CBT and PPT academic performance for third to eleventh grade…
Descriptors: Testing, Test Items, Effect Size, Computer Assisted Testing
Peer reviewed
Direct link
Pomplun, Mark; Custer, Michael – Applied Measurement in Education, 2005
In this study, we investigated possible context effects when students chose to defer items and answer those items later during a computerized test. In 4 primary school reading tests, 126 items were studied. Logistic regression analyses identified 4 items across 4 grade levels as statistically significant. However, follow-up analyses indicated that…
Descriptors: Psychometrics, Reading Tests, Effect Size, Test Items
Peer reviewed
Direct link
Pomplun, Mark; Ritchie, Timothy – Journal of Educational Computing Research, 2004
This study investigated the statistical and practical significance of context effects for items randomized within testlets for administration during a series of computerized non-adaptive tests. One hundred and twenty-five items from four primary school reading tests were studied. Logistic regression analyses identified from one to four items for…
Descriptors: Psychometrics, Context Effect, Effect Size, Primary Education
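Both Pomplun studies above rely on logistic regression to flag item context effects: item correctness is regressed on a matching variable (such as total score) plus an indicator for the administration condition, and a statistically significant indicator coefficient flags the item. A minimal sketch under those assumptions, using simulated data rather than data from the cited studies:

```python
# Minimal sketch of a logistic-regression check for an item context effect.
# Simulated data; a significant coefficient on the condition indicator (x2)
# would flag the item for a possible context effect.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 500
total_score = rng.normal(0.0, 1.0, n)             # matching / ability proxy
deferred = rng.integers(0, 2, n)                  # 1 = item answered later
logit = 0.2 + 1.0 * total_score + 0.0 * deferred  # no true context effect here
correct = rng.binomial(1, 1.0 / (1.0 + np.exp(-logit)))

X = sm.add_constant(np.column_stack([total_score, deferred]))
result = sm.Logit(correct, X).fit(disp=0)
print(result.summary2())
```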
Peer reviewed
Direct link
Chen, Shu-Ying; Ankenman, Robert D. – Journal of Educational Measurement, 2004
The purpose of this study was to compare the effects of four item selection rules--(1) Fisher information (F), (2) Fisher information with a posterior distribution (FP), (3) Kullback-Leibler information with a posterior distribution (KP), and (4) completely randomized item selection (RN)--with respect to the precision of trait estimation and the…
Descriptors: Test Length, Adaptive Testing, Computer Assisted Testing, Test Selection
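The first of the four rules compared here, maximum Fisher information selection, has a compact form under a two-parameter logistic (2PL) model: an item's information at ability theta is a^2 * P * (1 - P), and the next item administered is the unused item that maximizes it. The sketch below assumes the 2PL parameterization and uses made-up item parameters, not values from the cited study.

```python
# Minimal sketch of maximum Fisher information item selection under a 2PL model.
# Item parameters and the ability estimate are illustrative assumptions.
import numpy as np

def item_information(theta: float, a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """2PL Fisher information at theta for each item: a^2 * P * (1 - P)."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def select_next_item(theta: float, a, b, administered: set) -> int:
    """Index of the not-yet-administered item with maximum information at theta."""
    info = item_information(theta, np.asarray(a, float), np.asarray(b, float))
    if administered:
        info[list(administered)] = -np.inf    # mask items already given
    return int(np.argmax(info))

if __name__ == "__main__":
    a = [1.2, 0.8, 1.5, 1.0]   # discriminations
    b = [-0.5, 0.0, 0.7, 1.2]  # difficulties
    print(select_next_item(theta=0.3, a=a, b=b, administered={2}))
```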
Peer reviewed
PDF on ERIC
Truell, Allen D.; Zhao, Jensen J.; Alexander, Melody W. – Journal of Career and Technical Education, 2005
The purposes of this study were to determine if there is a significant difference in postsecondary business student scores and test completion time based on settable test item exposure control interface format, and to determine if there is a significant difference in student scores and test completion time based on settable test item exposure…
Descriptors: College Students, Scores, Tests, Gender Differences
Peer reviewed
Direct link
Nietfeld, John L.; Enders, Craig K.; Schraw, Gregory – Educational and Psychological Measurement, 2006
Researchers studying monitoring accuracy currently use two different indexes to estimate accuracy: relative accuracy and absolute accuracy. Using Monte Carlo procedures, the authors compared the distributional properties of two measures of monitoring accuracy that fit within these categories. They manipulated the accuracy of judgments (i.e., chance…
Descriptors: Monte Carlo Methods, Test Items, Computation, Metacognition
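The two index families named here are usually computed from item-level confidence judgments paired with item correctness: relative accuracy as a within-person association between the two, and absolute accuracy as the discrepancy between confidence and performance. The specific formulas below are illustrative assumptions, not necessarily the indices used in the cited study.

```python
# Minimal sketch of one common operationalization of relative and absolute
# monitoring accuracy; other indices (e.g., the gamma correlation) are also used.
import numpy as np

def relative_accuracy(confidence: np.ndarray, correct: np.ndarray) -> float:
    """Pearson correlation between item-level confidence (0-1) and correctness (0/1)."""
    return float(np.corrcoef(confidence, correct)[0, 1])

def absolute_accuracy(confidence: np.ndarray, correct: np.ndarray) -> float:
    """Mean absolute discrepancy between confidence and performance (0 = perfectly calibrated)."""
    return float(np.mean(np.abs(confidence - correct)))

if __name__ == "__main__":
    conf = np.array([0.9, 0.6, 0.8, 0.3, 0.7])
    corr = np.array([1, 0, 1, 0, 1])
    print(relative_accuracy(conf, corr), absolute_accuracy(conf, corr))
```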
Peer reviewed
PDF on ERIC
Puhan, Gautam; Boughton, Keith A.; Kim, Sooyeon – ETS Research Report Series, 2005
The study evaluated the comparability of two versions of a teacher certification test: a paper-and-pencil test (PPT) and a computer-based test (CBT). Standardized mean difference (SMD) and differential item functioning (DIF) analyses were used as measures of comparability at the test and item levels, respectively. Results indicated that effect sizes…
Descriptors: Comparative Analysis, Test Items, Statistical Analysis, Teacher Certification
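The test-level comparability index mentioned here, a standardized mean difference (SMD), is the difference between the CBT and PPT score means scaled by a pooled standard deviation. The exact pooling convention and the simulated scores below are assumptions for illustration, not details or data from the cited study.

```python
# Minimal sketch of a standardized mean difference between CBT and PPT scores.
# Simulated scores; the pooling convention is an assumption.
import numpy as np

def standardized_mean_difference(x: np.ndarray, y: np.ndarray) -> float:
    """(mean(x) - mean(y)) divided by the pooled standard deviation."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * x.var(ddof=1) + (ny - 1) * y.var(ddof=1)) / (nx + ny - 2)
    return float((x.mean() - y.mean()) / np.sqrt(pooled_var))

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    ppt = rng.normal(70.0, 10.0, 400)  # paper-and-pencil scores (simulated)
    cbt = rng.normal(71.0, 10.0, 400)  # computer-based scores (simulated)
    print(standardized_mean_difference(cbt, ppt))
```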