ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	11
Since 2017 (last 10 years)	43
Since 2007 (last 20 years)	68

Descriptor

Scoring	114
Test Items	114
Test Reliability	114
Test Validity	70
Test Construction	58
Item Response Theory	32
Psychometrics	29
Item Analysis	26
Scores	22
Difficulty Level	21
Multiple Choice Tests	20
Testing	20
Test Bias	18
Mathematics Tests	16
Test Format	16
Foreign Countries	15
Achievement Tests	13
Computer Assisted Testing	13
Correlation	12
Interrater Reliability	12
Language Tests	12
Comparative Analysis	11
Elementary School Students	11
English (Second Language)	11
Higher Education	11
More ▼

Publication Type

Journal Articles	63
Reports - Research	57
Reports - Evaluative	29
Reports - Descriptive	16
Speeches/Meeting Papers	13
Numerical/Quantitative Data	10
Tests/Questionnaires	10
Guides - Non-Classroom	5
Guides - Classroom - Teacher	2
Information Analyses	2
Opinion Papers	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Guides - General	1
Reference Materials -…	1
More ▼

Education Level

Secondary Education	16
Elementary Education	14
Higher Education	13
High Schools	9
Postsecondary Education	9
Early Childhood Education	8
Middle Schools	8
Primary Education	7
Elementary Secondary Education	6
Intermediate Grades	6
Junior High Schools	6
Grade 4	5
Grade 3	4
Grade 5	4
Grade 7	4
Grade 6	3
Grade 1	2
Grade 2	2
Grade 8	2
Grade 9	2
Kindergarten	2
More ▼

Audience

Practitioners	3
Researchers	2
Teachers	2

Location

Florida	5
Nebraska	5
California	3
New Mexico	3
Canada	2
Alabama	1
Germany	1
Idaho	1
Iran	1
Israel	1
Maryland	1
Nebraska (Lincoln)	1
New York	1
North Dakota	1
Ohio	1
Oman	1
Taiwan	1
Tennessee	1
Texas	1
Turkey	1
United Kingdom (England)	1
United Kingdom (London)	1
Washington	1
West Virginia	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 114 results Save | Export

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Item Response Theory Modeling of the Verb Naming Test

Peer reviewed

Direct link

Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023

Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…

Descriptors: Item Response Theory, Psychometrics, Verbs, Naming

Differential Item Functioning Analysis of the Fundamental Concepts for Organic Reaction Mechanisms Inventory

Peer reviewed

Direct link

Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022

The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…

Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences

Can High-Dimensional Questionnaires Resolve the Ipsativity Issue of Forced-Choice Response Formats?

Peer reviewed

Direct link

Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021

Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…

Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring

The Competent Computational Thinking Test (cCTt): A Valid, Reliable and Gender-Fair Test for Longitudinal CT Studies in Grades 3-6

Peer reviewed

Direct link

Laila El-Hamamsy; María Zapata-Cáceres; Estefanía Martín-Barroso; Francesco Mondada; Jessica Dehler Zufferey; Barbara Bruno; Marcos Román-González – Technology, Knowledge and Learning, 2025

The introduction of computing education into curricula worldwide requires multi-year assessments to evaluate the long-term impact on learning. However, no single Computational Thinking (CT) assessment spans primary school, and no group of CT assessments provides a means of transitioning between instruments. This study therefore investigated…

Descriptors: Cognitive Tests, Computation, Thinking Skills, Test Validity

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Coefficient [beta] as Extension of KR-21 Reliability for Summed and Scaled Scores for Polytomously-Scored Tests

Peer reviewed

Direct link

Almehrizi, Rashid S. – Applied Measurement in Education, 2021

KR-21 reliability and its extension (coefficient [alpha]) gives the reliability estimate of test scores under the assumption of tau-equivalent forms. KR-21 reliability gives the reliability estimate for summed scores for dichotomous items when items are randomly sampled from an infinite pool of similar items (randomly parallel forms). The article…

Descriptors: Test Reliability, Scores, Scoring, Computation

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Development of a Protein Concept Inventory: A Proposal for Item Scoring and Responding

Peer reviewed
PDF on ERIC

Download full text

Güntay Tasçi – Science Insights Education Frontiers, 2024

The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…

Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology

A Mokken Scale Analysis of the Last Series of the Standard Progressive Matrices (SPM-LS)

Peer reviewed
PDF on ERIC

Download full text

Myszkowski, Nils – Journal of Intelligence, 2020

Raven's Standard Progressive Matrices (Raven 1941) is a widely used 60-item long measure of general mental ability. It was recently suggested that, for situations where taking this test is too time consuming, a shorter version, comprised of only the last series of the Standard Progressive Matrices (Myszkowski and Storme 2018) could be used, while…

Descriptors: Intelligence Tests, Psychometrics, Nonparametric Statistics, Item Response Theory

Using Existing Data to Inform Development of New Item Types. Research Report. ETS RR-20-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020

With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…

Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics

Partial Credit in Answer-Until-Correct Multiple-Choice Tests Deployed in a Classroom Setting

Peer reviewed

Direct link

Slepkov, Aaron D.; Godfrey, Alan T. K. – Applied Measurement in Education, 2019

The answer-until-correct (AUC) method of multiple-choice (MC) testing involves test respondents making selections until the keyed answer is identified. Despite attendant benefits that include improved learning, broad student adoption, and facile administration of partial credit, the use of AUC methods for classroom testing has been extremely…

Descriptors: Multiple Choice Tests, Test Items, Test Reliability, Scores

Exploring the Relationship between Optimal Methods of Item Scoring and Selection and Predictive Validity. Conference Paper

Direct link

Benton, Tom – Cambridge Assessment, 2018

One of the questions with the longest history in educational assessment is whether it is possible to increase the reliability of a test simply by altering the way in which scores on individual test items are combined to make the overall test score. Most usually, the score available on each item is communicated to the candidate within a question…

Descriptors: Test Items, Scoring, Predictive Validity, Test Reliability

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Direct link

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Journal of Psychoeducational…	8
ETS Research Report Series	5
Grantee Submission	5
Applied Psychological…	4
Nebraska Department of…	4
Applied Measurement in…	3
Online Submission	3
Educational Measurement:…	2
Educational and Psychological…	2
International Journal of…	2
Journal of Educational…	2
Language Testing	2
New Meridian Corporation	2
New Mexico Public Education…	2
OECD Publishing	2
ACT, Inc.	1
Advances in Health Sciences…	1
American Journal of…	1
American Language Review	1
Assessment	1
Assessment & Evaluation in…	1
Assessment and Evaluation in…	1
CBE - Life Sciences Education	1
Cambridge Assessment	1
College Board	1
More ▼

Schoen, Robert C.	7
Yang, Xiaotong	4
Anderson, Daniel	3
Bauduin, Charity	3
Paek, Insu	3
Stansfield, Charles W.	3
Burton, Richard F.	2
Dorans, Neil J.	2
Downey, Ronald G.	2
Guo, Hongwen	2
Haladyna, Thomas M.	2
Liu, Sicong	2
Segall, Daniel O.	2
Slepkov, Aaron D.	2
Albanese, Mark A.	1
Alderson, J. Charles	1
Aleamoni, Lawrence M.	1
Almehrizi, Rashid S.	1
Alqarni, Abdulelah Mohammed	1
Anderson, Paul S.	1
Ault, Marilyn	1
Aviad-Levitzky, Tami	1
Bae, Yunhee	1
Barbara Bruno	1
More ▼

SAT (College Admission Test)	4
ACT Assessment	3
Program for International…	2
Raven Progressive Matrices	2
Test of English as a Foreign…	2
ACT Interest Inventory	1
Advanced Placement…	1
Alberta Grade Twelve Diploma…	1
Autism Diagnostic Observation…	1
Clinical Evaluation of…	1
Computer Attitude Scale	1
Cornell Critical Thinking Test	1
Graduate Management Admission…	1
Graduate Record Examinations	1
International Association for…	1
International English…	1
Kaufman Test of Educational…	1
National Assessment of…	1
Preliminary Scholastic…	1
Progress in International…	1
Strengths and Difficulties…	1
Teaching and Learning…	1
Test of Nonverbal Intelligence	1
Test of Written English	1
Trends in International…	1
More ▼