Showing 1 to 15 of 357 results
Chang, Kuo-Feng – ProQuest LLC, 2022
This dissertation was designed to foster a deeper understanding of population invariance in the context of composite-score equating and to provide practitioners with guidelines for addressing score equity concerns at the composite-score level. Its purpose was threefold. The first was to compare different composite equating…
Descriptors: Test Items, Equated Scores, Methods, Design
Peer reviewed
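Population invariance of an equating function is often summarized with Dorans and Holland's RMSD index: the weighted root-mean-square difference between subgroup and total-group equated scores, in standard-deviation units of the target form. A minimal sketch follows; the linear equating functions and weights are illustrative toys, not results from the dissertation.

```python
# Hedged sketch of the RMSD population-invariance index for equating;
# all functions and numbers below are made up for illustration.
import numpy as np

def rmsd_invariance(x, e_total, e_subgroups, weights, sd_y):
    """Weighted RMS difference between subgroup and total-group equated
    scores at each raw score x, expressed in SD units of the target form."""
    sq = np.array([w * (e_g(x) - e_total(x)) ** 2
                   for e_g, w in zip(e_subgroups, weights)])
    return np.sqrt(sq.sum(axis=0)) / sd_y

e_total = lambda x: 1.02 * x + 0.5   # total-population equating
e_g1 = lambda x: 1.00 * x + 0.8      # subgroup 1 equating
e_g2 = lambda x: 1.05 * x + 0.1      # subgroup 2 equating
x = np.arange(0, 41)                 # raw-score scale 0-40
rmsd = rmsd_invariance(x, e_total, [e_g1, e_g2], [0.6, 0.4], sd_y=8.0)
print(rmsd.max())                    # worst-case invariance violation
```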
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study, a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that is appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
Peer reviewed
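The raw beta-uni statistic referenced above is the weighted difference in proportion-correct on a studied item between reference and focal groups, conditioned on a matching-subtest score. A minimal sketch follows; the standardization shown (dividing by the studied item's score standard deviation) is an illustrative assumption, not necessarily the article's rescaling, and the operational SIBTEST regression correction is omitted.

```python
# Hedged sketch of SIBTEST's beta-uni; the standardization is assumed,
# and the usual regression correction for the matching score is omitted.
import numpy as np

def beta_uni(matching, item, group):
    """matching: score on the matching subtest; item: 0/1 studied item;
    group: 'R' (reference) or 'F' (focal). Returns (beta, standardized)."""
    matching, item, group = map(np.asarray, (matching, item, group))
    betas, weights = [], []
    for k in np.unique(matching):
        r = item[(matching == k) & (group == "R")]
        f = item[(matching == k) & (group == "F")]
        if len(r) and len(f):                    # stratum has both groups
            betas.append(r.mean() - f.mean())
            weights.append(len(r) + len(f))
    w = np.array(weights) / sum(weights)
    beta = float(np.dot(w, betas))
    return beta, beta / item.std(ddof=1)         # assumed standardization

rng = np.random.default_rng(0)
match = rng.integers(0, 11, 400)
grp = np.where(rng.random(400) < 0.5, "R", "F")
item = rng.binomial(1, 0.5 + 0.02 * (grp == "R"), 400)  # mild advantage for R
print(beta_uni(match, item, grp))
```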
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing Examination. Feinberg compared four approaches for reporting pass-fail decisions to examinees with incomplete data on credentialing…
Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items
Peer reviewed
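As a concrete illustration of why the choice of approach matters, here is a hedged sketch of two simple decision rules for incomplete data; these stand in for, and do not reproduce, the four approaches Feinberg compared.

```python
# Two assumed-for-illustration rules for pass-fail with missing items:
# rule A scores missing responses as incorrect; rule B prorates the cut
# over the answered items only. The same response string can pass under
# one rule and fail under the other.
def pass_fail(scores, cut_proportion):
    """scores: list of 1 (correct), 0 (incorrect), or None (missing)."""
    answered = [s for s in scores if s is not None]
    raw = sum(answered)
    missing_as_wrong = raw / len(scores) >= cut_proportion          # rule A
    prorated = (len(answered) > 0 and
                raw / len(answered) >= cut_proportion)              # rule B
    return missing_as_wrong, prorated

print(pass_fail([1, 1, 0, 1, None, None, 1, 1], 0.70))  # (False, True)
```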
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
Peer reviewed
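For context, here is a minimal equipercentile-linking sketch under the strong assumption that the two samples are equivalent; the article's point is precisely that this assumption can fail, and cannot be checked, when the forms share no common items.

```python
# Hedged sketch of equipercentile linking between two forms taken by
# (assumed) equivalent groups; the toy data are simulated, not from
# the study's five operational forms.
import numpy as np

def equipercentile_link(x_scores, y_scores, x):
    """Map a form-X score to the form-Y score with the same percentile rank."""
    x_scores, y_scores = np.sort(x_scores), np.sort(y_scores)
    p = np.searchsorted(x_scores, x, side="right") / len(x_scores)
    return float(np.quantile(y_scores, min(p, 1.0)))

rng = np.random.default_rng(1)
form_x = rng.binomial(40, 0.60, size=2000)   # form X slightly harder
form_y = rng.binomial(40, 0.65, size=2000)
print(equipercentile_link(form_x, form_y, 24))  # roughly 26 on form Y
```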
von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023
Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…
Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit
Peer reviewed
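As a point of reference for the kind of limiting-distribution-based DIF procedure under scrutiny, here is a generic Mantel-Haenszel sketch; it is a standard method, not the authors' proposed approach.

```python
# Standard Mantel-Haenszel DIF statistic (generic sketch): the common
# odds ratio across matching-score strata, mapped to the ETS delta scale,
# where negative delta flags items disadvantaging the focal group.
import numpy as np

def mantel_haenszel_dif(matching, item, group):
    matching, item, group = map(np.asarray, (matching, item, group))
    num = den = 0.0
    for k in np.unique(matching):
        sel = matching == k
        a = np.sum(sel & (group == "R") & (item == 1))  # reference correct
        b = np.sum(sel & (group == "R") & (item == 0))  # reference incorrect
        c = np.sum(sel & (group == "F") & (item == 1))  # focal correct
        d = np.sum(sel & (group == "F") & (item == 0))  # focal incorrect
        n = a + b + c + d
        if n:
            num += a * d / n
            den += b * c / n
    alpha = num / den                   # MH common odds ratio
    return alpha, -2.35 * np.log(alpha) # ETS delta scale
```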
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Peer reviewed
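A toy illustration of the estimation effect described, using classical proportion-correct in place of a full IRT calibration: scoring missing responses as incorrect makes an item look harder, while dropping them makes it look easier.

```python
# Toy demonstration (not the study's IRT models) of how the treatment
# of missing responses moves an item-difficulty estimate.
import numpy as np

resp = np.array([1, 0, 1, np.nan, np.nan, 1, 0, 1, np.nan, 1])  # nan = missing

as_incorrect = np.nan_to_num(resp, nan=0.0).mean()  # missing scored 0
ignored = np.nanmean(resp)                          # missing dropped
print(f"p (missing = wrong):  {as_incorrect:.2f}")  # 0.50 -> item looks harder
print(f"p (missing ignored): {ignored:.2f}")        # 0.71 -> item looks easier
```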
Mario I. Suárez – Educational Studies: Journal of the American Educational Studies Association, 2024
The increase in youth's self-identification as trans in the United States and Canada has created new urgency in schools to meet the needs of these students, yet education survey researchers have yet to find ways to assess their educational outcomes based on sex and gender. In this critical systematic review, I provide an overview of surveys from…
Descriptors: Measures (Individuals), Sexual Identity, Identification (Psychology), LGBTQ People
Peer reviewed
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Peer reviewed | PDF on ERIC
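Sequential change detection of this kind is often built on CUSUM-style accumulation. A minimal sketch for monitoring one item's proportion-correct across administrations; the slack and alarm thresholds are illustrative, not the article's procedure.

```python
# Hedged CUSUM-style sketch: accumulate downward drift in an item's
# proportion-correct beyond a slack k, and alarm once the cumulative
# sum exceeds h. Parameter values are illustrative only.
def cusum_monitor(p_observed, p_target, k=0.01, h=0.05):
    s, alarms = 0.0, []
    for p in p_observed:
        s = max(0.0, s + (p_target - p) - k)  # only track performance drops
        alarms.append(s > h)
    return alarms

history = [0.72, 0.71, 0.73, 0.70, 0.66, 0.63, 0.61]  # drop after admin 4
print(cusum_monitor(history, p_target=0.72))
# [False, False, False, False, True, True, True]
```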
Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022
Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…
Descriptors: College Students, Student Evaluation, Tests, Test Items
Peer reviewed
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles
Peer reviewed | PDF on ERIC
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Peer reviewed
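A hedged sketch of the pseudo-equivalent-groups idea: reweight one testing group on background strata so its covariate mix matches the other's, then compare weighted item statistics. The single stratification variable below is hypothetical; the report's weighting model is richer.

```python
# Illustrative post-stratification weighting for pseudo-equivalent
# groups; stratum labels and scores are invented toy data.
import numpy as np

def peg_weights(strata_group, strata_reference):
    """Per-examinee weights making the group's strata mix match the reference."""
    strata_group = np.asarray(strata_group)
    strata_reference = np.asarray(strata_reference)
    ref = {s: np.mean(strata_reference == s) for s in np.unique(strata_reference)}
    grp = {s: np.mean(strata_group == s) for s in np.unique(strata_group)}
    return np.array([ref[s] / grp[s] for s in strata_group])

rp_strata = np.array(["lo", "lo", "hi", "hi", "hi"])  # remote-proctoring group
tc_strata = np.array(["lo", "hi", "lo", "hi"])        # 50/50 reference mix
rp_item = np.array([0, 1, 1, 1, 1])                   # item scores, RP group
w = peg_weights(rp_strata, tc_strata)
print(np.average(rp_item, weights=w))  # weighted item p-value: 0.75 (vs 0.80 raw)
```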
Janssen, Gerriet – Language Testing, 2022
This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…
Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty
Peer reviewed
Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020
A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…
Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items
Peer reviewed
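For readers unfamiliar with the Bookmark mechanics: the standard mapping takes the bookmarked item and solves for the ability at which it is answered correctly with the chosen response probability (often 2/3). A generic 2PL sketch with invented parameters, not the article's data:

```python
# Standard Bookmark cut-score mapping under a 2PL model; the item
# parameters below are illustrative.
import math

def bookmark_cut(a, b, rp=0.67, D=1.7):
    """Ability theta at which a 2PL item (slope a, difficulty b) is
    answered correctly with probability rp."""
    return b + math.log(rp / (1 - rp)) / (D * a)

# Panelist places the bookmark on an item with a = 1.1, b = 0.4.
print(round(bookmark_cut(1.1, 0.4), 2))  # theta cut of about 0.78
```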
Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022
Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…
Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory
Peer reviewed | PDF on ERIC
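A hedged sketch of what a hybrid threshold could look like: flag an item only when both a classical signal (observed proportion-correct drifting above baseline) and an IRT signal (responses exceeding model-expected probabilities) cross their cutoffs. The window, cutoffs, and 2PL model here are illustrative assumptions, not the study's values.

```python
# Illustrative hybrid CTT + IRT flagging rule for a possibly
# compromised item; all thresholds and data are invented.
import math

def p_2pl(theta, a, b, D=1.7):
    return 1.0 / (1.0 + math.exp(-D * a * (theta - b)))

def hybrid_flag(responses, thetas, a, b, p_baseline,
                ctt_cut=0.10, irt_cut=0.10):
    n = len(responses)
    p_obs = sum(responses) / n                            # CTT signal
    resid = sum(u - p_2pl(t, a, b)                        # IRT signal:
                for u, t in zip(responses, thetas)) / n   # mean residual
    return (p_obs - p_baseline > ctt_cut) and (resid > irt_cut)

# Recent window: mostly correct responses from low-ability examinees.
resp = [1, 1, 1, 0, 1, 1, 1, 1]
thetas = [-0.5, 0.1, -0.8, -0.2, 0.0, -1.0, 0.3, -0.4]
print(hybrid_flag(resp, thetas, a=1.0, b=0.5, p_baseline=0.55))  # True
```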
Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022
The aim of the study was to examine whether the common items in mixed-format tests (e.g., multiple-choice and essay items) contain parameter drift in test equating processes performed with the common-item nonequivalent groups design. In this study, which was carried out using a Monte Carlo simulation with a fully crossed design, the factors of test…
Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores
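One common item-parameter-drift check, shown here for orientation, is a robust-z on difficulty shifts between administrations; the study's Monte Carlo design examines drift in this spirit but does not prescribe this exact statistic.

```python
# Hedged robust-z sketch for flagging drifting common items; the
# difficulty estimates and the 1.645 cutoff are illustrative.
import numpy as np

def robust_z(b_old, b_new, crit=1.645):
    """Flag items whose difficulty shift is an outlier relative to the
    median and IQR of all shifts across the common-item set."""
    d = np.asarray(b_new) - np.asarray(b_old)
    iqr = np.percentile(d, 75) - np.percentile(d, 25)
    z = (d - np.median(d)) / (0.74 * iqr)
    return np.abs(z) > crit

b_old = [-1.20, -0.50, 0.00, 0.40, 1.10, 0.80]
b_new = [-1.15, -0.52, 0.10, 0.35, 1.95, 0.83]  # fifth item drifted upward
print(robust_z(b_old, b_new))  # only the fifth item is flagged
```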