ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	10
Since 2017 (last 10 years)	28
Since 2007 (last 20 years)	71

Descriptor

Test Construction	801
Testing Problems	801
Test Validity	234
Elementary Secondary Education	183
Test Items	162
Test Reliability	156
Higher Education	133
Achievement Tests	124
Testing	119
Standardized Tests	114
Educational Testing	111
Test Use	108
Educational Assessment	94
Multiple Choice Tests	90
Test Bias	90
Test Interpretation	86
Test Format	85
Evaluation Methods	82
Student Evaluation	80
Computer Assisted Testing	78
Criterion Referenced Tests	74
Foreign Countries	74
Testing Programs	74
Language Tests	71
Measurement Techniques	70

Education Level

Higher Education	23
Postsecondary Education	19
Elementary Secondary Education	13
Elementary Education	5
Secondary Education	5
High Schools	3
Adult Education	1
Grade 6	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Researchers	52
Practitioners	50
Teachers	20
Policymakers	4
Students	4
Administrators	3
Counselors	3
Parents	2
Community	1
Support Staff	1

Location

United Kingdom	9
Canada	7
United States	7
Australia	5
United Kingdom (Great Britain)	5
Illinois	4
Iran	4
United Kingdom (England)	4
China	3
Kentucky	3
Netherlands	3
New York	3
Ohio	3
Brazil	2
Colombia	2
Florida	2
Georgia	2
Germany	2
Israel	2
Japan	2
Michigan	2
New Jersey	2
North Carolina	2
Pennsylvania	2
South Africa	2

Laws, Policies, & Programs

Debra P v Turlington	2
Elementary and Secondary…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
Kentucky Education Reform Act…	1
Manpower Development and…	1
No Child Left Behind Act 2001	1
Rehabilitation Act 1973…	1

Showing 1 to 15 of 801 results

Scale Development: Identifying and Addressing Potential Validity Threats Linked with Online Piloting Using Paid-For Samples. Sage Research Methods: Doing Research Online

Zita Lysaght; Michael O'Leary; Angela Mazzone; Conor Scully – Sage Research Methods Cases, 2022

Since 2018, colleagues from two research centers at Dublin City University have been collaborating to develop a measurement scale to assess individuals' ability to identify workplace bullying. Having agreed on an operational definition of the construct, an item pool of 26 workplace bullying scenarios, that is, short descriptions of…

Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability

Principles for Minimizing Errors in Examination Papers and Other Educational Assessment Instruments

Peer reviewed
PDF on ERIC

Suto, Irenka; Ireland, Jo – International Journal of Assessment Tools in Education, 2021

Errors in examination papers and other assessment instruments can compromise fairness. For example, a history question containing an incorrect historical date could be impossible for students to answer. Incorrect instructions at the start of an examination could lead students to answer the wrong number of questions. As there is little research on…

Descriptors: Testing Problems, Educational Testing, Test Construction, Work Environment

Hurdles to Learning Assessment Quality: Their Detrimental Effects on Student Learning

Peer reviewed
PDF on ERIC

Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024

The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test administration and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…

Descriptors: Foreign Countries, College Faculty, College Students, Test Construction

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Let Their Voices be Heard: IELTS Candidates' Problems with the IELTS Academic Writing Test

Peer reviewed
PDF on ERIC

Arefsadr, Sajjad; Babaii, Esmat – TESL-EJ, 2023

According to the IELTS official website, IELTS candidates usually score lower in the IELTS Writing test than in the other language skills. This is disappointing for the many IELTS candidates who fail to get the overall band score they need. Surprisingly enough, few studies have addressed this issue. The present study, then, is aimed at shedding…

Descriptors: Second Language Learning, Language Tests, English (Second Language), Foreign Countries

Local Placement Test Retrofit and Building Language Assessment Literacy with Teacher Stakeholders: A Case Study from Colombia

Peer reviewed

Janssen, Gerriet – Language Testing, 2022

This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…

Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Peer reviewed

Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020

Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high-stakes consequences for the public. Denying graduation or forcing…

Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics

A Case Study of Washback and Test Preparation of the New Version of PTE Academic

Peer reviewed
PDF on ERIC

Yi Zou; Ying Zheng; Jingwen Wang – International Journal of Language Testing, 2025

The Pearson Test of English Academic (PTE-A), a widely used high-stakes language proficiency test for university admissions and migration purposes, underwent a notable change from a three-hour to a two-hour version in November 2021. The implementation of the new version has prompted inquiries into the washback effects on various stakeholders.…

Descriptors: Testing Problems, Test Preparation, High Stakes Tests, English (Second Language)

Item Calibration Methods with Multiple Subscale Multistage Testing

Peer reviewed

Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020

Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…

Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics

Evaluating Iranian L2 Teachers' Assessment Literacy for L2 Pragmatics by Applying the CEFR's Pragmatic Competence Model: Possible Sociocultural-Informed Solutions

Peer reviewed
PDF on ERIC

Ayad Kamalvand; Mohammad Javad Mohammadi – MEXTESOL Journal, 2024

Nearly all multidimensional models of communication competence have pragmatic competence at their core. Proper assessment of second language (L2) pragmatics makes many demands on L2 teachers, both in terms of understanding the construct and in language test development. Therefore, being assessment literate helps teachers in developing effective…

Descriptors: Rating Scales, Guidelines, Language Teachers, Language Proficiency

Challenges to the Cattell-Horn-Carroll Theory: Empirical, Clinical, and Policy Implications

Peer reviewed

Canivez, Gary L.; Youngstrom, Eric A. – Applied Measurement in Education, 2019

The Cattell-Horn-Carroll (CHC) taxonomy of cognitive abilities married John Horn and Raymond Cattell's Extended Gf-Gc theory with John Carroll's Three-Stratum Theory. While there are some similarities in arrangements or classifications of tasks (observed variables) within similar broad or narrow dimensions, other salient theoretical features and…

Descriptors: Taxonomy, Cognitive Ability, Intelligence, Cognitive Tests

Reconsidering the Assessment Policy: Practical Use of Liberal Multiple-Choice Tests (SAC Method)

Peer reviewed
PDF on ERIC

Cesur, Kursat – Educational Policy Analysis and Strategic Research, 2019

Examinees' performances are assessed using a wide variety of techniques. Multiple-choice (MC) tests are among the most frequently used. Nearly all standardized achievement tests make use of MC test items, and there is a variety of ways to score these tests. The study compares number-right and liberal scoring (SAC) methods. Mixed…

Descriptors: Multiple Choice Tests, Scoring, Evaluation Methods, Guessing (Tests)

Evaluating Research Reports on the Qualities of Tests of English Language Skills in Indonesian Schools: A Systematic Review

Peer reviewed
PDF on ERIC

Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025

The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveals that a number of evaluation studies have been done by Indonesian scholars in the…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

Differential Washback Effects of a High-Stakes Test on Students' English Learning Process: Evidence from a Large-Scale Stratified Survey in China

Peer reviewed

Dong, Manxia; Fan, Jason; Xu, Jian – Asia Pacific Journal of Education, 2023

Understanding of the differential washback effects of high-stakes tests on students' learning remains limited. This study attempts to fill this research gap by investigating the differential washback effects of the National Matriculation English Test (NMET) in China on students' English learning process across genders, grades and English…

Descriptors: Testing Problems, English (Second Language), Second Language Learning, Second Language Instruction

Source

Educational Measurement:…	25
Journal of Educational…	14
Educational and Psychological…	11
Measurement:…	11
Applied Measurement in…	6
Applied Psychological…	5
Educational Technology	5
Phi Delta Kappan	5
Evaluation and the Health…	4
Journal of Consulting and…	4
Mathematics Teacher	4
National Elementary Principal	4
Reading Horizons	4
American Journal of Mental…	3
Journal of Economic Education	3
Reading Teacher	3
American School Board Journal	2
Clearing House	2
Curriculum Review	2
Developmental Psychology	2
ERS Spectrum	2
ETS Research Report Series	2
Economics	2
Educational Assessment	2
Educational Evaluation and…	2

Author

Hambleton, Ronald K.	8
Popham, W. James	8
Green, Donald Ross	7
Lord, Frederic M.	5
Stocking, Martha L.	5
Weiss, David J.	5
Ebel, Robert L.	4
Roeber, Edward D.	4
Wainer, Howard	4
Bormuth, John R.	3
Carroll, John B.	3
Davey, Tim	3
Diamond, Esther E.	3
Fremer, John	3
Hiscox, Michael D.	3
Hogan, Thomas P.	3
Millman, Jason	3
Parshall, Cynthia G.	3
Powers, Donald E.	3
Reckase, Mark D.	3
Stricker, Lawrence J.	3
Wise, Steven L.	3
Aschbacher, Pamela R.	2

Publication Type

Journal Articles	264
Reports - Research	245
Speeches/Meeting Papers	159
Reports - Evaluative	126
Opinion Papers	109
Reports - Descriptive	77
Information Analyses	53
Guides - Non-Classroom	47
Tests/Questionnaires	28
Books	23
Guides - Classroom - Teacher	20
Collected Works - Proceedings	14
Collected Works - General	8
ERIC Publications	8
Numerical/Quantitative Data	8
Collected Works - Serials	6
ERIC Digests in Full Text	5
Legal/Legislative/Regulatory…	5
Reports - General	5
Dissertations/Theses -…	4
Guides - General	4
Historical Materials	4
Book/Product Reviews	3
Guides - Classroom - Learner	3
Reference Materials -…	3

Assessments and Surveys

National Assessment of…	22
SAT (College Admission Test)	6
Graduate Record Examinations	5
Wechsler Intelligence Scale…	5
California Achievement Tests	3
Comprehensive Tests of Basic…	3
Stanford Achievement Tests	3
Test of English as a Foreign…	3
ACT Assessment	2
Armed Services Vocational…	2
General Aptitude Test Battery	2
International English…	2
National Teacher Examinations	2
Program for International…	2
Sequential Tests of…	2
Teacher Performance…	2
Adaptive Behavior Scale	1
Advanced Placement…	1
Alabama High School…	1
Cognitive Assessment System	1
College Board Achievement…	1
Computer Anxiety Scale	1
Eysenck Personality Inventory	1
General Educational…	1
Graduate Management Admission…	1