ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	71

Descriptor

Comparative Analysis	169
Educational Testing	169
Test Results	46
Educational Assessment	36
Academic Achievement	35
Foreign Countries	35
Evaluation Methods	30
Achievement Tests	25
Standardized Tests	25
Student Evaluation	24
Program Effectiveness	23
Test Items	22
Educational Policy	19
Measurement Techniques	19
Psychometrics	19
Scores	19
Computer Assisted Testing	17
Mathematics Tests	17
Reading Tests	17
Test Interpretation	17
High Stakes Tests	16
Test Construction	16
Test Use	16
Elementary Secondary Education	15
Secondary Education	15
More ▼

Education Level

Elementary Secondary Education	42
Elementary Education	19
Higher Education	19
Postsecondary Education	13
Grade 8	12
Secondary Education	11
Grade 4	8
Grade 6	6
High Schools	5
Middle Schools	4
Grade 10	3
Grade 5	3
Adult Education	2
Early Childhood Education	2
Grade 11	2
Grade 7	2
Intermediate Grades	2
Junior High Schools	2
Grade 3	1
Primary Education	1
More ▼

Audience

Practitioners	4
Policymakers	3
Researchers	3
Teachers	2
Counselors	1
Students	1
Support Staff	1

Location

United Kingdom	10
United Kingdom (England)	8
Australia	7
United States	7
Florida	5
United Kingdom (Wales)	4
California	3
Canada	3
New Jersey	3
Indiana	2
Minnesota (Minneapolis)	2
North Carolina	2
Ohio	2
Sweden	2
United Kingdom (Great Britain)	2
Asia	1
Finland	1
France	1
Georgia	1
Hong Kong	1
Israel	1
Japan	1
Kansas	1
Kentucky	1
Maine	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	7
Elementary and Secondary…	1
Individuals with Disabilities…	1
Stewart B McKinney Homeless…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 169 results Save | Export

Establishing a Fair Cut Score for an In-House English Test: A Case Study on Integrating Two Standard-Setting Methods

Peer reviewed

Direct link

Suthathip Thirakunkovit – Language Testing in Asia, 2025

Establishing a cut score is a crucial aspect of the test development process since the selected cut score has the potential to impact students' performance outcomes and shape instructional strategies within the classroom. Therefore, it is vital for those involved in test development to set a cut score that is both fair and justifiable. This cut…

Descriptors: Cutting Scores, Culture Fair Tests, Language Tests, Test Construction

Application of Item Response Tree (IRTree) Models on Testing Data: Comparing Its Performance with Binary and Polytomous Item Response Models

Direct link

Yixi Wang – ProQuest LLC, 2020

Binary item response theory (IRT) models are widely used in educational testing data. These models are not perfect because they simplify the individual item responding process, ignore the differences among different response patterns, cannot handle multidimensionality that lay behind options within a single item, and cannot manage missing response…

Descriptors: Item Response Theory, Educational Testing, Data, Models

Does It Matter Whether One Takes a Test on an iPad or a Desktop Computer?

Peer reviewed

Direct link

Ling, Guangming – International Journal of Testing, 2016

To investigate possible iPad related mode effect, we tested 403 8th graders in Indiana, Maryland, and New Jersey under three mode conditions through random assignment: a desktop computer, an iPad alone, and an iPad with an external keyboard. All students had used an iPad or computer for six months or longer. The 2-hour test included reading, math,…

Descriptors: Educational Testing, Computer Assisted Testing, Handheld Devices, Computers

Research to Controversy in 10 Decades

Peer reviewed

Direct link

Baker, Eva L. – Educational Researcher, 2016

This article investigates the persistent and change elements of educational testing and assessment from 1920 to the present day. I show by examining the addresses and texts of American Educational Research Association presidents a continuing focus on schools, from early experiments and development up through applications in accountability systems.…

Descriptors: Research, Educational Testing, Presidents, Professional Associations

Comparability of Large-Scale Educational Assessments: Issues and Recommendations

Download full text

Berman, Amy I.; Haertel, Edward H.; Pellegrino, James W. – National Academy of Education, 2020

This National Academy of Education (NAEd) volume provides guidance to key stakeholders on how to accurately report and interpret comparability assertions concerning large-scale educational assessments as well as how to ensure greater comparability by paying close attention to key aspects of assessment design, content, and procedures. The goal of…

Descriptors: Educational Assessment, Educational Testing, Scores, Comparative Analysis

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

Using No-Stakes Educational Testing to Mitigate Summer Learning Loss: A Pilot Study. Research Report. ETS RR-14-21

Peer reviewed
PDF on ERIC

Download full text

Zaromb, Franklin; Adler, Rachel M.; Bruce, Kelly; Attali, Yigal; Rock, JoAnn – ETS Research Report Series, 2014

This study investigates the benefits of no-stakes educational testing during students' summer vacation as a strategy to mitigate summer learning loss. Fifty-one students in Grades 3-8 from the Every Child Valued (ECV) and Lawrence Community Center (LCC) summer programs in Lawrenceville, NJ, took short, online assessments throughout the summer,…

Descriptors: Educational Testing, Summer Programs, Grade 3, Grade 4

A Multicomponent Latent Trait Model for Diagnosis

Peer reviewed

Direct link

Embretson, Susan E.; Yang, Xiangdong – Psychometrika, 2013

This paper presents a noncompensatory latent trait model, the multicomponent latent trait model for diagnosis (MLTM-D), for cognitive diagnosis. In MLTM-D, a hierarchical relationship between components and attributes is specified to be applicable to permit diagnosis at two levels. MLTM-D is a generalization of the multicomponent latent trait…

Descriptors: Mathematics Achievement, Achievement Tests, Item Response Theory, Measurement

Findings from the 2012 West Virginia Online Writing Scoring Comparability Study

Download full text

Hixson, Nate; Rhudy, Vaughn – West Virginia Department of Education, 2013

Student responses to the West Virginia Educational Standards Test (WESTEST) 2 Online Writing Assessment are scored by a computer-scoring engine. The scoring method is not widely understood among educators, and there exists a misperception that it is not comparable to hand scoring. To address these issues, the West Virginia Department of Education…

Descriptors: Scoring Formulas, Scoring Rubrics, Interrater Reliability, Test Scoring Machines

Large-Scale Assessment, Locally-Developed Measures, and Automated Scoring of Essays: Fishing for Red Herrings?

Peer reviewed

Direct link

Condon, William – Assessing Writing, 2013

Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…

Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing

Interpreting Standardized Assessment Test Scores and Setting Performance Goals in the Context of Student Characteristics: The Case of the Major Field Test in Business

Peer reviewed

Direct link

Bielinska-Kwapisz, Agnieszka; Brown, F. William; Semenik, Richard – Journal of Education for Business, 2012

The Major Field Test in Business (MFT-B), a standardized assessment test of business knowledge among undergraduate business seniors, is widely used to measure student achievement. The Educational Testing Service, publisher of the assessment, provides data that allow institutions to compare their own MFT-B performance to national norms, but that…

Descriptors: Educational Testing, Academic Achievement, Field Tests, National Norms

Score Comparability for Language Minority Students on the Content Assessments Used by Two States. Research Report. ETS RR-11-27

Download full text

Young, John W.; Holtzman, Steven; Steinberg, Jonathan – Educational Testing Service, 2011

In this research investigation of score comparability for language minority students (English language learners [ELLs] and former English language learners), we examined 3 indicators of score comparability (reliability, internal test structure, and differential item functioning) for 4th and 8th grade students who took the NCLB-mandated content…

Descriptors: Language Minorities, Second Language Learning, Grade 8, Minority Group Students

A Comparison of Equating/Linking Using the Stocking-Lord Method and Concurrent Calibration with Mixed-Format Tests in the Non-Equivalent Groups Common-Item Design under IRT

Direct link

Tian, Feng – ProQuest LLC, 2011

There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…

Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis

Audio Adapted Assessment Data: Does the Addition of Audio to Written Items Modify the Item Calibration?

Direct link

Snyder, James – ProQuest LLC, 2010

This dissertation research examined the changes in item RIT calibration that occurred when adding audio to a set of currently calibrated RIT items and then placing these new items as field test items in the modified assessments on the NWEA MAP test platform. The researcher used test results from over 600 students in the Poway School District in…

Descriptors: Test Results, Test Items, Field Tests, Data Analysis

Policy Options for Turkey: A Critique of the Interpretation and Utilization of PISA Results in Turkey

Peer reviewed

Direct link

Gur, Bekir S.; Celik, Zafer; Ozoglu, Murat – Journal of Education Policy, 2012

In this article we provide a critique of the interpretation and utilization of Programme for International Student Assessment (PISA) results by the National Education Authorities in Turkey. First, we define and explain what OECD's PISA is. Second, we make an overview of the media coverage in Turkey of the PISA 2003 and 2006 results. Third, we…

Descriptors: Foreign Countries, Curriculum Development, Educational Quality, News Reporting

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

ProQuest LLC	17
Measurement:…	5
Ministerial Council on…	5
Journal of Technology,…	4
Applied Measurement in…	3
Assessment & Evaluation in…	3
Educational Measurement:…	3
Educational Research	3
Journal of Educational…	3
Research Papers in Education	3
Educational Research and…	2
Journal of Education for…	2
Psychology in the Schools	2
ACT, Inc.	1
Applied Psychological…	1
Assessing Writing	1
Black Issues in Higher…	1
British Journal of…	1
Bureau of Education,…	1
Canadian Journal of Education	1
Center on Education Policy	1
Centre for the Economics of…	1
Change: The Magazine of…	1
Children & Schools	1
Computers & Education	1
More ▼

Donovan, Jenny	3
Lennon, Melissa	3
Hutton, Penny	2
Llaudet, Elena	2
Morrissey, Noni	2
Newton, Paul E.	2
O'Connor, Gayl	2
Peterson, Paul E.	2
Yang, Xiangdong	2
Abbott, Judith A.	1
Adler, Rachel M.	1
Adrian, Mitchell	1
Ainley, John	1
Allen, Nancy	1
Amster, Harriett	1
Anderson, Kenneth E.	1
Angoff, William H.	1
Attali, Yigal	1
Badua, Frank	1
Baird, Jo-Anne	1
Baker, Eva L.	1
Ban, Jae-Chun	1
Banerjee, Manju	1
Banks, Kathleen	1
More ▼

Reports - Research	65
Journal Articles	62
Reports - Evaluative	35
Dissertations/Theses -…	17
Numerical/Quantitative Data	11
Opinion Papers	9
Reports - Descriptive	9
Dissertations/Theses	6
Information Analyses	6
Speeches/Meeting Papers	6
Tests/Questionnaires	4
Guides - Non-Classroom	3
Books	2
Dissertations/Theses -…	2
Historical Materials	2
Collected Works - General	1
Collected Works - Proceedings	1
Collected Works - Serials	1
Reference Materials -…	1
Reports - General	1
More ▼

Program for International…	8
SAT (College Admission Test)	7
National Assessment of…	6
ACT Assessment	4
Iowa Tests of Basic Skills	4
California Achievement Tests	3
Advanced Placement…	2
Metropolitan Achievement Tests	2
Metropolitan Readiness Tests	2
Wechsler Intelligence Scale…	2
California Test of Mental…	1
College Level Academic Skills…	1
Collegiate Assessment of…	1
Comprehensive Tests of Basic…	1
Differential Aptitude Test	1
Gates MacGinitie Reading Tests	1
General Aptitude Test Battery	1
General Educational…	1
Law School Admission Test	1
Major Field Achievement Test…	1
New Jersey College Basic…	1
Peabody Individual…	1
Praxis Series	1
Sequential Tests of…	1
Stanford Achievement Tests	1
More ▼