Shanmugam, S. Kanageswari Suppiah; Veloo, Arsaythamby; Md-Ali, Ruzlan – Diaspora, Indigenous, and Minority Education, 2021
This study examined the validity of a trilingual test as a test accommodation to assess Indigenous pupils' mathematical performance in Malaysia. The study employed two tests: a BM-only test with items written in the Malay language (BM) and a trilingual test, which had items written in BM and English, and an oral audio recording in their native Temiar…
Descriptors: Multilingualism, Testing Accommodations, Grade 5, Elementary School Students
Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017
The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…
Descriptors: Scores, Item Analysis, Classification, Decision Making
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2016
To measure the response style of acquiescence, researchers recommend the use of at least 15 items with heterogeneous content. Such an approach is consistent with its theoretical definition and is a substantial improvement over traditional methods. Nevertheless, measurement of acquiescence can be enhanced by two additional considerations: first, to…
Descriptors: Test Items, Response Style (Tests), Test Content, Measurement
Solheim, Oddny Judith; Lundetrae, Kjersti – Assessment in Education: Principles, Policy & Practice, 2018
Gender differences in reading seem to increase throughout schooling and then decrease or even disappear with age, but the reasons for this are unclear. In this study, we explore whether differences in the way "reading literacy" is operationalised can add to our understanding of varying gender differences in international large-scale…
Descriptors: Achievement Tests, Foreign Countries, Grade 4, Reading Achievement
Choi, Kilchan; Kao, Jenny C.; Rivera, Nichole M.; Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2018
This report is the third in a series considering career-readiness features within high school assessments. The goal of this study was to explore international comparisons by applying feature analysis to Korean assessment items. Twenty math test items from the Gyeonggi Province in South Korea along with performance data from roughly 4,000 Grade 12…
Descriptors: Career Readiness, High School Students, Cross Cultural Studies, Test Items
Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education
Solanki, Vibhakumari; Evans, Brian R. – Curriculum and Teaching, 2020
The United States and the United Kingdom have used standardized high-stakes testing as a measurement of students' cognitive level to determine success in the 21st century. Standardized tests have given teachers guidance to help them determine what to teach students and how to teach to the test. With such increased emphasis on high-stakes…
Descriptors: High Stakes Tests, Standardized Tests, Foreign Countries, Academic Achievement
Davis, Doris Bitler – Teaching of Psychology, 2017
Providing two or more versions of multiple-choice exams has long been a popular strategy for reducing the opportunity for students to engage in academic dishonesty. While the results of studies comparing exam scores under different question-order conditions have been inconclusive, the potential importance of contextual cues to aid student recall…
Descriptors: Test Construction, Multiple Choice Tests, Sequential Approach, Cues
Weiss, Lawrence G.; Gregoire, Jacques; Zhu, Jianjun – Journal of Psychoeducational Assessment, 2016
Many Flynn effect (FE) studies compare scores across different editions of Wechsler's IQ tests. When construct changes are introduced by the test developers in the new edition, however, the presumed generational effects are difficult to untangle from changes due to test content. To remove this confound, we use the same edition of Wechsler…
Descriptors: Generational Differences, Intelligence Tests, Comparative Analysis, Scores
Baker, Eva L. – Educational Researcher, 2016
This article investigates the persistent and changing elements of educational testing and assessment from 1920 to the present day. By examining the addresses and texts of American Educational Research Association presidents, I show a continuing focus on schools, from early experiments and development up through applications in accountability systems.…
Descriptors: Research, Educational Testing, Presidents, Professional Associations
Wendt, Heike; Kasper, Daniel – Large-scale Assessments in Education, 2016
Background: In 2011 the Progress in International Reading Literacy Study (PIRLS) and the Trends in International Mathematics and Science Study (TIMSS) were conducted at fourth grade in a number of participating countries with a shared representative sample. In this article we investigate whether there are multidimensional proficiency patterns…
Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, International Assessment
Holme, Thomas – Journal of Chemical Education, 2014
Two different versions of "big ideas" rooted content maps have recently been published for general chemistry. As embodied in the content outline from the College Board, one of these maps is designed to guide curriculum development and testing for advanced placement (AP) chemistry. The Anchoring Concepts Content Map for general chemistry…
Descriptors: Chemistry, Advanced Placement, Curriculum Development, Curriculum Evaluation
Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012
Content balancing is one of the most important components of computerized adaptive testing (CAT), especially in K to 12 large-scale tests, where a complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models
Blazer, Christie – Research Services, Miami-Dade County Public Schools, 2010
This Information Capsule reviews research conducted on computer-based assessments. Advantages and disadvantages associated with computer-based testing programs are summarized and research on the comparability of computer-based and paper-and-pencil assessments is reviewed. Overall, studies suggest that for most students, there are few if any…
Descriptors: Comparative Analysis, Multiple Choice Tests, Computer Assisted Testing, Demography
Li, Xin; Yan, Wenfan – Online Submission, 2012
This study followed the comparative research mode of description, interpretation, juxtaposition and comparison. Based on the literature and data collected on the topic, the paper compared and analyzed the past, present and future of the APTHS (academic proficiency test for high schools) in the two countries. Some contemplations on the common issues…
Descriptors: High Schools, Achievement Tests, Foreign Countries, Comparative Analysis

