Publication Date
| In 2026 | 6 |
| Since 2025 | 2195 |
| Since 2022 (last 5 years) | 12710 |
| Since 2017 (last 10 years) | 33835 |
| Since 2007 (last 20 years) | 68326 |
Descriptor
| Foreign Countries | 30532 |
| Test Validity | 21728 |
| Scores | 18248 |
| Academic Achievement | 16912 |
| Test Construction | 16738 |
| Test Reliability | 15015 |
| Achievement Tests | 14839 |
| Standardized Tests | 14712 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13038 |
| Language Tests | 12549 |
Audience
| Practitioners | 5034 |
| Teachers | 3391 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
Location
| Turkey | 2815 |
| Australia | 2426 |
| Canada | 2269 |
| California | 1853 |
| United States | 1725 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1121 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Shelby J. Haberman; Sabine Meinck; Ann-Kristin Koop – Large-scale Assessments in Education, 2024
This paper extends existing work on teacher weighting in student-centered surveys by looking into aspects of practical implementation of deriving and using weights for teacher-centered analysis in the Trends in International Mathematics and Science Study (TIMSS) and the Progress in International Reading Literacy Study (PIRLS). The formal…
Descriptors: Elementary Secondary Education, Foreign Countries, Achievement Tests, Mathematics Achievement
Gong, Kaixuan – Asian-Pacific Journal of Second and Foreign Language Education, 2023
The extensive use of automated speech scoring in large-scale speaking assessment can be revolutionary not only to test design and rating, but also to the learning and instruction of speaking based on how students and teachers perceive and react to this technology. However, its washback remained underexplored. This mixed-method study aimed to…
Descriptors: Second Language Learning, Language Tests, English (Second Language), Automation
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Kohen, Zehavit; Gharra-Badran, Yasmin – Teaching Mathematics and Its Applications, 2023
Mathematical modelling is a vital competency for students of all ages. In this study, we aim to fill the research gap about valid and reliable tools for assessing and grading mathematical modelling problems, particularly those reflecting multiple steps of the modelling cycle. We present in this paper the design of a reliable and valid assessment…
Descriptors: Scoring Rubrics, Mathematical Models, Test Construction, Test Validity
Albert Weideman; Tobie van Dyk – Language Teaching Research Quarterly, 2023
This contribution investigates gains in technical economy in measuring language ability by considering one recurrent interest of JD Brown: cloze tests. In the various versions of the Test of Academic Literacy Levels (TALL), its Sesotho and Afrikaans (Toets van Akademiese Geletterdheidsvlakke -- TAG) counterparts, as well as related other tests…
Descriptors: Language Skills, Language Aptitude, Cloze Procedure, Reading Tests
Deniz Arslan; Ömer Faruk Tamul; Murat Dogan Sahin; Ugur Sak – Journal of Pedagogical Research, 2023
An examination of gender-related differential item functioning was conducted on the verbal subtests of the Anadolu-Sak Intelligence Scale. Analyses were conducted using the scale standardization data (N = 4641). A Mantel-Haenszel statistic was used to detect differential item functioning (DIF). A total of 58 verbal analogical reasoning items, 20…
Descriptors: Foreign Countries, Intelligence Tests, Gender Bias, Gender Differences
Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023
Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…
Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability
NWEA, 2022
This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…
Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency
Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025
The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…
Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing
Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022
We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…
Descriptors: Science Tests, Test Validity, Test Items, Test Construction
Jila Niknejad; Margaret Bayer – International Journal of Mathematical Education in Science and Technology, 2025
In Spring 2020, the need for redesigning online assessments to preserve integrity became a priority to many educators. Many of us found methods to proctor examinations using Zoom and proctoring software. Such examinations pose their own issues. To reduce the technical difficulties and cost, many Zoom proctored examination sessions were shortened;…
Descriptors: Mathematics Instruction, Mathematics Tests, Computer Assisted Testing, Computer Software
Marjo Sirén; Sari Sulkunen – Scandinavian Journal of Educational Research, 2025
This study examined which aspects of critical literacy are focused on in the reading literacy assessment for the Programme for International Student Assessment (PISA) 2018 and what kinds of texts are related to the critical literacy items in the test. Based on theory-oriented qualitative content analysis, critical literacy items in PISA…
Descriptors: International Assessment, Achievement Tests, Foreign Countries, Secondary School Students
Eko Suhartoyo; Rida Afrilyasanti; Nur Mukminatien – Turkish Online Journal of Distance Education, 2025
In this paper, we investigated the impact of an online classroom-based reading assessment on implementing practices in reading instruction among 30 EFL learners in an intermediate reading course at a public university in East Java, Indonesia. Our study aimed to develop an online classroom-based reading assessment and evaluate its efficacy in…
Descriptors: Student Evaluation, Computer Assisted Testing, Reading Tests, Reading Instruction
Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool are still questionable, so a more objective evaluation is needed. The objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…
Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity
Mark Feng Teng; Yachong Cui – Journal of Research in Reading, 2025
Background: Vocabulary knowledge (VK) and morphological awareness (MA) are crucial linguistic variables for reading comprehension. However, the extent to which MA subskills are intertwined with different facets of VK in their contributions to reading comprehension, and how MA influences the reading abilities of English as a foreign language (EFL)…
Descriptors: Vocabulary Development, Morphology (Languages), Reading Comprehension, Second Language Learning

