Publication Date
| In 2026 | 8 |
| Since 2025 | 2276 |
| Since 2022 (last 5 years) | 12791 |
| Since 2017 (last 10 years) | 33916 |
| Since 2007 (last 20 years) | 68407 |
Descriptor
| Foreign Countries | 30560 |
| Test Validity | 21743 |
| Scores | 18256 |
| Academic Achievement | 16928 |
| Test Construction | 16756 |
| Test Reliability | 15028 |
| Achievement Tests | 14859 |
| Standardized Tests | 14720 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13042 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3393 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 978 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2822 |
| Australia | 2426 |
| Canada | 2270 |
| California | 1854 |
| United States | 1726 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1122 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Jennifer D. Cribbs; Juliana Utley – Mathematics Education Research Journal, 2024
Given the importance of mathematics identity for students continued participation and engagement with mathematics, it is important for educators and researchers to be able to explore students' mathematics identity development. However, an instrument with validity evidence that can be used to explore mathematics identity efficiently and with groups…
Descriptors: Mathematics Education, Self Concept, Test Construction, Middle School Students
Daniel M. Settlage; Jim R. Wollscheid – Journal of the Scholarship of Teaching and Learning, 2024
The examination of the testing mode effect has received increased attention as higher education has shifted to remote testing during the COVID-19 pandemic. We believe the testing mode effect consists of four components: the ability to physically write on the test, the method of answer recording, the proctoring/testing environment, and the effect…
Descriptors: College Students, Macroeconomics, Tests, Answer Sheets
Melahat Çelik; Mustafa Dogru – Journal of Turkish Science Education, 2024
This study aimed to develop a valid and reliable scale that reveals secondary school pupils' attitudes toward science literacy. The survey model, one of the quantitative research methods, was used in the study. The validity and reliability study of the scale was carried out on secondary school pupils attending public schools in Turkey in the…
Descriptors: Test Construction, Secondary School Students, Student Attitudes, Scientific Literacy
Ella Anghel; Lale Khorramdel; Matthias von Davier – Large-scale Assessments in Education, 2024
As the use of process data in large-scale educational assessments is becoming more common, it is clear that data on examinees' test-taking behaviors can illuminate their performance, and can have crucial ramifications concerning assessments' validity. A thorough review of the literature in the field may inform researchers and practitioners of…
Descriptors: Educational Assessment, Test Validity, Test Items, Reaction Time
Yi-Hsin Chen – Applied Measurement in Education, 2024
This study aims to apply the differential item functioning (DIF) technique with the deterministic inputs, noisy "and" gate (DINA) model to validate the mathematics construct and diagnostic attribute profiles across American and Singaporean students. Even with the same ability level, every single item is expected to show uniform DIF…
Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, International Assessment
Yaoyao Zhang; Christina Ioanna Pappa; Wencan Zhang; Daniel Pittich – Cogent Education, 2024
This study aimed to evaluate the psychometric properties (factor structure, reliability and construct validity) of the Motivation of User-generated Technical Instructional Videos (MUTIV) scale. Employing a cross-sectional research design, two rounds of self-administered surveys were conducted in mainland China (N = 271/N = 318). Phase 1 involved…
Descriptors: Vocational Education, Video Technology, Educational Technology, Test Construction
Kristina Kintz Feldner – ProQuest LLC, 2024
The purpose of this study was to examine the relationship between the Measures of Academic Progress (MAP) reading Rasch unIT (RIT) score and State of Texas Assessment of Academic Readiness (STAAR) and Texas English Language Proficiency Assessment System (TELPAS) reading achievement among English learners in Grades 8 and 9. The study also…
Descriptors: Reading Achievement, English Language Learners, Grade 8, Grade 9
Angelina Lim; Sunanthiny Krishnan; Harjit Singh; Simon Furletti; Mahbub Sarkar; Derek Stewart; Daniel Malone – Advances in Health Sciences Education, 2024
Objective Structured Clinical Examinations (OSCEs) and Work Based Assessments (WBAs) are the mainstays of assessing clinical competency in health professions' education. Underpinned by the extrapolation inference in Kane's Validity Framework, the purpose of this study is to determine whether OSCEs translate to real life performance by comparing…
Descriptors: Allied Health Occupations Education, Clinical Experience, Performance Based Assessment, Vocational Evaluation
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale high data volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Yuyang Shen; Nicole Sankofa; Rachel U. Mun – Journal of Advanced Academics, 2025
This study employs critical content analysis to investigate school districts' identification process of gifted Emergent Bilinguals (EBLs) in North Texas. Despite the diverse population and significant presence of EBLs in Texas, this group is notably underrepresented in gifted programs. In order to uncover the inequities for gifted EBLs, this study…
Descriptors: Equal Education, Gifted Education, Scoring Rubrics, School Districts
Nurussaniah Nurussaniah; Punaji Setyosari; Dedi Kuswandi; Saida Ulfa – Journal of Baltic Science Education, 2025
The accurate assessment of analytical thinking in physics, particularly in magnetism, poses substantial challenges due to the limitations of conventional tools in measuring higher-order cognitive skills. This study aimed to validate an analytical skills test in physics, based on Bloom's Revised Taxonomy, with an emphasis on the dimensions of…
Descriptors: Physics, Science Tests, Science Instruction, Thinking Skills
Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022
The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…
Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences
Wicaksono, Azizul Ghofar Candra; Korom, Erzsébet – Participatory Educational Research, 2022
The accuracy of learning results relies on the evaluation and assessment. The learning goals, including problem solving ability must be aligned with the valid standardized measurement tools. The study on exploring the nature of problem-solving, framework, and assessment in the Indonesian context will make contributions to problem solving…
Descriptors: Problem Solving, Educational Research, Test Construction, Test Validity
New York State Education Department, 2023
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written so that testing…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
Saskia van Laar; Johan Braeken – International Journal of Testing, 2024
This study examined the impact of two questionnaire characteristics, scale position and questionnaire length, on the prevalence of random responders in the TIMSS 2015 eighth-grade student questionnaire. While there was no support for an absolute effect of questionnaire length, we did find a positive effect for scale position, with an increase of…
Descriptors: Middle School Students, Grade 8, Questionnaires, Test Length

Peer reviewed
Direct link
