Publication Date
In 2025 | 3 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 15 |
Since 2016 (last 10 years) | 54 |
Since 2006 (last 20 years) | 1443 |
Descriptor
Comparative Analysis | 2031 |
Educational Assessment | 2031 |
Educational Indicators | 988 |
Scores | 971 |
Public Schools | 869 |
Racial Differences | 828 |
Gender Differences | 824 |
Elementary School Students | 817 |
National Competency Tests | 811 |
Achievement Gap | 794 |
Academic Achievement | 518 |
Author
Linn, Robert L. | 8 |
Bracey, Gerald W. | 5 |
Seppanen, Patricia | 5 |
Wolf, Patrick J. | 4 |
Alstete, Jeffrey W. | 3 |
Bassett, Katherine | 3 |
Casserly, Michael | 3 |
Davis, Benjamin G. | 3 |
Donovan, Jenny | 3 |
Joe, Jilliam | 3 |
Johnson, Martin | 3 |
Audience
Policymakers | 228 |
Community | 151 |
Practitioners | 74 |
Researchers | 28 |
Teachers | 11 |
Administrators | 9 |
Students | 4 |
Parents | 3 |
Location
California | 69 |
United States | 64 |
Texas | 61 |
Florida | 56 |
Michigan | 45 |
North Carolina | 45 |
Australia | 44 |
Illinois | 42 |
Maryland | 41 |
Georgia | 40 |
New York | 40 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 3 |
Does not meet standards | 2 |
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Moses, Tim – Journal of Educational Measurement, 2022
One result of recent changes in testing is that previously established linking frameworks may not adequately address challenges in current linking situations. Test linking through equating, concordance, vertical scaling or battery scaling may not represent linkings for the scores of tests developed to measure constructs differently for different…
Descriptors: Measures (Individuals), Educational Assessment, Test Construction, Comparative Analysis
Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022
This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…
Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy
Kelly, Kate Tremain; Richardson, Mary; Isaacs, Talia – Assessment in Education: Principles, Policy & Practice, 2022
Comparative judgment is gaining popularity as an assessment tool, including for high-stakes testing purposes, despite relatively little research on the use of the technique. Advocates claim two main rationales for its use: that comparative judgment is valid because humans are better at comparative than absolute judgment, and because it distils the…
Descriptors: Comparative Analysis, Evaluation Methods, Evaluative Thinking, High Stakes Tests
Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025
As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…
Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy
Kyung-Mi O. – Language Testing in Asia, 2024
This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…
Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items
Gabriel Attar – ProQuest LLC, 2021
The historical development of the EER program, from its initiative as WSU's first Ed. D., to its growth with the Ph. D. and master's program, was well documented (Irwin, 1960). The importance of the EER doctoral program was established, in terms of its role in the COE, within WSU, and in the outside business and industry communities (Ozkan, 2008).…
Descriptors: Research Universities, Reputation, Doctoral Programs, Statistical Analysis
Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022
Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
Braun, Henry I.; Marion, Scott F. – Assessment in Education: Principles, Policy & Practice, 2022
State education systems in the U.S. experienced major disruptions due to the COVID-19 pandemic. Results from assessments administered during, and at the conclusion of, the 2020-21 school year indicate substantial 'unfinished learning', with the losses generally greater among disadvantaged and marginalized students. States' assessment systems are…
Descriptors: Accountability, Educational Assessment, COVID-19, Pandemics
Berman, Amy I.; Haertel, Edward H.; Pellegrino, James W. – National Academy of Education, 2020
This National Academy of Education (NAEd) volume provides guidance to key stakeholders on how to accurately report and interpret comparability assertions concerning large-scale educational assessments as well as how to ensure greater comparability by paying close attention to key aspects of assessment design, content, and procedures. The goal of…
Descriptors: Educational Assessment, Educational Testing, Scores, Comparative Analysis
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2018
This article critically reviews how diagnostic models have been conceptualized and how they compare to other approaches used in educational measurement. In particular, certain assumptions that have been taken for granted and used as defining characteristics of diagnostic models are reviewed and it is questioned whether these assumptions are the…
Descriptors: Criticism, Psychometrics, Diagnostic Tests, Educational Assessment
Crisp, Victoria – London Review of Education, 2017
This article discusses how comparability relates to current mainstream conceptions of validity, in the context of educational assessment. Relevant literature was used to consider the relationship between these concepts. The article concludes that, depending on the exact claims being made about the appropriate interpretations and uses of the…
Descriptors: Educational Assessment, Test Validity, Comparative Analysis, Scores
Melkamu Beyene Kitil; Amare Asgedom – Education and Urban Society, 2025
This research explores students' homework engagement across three dimensions: (a) cognitive, (b) emotional, and (c) behavioral activities, specifically contrasting private and public secondary schools in Addis Ababa. A qualitative research methodology was utilized, with reliability and validity assessments carried out through peer review, site…
Descriptors: Foreign Countries, Learner Engagement, Homework, Secondary School Students
Bartholomew, S. R.; Connolly, P. E. – Engineering Design Graphics Journal, 2018
The authors are investigating potential applications of adaptive comparative judgment (ACJ) across numerous environments and learning scenarios within the Purdue Polytechnic Institute as part of Purdue's efforts to transform the undergraduate learning experience. Six courses or program areas were selected for the study, involving a wide variation…
Descriptors: Educational Assessment, Judges, Feedback (Response), Summative Evaluation