Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 40 |
Descriptor
| Comparative Analysis | 65 |
| Educational Assessment | 65 |
| Test Validity | 33 |
| Validity | 22 |
| Academic Achievement | 21 |
| Evaluation Methods | 21 |
| Foreign Countries | 17 |
| Scores | 14 |
| Psychometrics | 13 |
| Elementary Secondary Education | 12 |
| Achievement Gains | 11 |
| More ▼ | |
Source
Author
| Daro, Phil | 2 |
| Linn, Robert L. | 2 |
| Steinberg, Jonathan | 2 |
| Wainer, Howard | 2 |
| Applegate, Brooks | 1 |
| Awad, Germine H. | 1 |
| Baartman, Liesbeth K. J. | 1 |
| Baird, Jo-Anne | 1 |
| Banta, Trudy W. | 1 |
| Bastiaens, Theo J. | 1 |
| Beaton, Albert E. | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 17 |
| Elementary Education | 11 |
| Grade 4 | 10 |
| Grade 8 | 8 |
| Middle Schools | 5 |
| Early Childhood Education | 4 |
| Grade 3 | 4 |
| Grade 5 | 4 |
| Postsecondary Education | 4 |
| Grade 2 | 3 |
| Grade 6 | 3 |
| More ▼ | |
Audience
Location
| United Kingdom | 6 |
| United States | 4 |
| United Kingdom (England) | 3 |
| New York | 2 |
| Taiwan | 2 |
| United Kingdom (Wales) | 2 |
| Australia | 1 |
| California | 1 |
| Canada | 1 |
| Georgia | 1 |
| Hong Kong | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 5 |
| Carl D Perkins Vocational and… | 1 |
| Workforce Investment Act 1998… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Kelly, Kate Tremain; Richardson, Mary; Isaacs, Talia – Assessment in Education: Principles, Policy & Practice, 2022
Comparative judgment is gaining popularity as an assessment tool, including for high-stakes testing purposes, despite relatively little research on the use of the technique. Advocates claim two main rationales for its use: that comparative judgment is valid because humans are better at comparative than absolute judgment, and because it distils the…
Descriptors: Comparative Analysis, Evaluation Methods, Evaluative Thinking, High Stakes Tests
Crisp, Victoria – London Review of Education, 2017
This article discusses how comparability relates to current mainstream conceptions of validity, in the context of educational assessment. Relevant literature was used to consider the relationship between these concepts. The article concludes that, depending on the exact claims being made about the appropriate interpretations and uses of the…
Descriptors: Educational Assessment, Test Validity, Comparative Analysis, Scores
Walser, Tamara M. – Practical Assessment, Research & Evaluation, 2014
There is increased emphasis on using experimental and quasi-experimental methods to evaluate educational programs; however, educational evaluators and school leaders are often faced with challenges when implementing such designs in educational settings. Use of a historical cohort control group design provides a viable option for conducting…
Descriptors: Quasiexperimental Design, Cohort Analysis, Control Groups, Educational Assessment
Paulhus, Delroy L.; Dubois, Patrick J. – Educational and Psychological Measurement, 2014
The overclaiming technique is a novel assessment procedure that uses signal detection analysis to generate indices of knowledge accuracy (OC-accuracy) and self-enhancement (OC-bias). The technique has previously shown robustness over varied knowledge domains as well as low reactivity across administration contexts. Here we compared the OC-accuracy…
Descriptors: Educational Assessment, Knowledge Level, Accuracy, Cognitive Ability
Awad, Germine H.; Patall, Erika A.; Rackley, Kadie R.; Reilly, Erin D. – Journal of Educational & Psychological Consultation, 2016
As the US continues to diversify, methods for accurately assessing human behavior must evolve. This paper offers multicultural research considerations at several stages of the research process for psychological research and consultation. Implications regarding the comparative research framework are discussed and suggestions are offered on how to…
Descriptors: Cultural Awareness, Psychological Studies, Control Groups, Educational Assessment
Tendeiro, Jorge N.; Meijer, Rob R. – Journal of Educational Measurement, 2014
In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is unclear on the basis of the existing literature which statistic to use. An overview of relatively simple existing nonparametric approaches to identify atypical response…
Descriptors: Educational Assessment, Test Validity, Scores, Statistical Analysis
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Johnson, Martin; Nadas, Rita – Educational Research and Evaluation, 2012
Comparability between different educational qualifications is an important issue within policy discourse in the UK. In this context, the comparability of qualification demands has been explored through the use of expert human judgement. The involvement of human judgement in estimating assessment demands has consequences for methodology. This…
Descriptors: Educational Assessment, Evaluation Methods, Comparative Analysis, Research Methodology
Kim, Do-Hong; Smith, JaneDiane – NHSA Dialog, 2010
This study provided preliminary evidence for the reliability and validity of "Teaching Strategies GOLD", a recently developed observational system for assessing young children's development and learning. The measurement properties of "Teaching Strategies GOLD" were compared with those of an older instrument, "The Creative…
Descriptors: Construct Validity, Validity, Toddlers, Infants
Chiu, Pui Chi – ProQuest LLC, 2012
This study examines student growth on mathematics and reading assessments across academic years (Spring 2006 through Spring 2009) using three different growth models: hierarchical linear model (HLM), value-added model (VAM), and student growth percentile model (SGP). Comparisons across these three growth models were conducted to investigate the…
Descriptors: Longitudinal Studies, Mathematics Tests, Reading Tests, Educational Assessment
Wang, Jing-Ru; Chen, Shin-Feng; Fang, I.; Chou, Ching-Ting – International Journal of Science Education, 2014
This study used a Chinese-language version of the Index of Science Reading Awareness to explore the science reading metacognition and comprehension of Taiwanese students. Structural equation modelling results confirmed the underlying model comprised three clusters of metacognitive knowledge: beliefs and confidence in science reading, knowledge of…
Descriptors: Foreign Countries, Metacognition, Reading Comprehension, Science Instruction
Somers, Marie-Andrée; Zhu, Pei; Jacob, Robin; Bloom, Howard – MDRC, 2013
In this paper, we examine the validity and precision of two nonexperimental study designs (NXDs) that can be used in educational evaluation: the comparative interrupted time series (CITS) design and the difference-in-difference (DD) design. In a CITS design, program impacts are evaluated by looking at whether the treatment group deviates from its…
Descriptors: Research Design, Educational Assessment, Time, Intervals
Peisner-Feinberg, Ellen S.; Schaaf, Jennifer M.; Hildebrandt, Lisa M.; Pan, Yi; Warnaar, Bethany L. – FPG Child Development Institute, 2015
The 2013-2014 North Carolina Pre-Kindergarten (NC Pre-K) Evaluation study was designed to examine the longitudinal outcomes through kindergarten for children who attended the Pre-K program, along with comparisons to previous cohorts of program attendees. A sample of 561 children was included in the study, with data gathered at the beginning and…
Descriptors: Kindergarten, Educational Quality, State Surveys, Preschool Children
Hughes, Gerunda B.; Daro, Phil; Holtzman, Deborah; Middleton, Kyndra – American Institutes for Research, 2013
Introduction: For decades, prior to the inception of the Common Core State Standards (CCSS), the National Assessment of Educational Progress (NAEP) was the only vehicle through which states could assess the progress of their students using a common metric. Now, 45 states, 4 U.S. territories, and the District of Columbia have adopted the CCSS to…
Descriptors: National Competency Tests, Mathematics Tests, Mathematics Achievement, State Standards
Hsieh, Mingchuan – Language Assessment Quarterly, 2013
The Yes/No Angoff and Bookmark method for setting standards on educational assessment are currently two of the most popular standard-setting methods. However, there is no research into the comparability of these two methods in the context of language assessment. This study compared results from the Yes/No Angoff and Bookmark methods as applied to…
Descriptors: Standard Setting (Scoring), Comparative Analysis, Language Tests, Multiple Choice Tests

Peer reviewed
Direct link
