Publication Date
| In 2026 | 12 |
| Since 2025 | 958 |
| Since 2022 (last 5 years) | 4567 |
| Since 2017 (last 10 years) | 10500 |
| Since 2007 (last 20 years) | 21963 |
Descriptor
| Test Validity | 21786 |
| Validity | 13791 |
| Test Reliability | 10864 |
| Foreign Countries | 9887 |
| Test Construction | 6897 |
| Factor Analysis | 5761 |
| Measures (Individuals) | 5633 |
| Predictive Validity | 5022 |
| Psychometrics | 4820 |
| Reliability | 4635 |
| Correlation | 4376 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1397 |
| Australia | 705 |
| Canada | 626 |
| China | 528 |
| United States | 439 |
| Indonesia | 389 |
| United Kingdom | 363 |
| Germany | 340 |
| California | 338 |
| Netherlands | 336 |
| Spain | 311 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Johnson, Sandra – Research Papers in Education, 2013
For a number of reasons, increasing reliance is being placed on teacher assessment in high-stakes contexts in many countries around the world. Simultaneously, countries that have for some time relied to greater or lesser degrees on teacher assessment for high-stakes purposes are in the process of questioning the validity of that reliance. In…
Descriptors: Reliability, Student Evaluation, High Stakes Tests, Evidence
Porayska-Pomsta, Kaska; Mavrikis, Manolis; D'Mello, Sidney; Conati, Cristina; Baker, Ryan S. J. d. – International Journal of Artificial Intelligence in Education, 2013
Research on the relationship between affect and cognition in Artificial Intelligence in Education (AIEd) brings an important dimension to our understanding of how learning occurs and how it can be facilitated. Emotions are crucial to learning, but their nature, the conditions under which they occur, and their exact impact on learning for different…
Descriptors: Intelligent Tutoring Systems, Psychological Patterns, Data Collection, Affective Measures
Haskell, Brett C. – ProQuest LLC, 2013
Research has shown that a student's level of institutional integration is a better predictor of college persistence than academic performance (Pascarella & Terenzini, 1980). Tinto (1975) proposed that institutional integration is the student's perception of his/her fit to the university he/she is attending. Student athletes are a unique…
Descriptors: Athletes, College Athletics, Social Integration, Academic Persistence
Meacham, Paul Douglas, Jr. – ProQuest LLC, 2013
The purpose of this study was to explore the effect of instrument-specific rater training on interrater reliability (IRR) and counseling skills performance differentiation. Strong IRR is of primary concern to effective program evaluation (McCullough, Kuhn, Andrews, Valen, Hatch, & Osimo, 2003; Schanche, Nielsen, McCullough, Valen, &…
Descriptors: Counselor Training, Interrater Reliability, Measures (Individuals), Counseling Techniques
Calvery, Suzannah Vallejo – ProQuest LLC, 2013
Mentoring research to date focuses on outcomes related to program goals and theoretical background, and almost all of these relate to the experience of the mentee. Very little research has been completed on the other side of the dyad--the mentor--despite the fact that mentor expectations and experience contribute significantly to the perceived…
Descriptors: Mentors, Self Efficacy, Intervention, Test Construction
Jones, Ryan Seth – Society for Research on Educational Effectiveness, 2013
Estimates of fidelity of implementation are essential to interpret the effects of educational interventions in randomized controlled trials (RCTs). While random assignment protects against many threats to validity, and therefore provides the best approximation to a true counterfactual condition, it does not ensure that the treatment condition…
Descriptors: Fidelity, Construct Validity, Program Implementation, Intervention
Snyder, Patricia A.; Hemmeter, Mary Louise; Fox, Lise; Bishop, Crystal Crowe; Miller, M. David – Grantee Submission, 2013
Fidelity assessment has received renewed attention in recent years, particularly as distinctions have been made in implementation science between intervention fidelity and implementation fidelity. Considering both types of fidelity has been recommended when developing fidelity instruments. In the present article, we describe development of the…
Descriptors: Fidelity, Generalizability Theory, Intervention, Models
Fielding-Wells, Jill – Mathematics Education Research Group of Australasia, 2013
Argumentation in mathematics teaching has potential to move students beyond tacit understanding of mathematical concepts and procedures towards articulation and justification of their ideas; a practice in which evidence is central. Design-based research was used to examine the nature of evidence used by a class of primary students through levels…
Descriptors: Mathematics Instruction, Mathematical Concepts, Elementary School Mathematics, Foreign Countries
Snyder, Patricia A.; Hemmeter, Mary Louise; Fox, Lise; Bishop, Crystal Crowe; Miller, M. David – Journal of Early Intervention, 2013
Fidelity assessment has received renewed attention in recent years, particularly as distinctions have been made in implementation science between intervention fidelity and implementation fidelity. Considering both types of fidelity has been recommended when developing fidelity instruments. In the present article, we describe development of the…
Descriptors: Fidelity, Psychometrics, Rating Scales, Program Implementation
Elklit, Ask; Nielsen, Louise Hjort; Lasgaard, Mathias; Duch, Christina – Journal of Loss and Trauma, 2013
Research on childhood posttraumatic stress disorder (PTSD) is sparse. This is partly due to the limited availability of empirically validated measures for children who are insecure readers. The present study examined the reliability and validity of a cartoon-based measure of PTSD symptoms in children exposed to a disaster. Cartoons were generated…
Descriptors: Cartoons, Posttraumatic Stress Disorder, Psychometrics, Symptoms (Individual Disorders)
Lim, Siew Yee; Chapman, Elaine – Educational Studies in Mathematics, 2013
Existing instruments designed to measure mathematics attitudes were too long, dated, or assessed with only western samples. To address this issue, a shortened version of the Attitudes Toward Mathematics Inventory (short ATMI) which measures four subscales--;enjoyment of mathematics, motivation to do mathematics, self-confidence in mathematics, and…
Descriptors: Validity, Achievement Tests, Motivation, Foreign Countries
Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas – International Journal of Research & Method in Education, 2013
Researchers interested in exploring substantive group differences are increasingly attending to bundles of items (or testlets): the aim is to understand how gender differences, for instance, are explained by differential performances on different types or bundles of items, hence differential bundle functioning (DBF). Some previous work has…
Descriptors: Mathematics Tests, Gender Differences, Mathematics Instruction, Mathematical Models
Garza, Christelle Fabiola; Gasquoine, Philip Gerard – Hispanic Journal of Behavioral Sciences, 2013
Implicit race/ethnic prejudice was assessed using Spanish- and English-language versions of an Implicit Association Test that used Hispanic/Anglo first names and pleasant/unpleasant words as stimuli. This test was administered to a consecutive sample of Mexican American adults residing in the Rio Grande Valley region of Texas of whom about…
Descriptors: Association Measures, Correlation, Mexican Americans, Racial Bias
Briesch, Amy M.; Kilgus, Stephen P.; Chafouleas, Sandra M.; Riley-Tillman, T. Chris; Christ, Theodore J. – Assessment for Effective Intervention, 2013
The current study served to extend previous research on scaling construction of Direct Behavior Rating (DBR) in order to explore the potential flexibility of DBR to fit various intervention contexts. One hundred ninety-eight undergraduate students viewed the same classroom footage but rated student behavior using one of eight randomly assigned…
Descriptors: Validity, Intervention, Measures (Individuals), Student Behavior
Karami, Hossein – Educational Research and Evaluation, 2013
The search for fairness in language testing is distinct from other areas of educational measurement as the object of measurement, that is, language, is part of the identity of the test takers. So, a host of issues enter the scene when one starts to reflect on how to assess people's language abilities. As the quest for fairness in language testing…
Descriptors: Language Skills, Language Tests, Testing, Culture Fair Tests

Peer reviewed
Direct link
