Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 117 |
| Since 2017 (last 10 years) | 228 |
| Since 2007 (last 20 years) | 561 |
Descriptor
| Evaluation Methods | 1408 |
| Test Reliability | 1408 |
| Test Validity | 954 |
| Student Evaluation | 339 |
| Test Construction | 305 |
| Foreign Countries | 217 |
| Higher Education | 183 |
| Measurement Techniques | 170 |
| Psychometrics | 168 |
| Elementary Secondary Education | 147 |
| Evaluation Criteria | 122 |
Audience
| Researchers | 74 |
| Practitioners | 72 |
| Teachers | 29 |
| Administrators | 18 |
| Policymakers | 11 |
| Students | 4 |
| Counselors | 3 |
| Support Staff | 3 |
| Community | 1 |
| Parents | 1 |
Location
| Australia | 24 |
| United Kingdom | 22 |
| Canada | 18 |
| Turkey | 16 |
| China | 14 |
| United States | 14 |
| California | 11 |
| Netherlands | 10 |
| Florida | 9 |
| Texas | 8 |
| United Kingdom (England) | 8 |
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016
A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…
Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy
Endedijk, Hinke M.; Cillessen, Antonius H. N. – International Journal of Behavioral Development, 2015
In preschool classes, sociometric peer ratings are used to measure children's peer relationships. The current study examined a computerized version of preschool sociometric ratings. The psychometric properties of computerized sociometric ratings and traditional peer ratings for preschoolers were compared. The distributions, inter-item…
Descriptors: Sociometric Techniques, Preschool Education, Preschool Children, Peer Relationship
Park, Siwon – Journal of Pan-Pacific Association of Applied Linguistics, 2017
This paper examines how different test methods may tap different aspects of second language knowledge. It employs multiple-choice (MC) and constructed-response (CR) items, which may yield distinct or convergent information, in computer-delivered testing of English. In order to examine the effects of test method, a…
Descriptors: Evaluation Methods, Second Language Learning, English (Second Language), Computer Assisted Testing
McGhan, Anna C.; Lerman, Dorothea C. – Journal of Applied Behavior Analysis, 2013
Prior research indicates that the relative effectiveness of different error-correction procedures may be idiosyncratic across learners, suggesting the potential benefit of an individualized assessment prior to teaching. In this study, we evaluated the reliability and utility of a rapid error-correction assessment to identify the least intrusive,…
Descriptors: Error Correction, Autism, Test Reliability, Test Validity
Nasuti, Gabriella; Stuart-Hill, Lynneth; Temple, Viviene A. – Journal of Intellectual & Developmental Disability, 2013
Background: The Six-Minute Walk Test (6MWT) has been used with clinical and healthy populations to assess functional capacity and cardiovascular fitness. The aim of this study was to determine the test-retest reliability of a modified-6MWT as well as concurrent validity of walk distance with peak oxygen uptake (VO₂ peak). Method:…
Descriptors: Test Validity, Evaluation Methods, Mental Retardation, Adults
Kaspar, Roman; Döring, Ottmar; Wittmann, Eveline; Hartig, Johannes; Weyland, Ulrike; Nauerth, Annette; Möllers, Michaela; Rechenbach, Simone; Simon, Julia; Worofka, Iberé – Vocations and Learning, 2016
Valid and reliable standardized assessment of nursing competencies is needed to monitor the quality of vocational education and training (VET) in nursing and evaluate learning outcomes for care work trainees with increasingly heterogeneous learning backgrounds. To date, however, the modeling of professional competencies has not yet evolved into…
Descriptors: Nursing Education, Geriatrics, Video Technology, Computer Assisted Testing
Ronfeldt, Matthew; Campbell, Shanyce L. – Educational Evaluation and Policy Analysis, 2016
Despite growing calls for more accountability of teacher education programs (TEPs), there is little consensus about how to evaluate them. This study investigates the potential for using observational ratings of program completers to evaluate TEPs. Drawing on statewide data on almost 9,500 program completers, representing 44 providers (183…
Descriptors: Teacher Education Programs, Program Effectiveness, Program Evaluation, Observation
Taubman--Ben-Ari, Orit; Ben Shlomo, Shirley – Research on Social Work Practice, 2016
Objective: Two studies examined the validity of using the Posttraumatic Growth Inventory (PTGI) to assess growth following the transition to grandparenthood as an aid for social workers seeking to promote strengths-based interventions for this population. Method: In Study 1 (n = 210 grandparent/offspring pairs), first-time grandparents'…
Descriptors: Grandparents, Intervention, Qualitative Research, Factor Structure
Huang, Xiaoping; Hu, Zhongfeng – Higher Education Studies, 2015
The main problem with validity in educational evaluation is that its conceptual framework is simply copied from educational measurement. A conceptual system of validity that fits the needs of the theory and practice of educational evaluation has not yet been established. According to the inherent attributive…
Descriptors: Test Validity, Educational Assessment, Evaluation Problems, Theory Practice Relationship
Thomas, Jason E.; Hornsey, Philip E. – Journal of Instructional Research, 2014
Formative Classroom Assessment Techniques (CAT) have been well-established instructional tools in higher education since their exposition in the late 1980s (Angelo & Cross, 1993). A large body of literature exists surrounding the strengths and weaknesses of formative CATs. Simpson-Beck (2011) suggested insufficient quantitative evidence exists…
Descriptors: Classroom Techniques, Nontraditional Education, Adult Education, Formative Evaluation
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Fives, Helenrose; Barnes, Nicole; Dacey, Charity; Gillis, Anna – Teacher Educator, 2016
We conducted a content analysis of 27 assessment textbooks to determine how assessment planning was framed in texts for preservice teachers. We identified eight assessment planning themes: alignment, assessment purpose and types, reliability and validity, writing goals and objectives, planning specific assessments, unpacking, overall assessment…
Descriptors: Student Evaluation, Lesson Plans, Knowledge Base for Teaching, Textbook Evaluation
Dirlikov, Benjamin; Younes, Laurent; Nebel, Mary Beth; Martinelli, Mary Katherine; Tiedemann, Alyssa Nicole; Koch, Carolyn A.; Fiorilli, Diana; Bastian, Amy J.; Denckla, Martha Bridge; Miller, Michael I.; Mostofsky, Stewart H. – Journal of Occupational Therapy, Schools & Early Intervention, 2017
This study presents construct validity for a novel automated morphometric and kinematic handwriting assessment, including (1) convergent validity, establishing reliability of automated measures with traditional manual-derived Minnesota Handwriting Assessment (MHA), and (2) discriminant validity, establishing that the automated methods distinguish…
Descriptors: Handwriting, Evaluation Methods, Children, Preadolescents
Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017
Houston's experience with the Educational Value-Added Assessment System® (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…
Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology
Turner, Mark; Davila-Ross, Marina – Psychology Teaching Review, 2015
The ability to reason scientifically and communicate research appropriately is central to psychological literacy. Scientific research has little value unless scientists are able to convey results and their consequences clearly to others. In this study, we outline a method of assessing the development of psychological literacy in undergraduate…
Descriptors: Interviews, Research Projects, Psychological Studies, Verbal Communication

