Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 117 |
| Since 2017 (last 10 years) | 228 |
| Since 2007 (last 20 years) | 561 |
Descriptor
| Evaluation Methods | 1408 |
| Test Reliability | 1408 |
| Test Validity | 954 |
| Student Evaluation | 339 |
| Test Construction | 305 |
| Foreign Countries | 217 |
| Higher Education | 183 |
| Measurement Techniques | 170 |
| Psychometrics | 168 |
| Elementary Secondary Education | 147 |
| Evaluation Criteria | 122 |
Audience
| Researchers | 74 |
| Practitioners | 72 |
| Teachers | 29 |
| Administrators | 18 |
| Policymakers | 11 |
| Students | 4 |
| Counselors | 3 |
| Support Staff | 3 |
| Community | 1 |
| Parents | 1 |
Location
| Australia | 24 |
| United Kingdom | 22 |
| Canada | 18 |
| Turkey | 16 |
| China | 14 |
| United States | 14 |
| California | 11 |
| Netherlands | 10 |
| Florida | 9 |
| Texas | 8 |
| United Kingdom (England) | 8 |
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Bill & Melinda Gates Foundation, 2012
No one has a bigger stake in teaching effectiveness than students. Nor are there any better experts on how teaching is experienced by its intended beneficiaries. Only recently have many policymakers and practitioners come to recognize that--when asked the right questions, in the right ways--students can be an important source of information on the…
Descriptors: Student Surveys, Student Attitudes, Feedback (Response), Test Validity
Few, Lauren R.; Miller, Joshua D.; Morse, Jennifer Q.; Yaggi, Kirsten E.; Reynolds, Sarah K.; Pilkonis, Paul A. – Assessment, 2010
Despite substantial research use, measures of the five-factor model (FFM) are infrequently used in clinical settings due, in part, to issues related to administration time and a reluctance to use self-report instruments. The current study examines the reliability and validity of the Five-Factor Model Score Sheet (FFMSS), which is a 30-item…
Descriptors: Personality Traits, Personality Problems, Test Reliability, Test Validity
Tierney, Robin D.; Simon, Marielle; Charland, Julie – Educational Forum, 2011
Knowing that grades can have long-term consequences for students, teachers voice concern about being fair in the grading process. However, their interpretations of fairness are varied and sometimes contradictory. This study looked at how teachers in one standards-based educational system determined secondary students' grades, focusing specifically…
Descriptors: Grades (Scholastic), Academic Achievement, Grading, Educational Principles
Baartman, Liesbeth K. J.; Prins, Frans J.; Kirschner, Paul A.; van der Vleuten, Cees P. M. – Evaluation and Program Planning, 2011
The goal of this article is to contribute to the validation of a self-evaluation method, which can be used by schools to evaluate the quality of their Competence Assessment Program (CAP). The outcomes of the self-evaluations of two schools are systematically compared: a novice school with little experience in competence-based education and…
Descriptors: Educational Innovation, Competency Based Education, Self Evaluation (Groups), Program Validation
Liu, Feng; Black, Erik; Algina, James; Cavanaugh, Cathy; Dawson, Kara – Journal of Interactive Online Learning, 2010
Parental involvement has been recognized as an important factor for student achievement in traditional school settings. The lack of research regarding the effect of parental involvement on student achievement in virtual schooling is, in part, due to the absence of a valid and reliable instrument to measure this construct. This paper provides an…
Descriptors: Parent Participation, Academic Achievement, Parent School Relationship, Educational Environment
Jiang, Bo; Xu, Xiaoying; Garcia, Alicia; Lewis, Jennifer E. – Journal of Chemical Education, 2010
The Test of Logical Thinking (TOLT) and the Group Assessment of Logical Thinking (GALT) are two of the instruments most widely used by science educators and researchers to measure students' formal reasoning abilities. Based on Piaget's cognitive development theory, formal thinking ability has been shown to be essential for student achievement in…
Descriptors: Test Bias, Test Reliability, Chemistry, Logical Thinking
Clarke-Midura, Jody; Dede, Chris – Journal of Research on Technology in Education, 2010
Despite three decades of advances in information and communications technology (ICT) and a generation of research on cognition and new pedagogical strategies, the field of assessment has not progressed much beyond paper-and-pencil item-based tests. Research has shown these instruments are not valid measures of sophisticated intellectual…
Descriptors: Technology Integration, Computer Assisted Testing, Student Evaluation, Evaluation Methods
Pardo-Ballester, Cristina – Language Assessment Quarterly, 2010
This study describes research used for supporting a validity argument for a new Spanish Listening Exam, whose scores are intended to place examinees into appropriate levels of university Spanish classes. This study contributes to the field of argument-based approaches to language assessment by implementing Bachman's (2005) assessment use argument…
Descriptors: Second Language Instruction, Test Reliability, Test Validity, Language Aptitude
Way, Walter D.; Murphy, Daniel; Powers, Sonya; Keng, Leslie – Pearson, 2012
Significant momentum exists for next-generation assessments to increasingly utilize technology to develop and deliver performance-based assessments. Many traditional challenges with this assessment approach still apply, including psychometric concerns related to performance-based tasks (PBTs), which include low reliability, efficiency of…
Descriptors: Task Analysis, Performance Based Assessment, Technology Uses in Education, Models
Feldman, Moshe; Lazzara, Elizabeth H.; Vanderbilt, Allison A.; DiazGranados, Deborah – Journal of Continuing Education in the Health Professions, 2012
Competency-based assessment and an emphasis on obtaining higher-level outcomes that reflect physicians' ability to demonstrate their skills have created a need for more advanced assessment practices. Simulation-based assessments provide medical education planners with tools to better evaluate the 6 Accreditation Council for Graduate Medical…
Descriptors: Performance Based Assessment, Physicians, Accuracy, High Stakes Tests
Marson, Stephen M.; Wei, Guo; Wasserman, Deborah – American Journal of Evaluation, 2009
Goal attainment scaling (GAS) has been considered to be one of the most versatile and appealing evaluation protocols available for human services. Aspects of the protocol that make the method so appealing to practitioners--that is, collaboratively working with individual clients to identify and assign weights to goals they will work to…
Descriptors: Human Services, Scaling, Test Reliability, Interrater Reliability
Memon, Muhammed Ashraf; Joughin, Gordon Rowland; Memon, Breda – Advances in Health Sciences Education, 2010
The purpose of this review was to examine the practice of oral assessment in postgraduate medical education in the context of the core assessment constructs of validity, reliability and fairness. Although oral assessment has a long history in the certification process of medical specialists and is a well-established part of such proceedings for a…
Descriptors: Medical Education, Certification, Exit Examinations, Licensing Examinations (Professions)
Sood, Vishal – Journal on Educational Psychology, 2013
For identifying children with four major kinds of verbal learning disabilities, viz. reading disability, speech and language comprehension disability, writing disability and mathematics disability, the present task was undertaken to construct and standardize a verbal learning disabilities checklist. This checklist was developed by keeping in view the…
Descriptors: Verbal Learning, Learning Disabilities, Children, Disability Identification
Squires, Jane K.; Waddell, Misti L.; Clifford, Jantina R.; Funk, Kristin; Hoselton, Robert M.; Chen, Ching-I – Topics in Early Childhood Special Education, 2013
Psychometric and utility studies on Social Emotional Assessment Measure (SEAM), an innovative tool for assessing and monitoring social-emotional and behavioral development in infants and toddlers with disabilities, were conducted. The Infant and Toddler SEAM intervals were the study focus, using mixed methods, including item response theory…
Descriptors: Psychometrics, Evaluation Methods, Social Development, Emotional Development
Lick, David J.; Schmidt, Karen M.; Patterson, Charlotte J. – Journal of Applied Measurement, 2011
According to two decades of research, parental sexual orientation does not affect overall child development. Researchers have not found significant differences between offspring of heterosexual parents and those of lesbian and gay parents in terms of their cognitive, psychological, or emotional adjustment. Still, there are gaps in the literature…
Descriptors: Parent Child Relationship, Measures (Individuals), Emotional Adjustment, Homosexuality