Lee, Jeong-Sook; Kim, Sung-Wan – Journal of Educational Computing Research, 2015
The purpose of this study is to develop and validate an evaluation tool of educational apps for smart education. Based on literature reviews, a potential model for evaluating educational apps was suggested. An evaluation tool consisting of 57 survey items was delivered to 156 students in middle and high schools. An exploratory factor analysis was…
Descriptors: Educational Technology, Courseware, Computer Software Evaluation, Test Construction
Amrein-Beardsley, Audrey; Holloway-Libell, Jessica; Cirell, Anna Montana; Hays, Alice; Chapman, Kathryn – Practical Assessment, Research & Evaluation, 2015
There is something incalculable about teacher expertise and whether it can be observed, detected, quantified, and as per current educational policies, used as an accountability tool to hold America's public school teachers accountable for that which they do (or do not do well). In this commentary, authors (all of whom are former public school…
Descriptors: Accountability, Educational Change, Educational Policy, Expertise
Wilcox, Bethany R.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
Standardized conceptual assessment represents a widely used tool for educational researchers interested in student learning within the standard undergraduate physics curriculum. For example, these assessments are often used to measure student learning across educational contexts and instructional strategies. However, to support the large-scale…
Descriptors: Science Instruction, Scientific Concepts, College Science, Physics
Wilkerson, Judy R. – Journal of Teacher Education, 2015
This commentary on the article titled "Examining the Internal Structure Evidence for the Performance Assessment for California Teachers: A Validation Study of the Elementary Literacy Teaching Event for Tier 1 Teacher Licensure" provides an overview of Performance Assessment for California Teachers (PACT), its relationship to edTPA and…
Descriptors: Teacher Evaluation, Performance Based Assessment, Evaluation Methods, Teacher Certification
Khan, R. Nazim – International Journal of Mathematical Education in Science and Technology, 2015
Open book assessment is not a new idea, but it does not seem to have gained ground in higher education. In particular, not much literature is available on open book examinations in mathematics and statistics in higher education. The objective of this paper is to investigate the appropriateness of open book assessments in a first-year business…
Descriptors: Evaluation Methods, Higher Education, Mathematics Tests, Statistics
Choque Olsson, Nora; Bölte, Sven – Journal of Autism and Developmental Disorders, 2014
There are few evaluated economic tools to assess change in autism. This study examined the inter-rater reliability of the Developmental Disabilities Children's Global Assessment Scale (DD-CGAS), and the OSU Autism Clinical Global Impression (OSU Autism CGI) in a European setting. Using these scales, 16 clinicians with multidisciplinary…
Descriptors: Evaluation Methods, Autism, Developmental Disabilities, Vignettes
Eliasson, Ann-Christin – Physical & Occupational Therapy in Pediatrics, 2012
Assessments used for both clinical practice and research should show evidence of validity and reliability for the target group of people. It is easy to agree with this statement, but it is not always easy to choose the right assessment for the right purpose. Recently there have been increasing numbers of studies which investigate further the…
Descriptors: Psychometrics, Test Construction, Test Reliability, Test Validity
Piotrowski, Jessica Taylor; Litman, Jordan A.; Valkenburg, Patti – Infant and Child Development, 2014
Epistemic curiosity (EC) is the desire to obtain new knowledge capable of either producing positive experiences of intellectual interest (I-type) or of reducing undesirable conditions of informational deprivation (D-type). Although researchers acknowledge that there are individual differences in young children's epistemic curiosity, there are…
Descriptors: Epistemology, Personality Traits, Knowledge Level, Young Children
Gargani, John; Strong, Michael – Journal of Teacher Education, 2015
In Gargani and Strong (2014), we describe The Rapid Assessment of Teacher Effectiveness (RATE), a new teacher evaluation instrument. Our account of the validation research associated with RATE inspired a review by Good and Lavigne (2015). Here, we reply to the main points of their review. We elaborate on the validity, reliability, theoretical…
Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Evaluation Methods
Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities
Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015
For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…
Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests
Alexander, Patricia A.; Dumas, Denis; Grossnickle, Emily M.; List, Alexandra; Firetto, Carla M. – Journal of Experimental Education, 2016
Relational reasoning is the foundational cognitive ability to discern meaningful patterns within an informational stream, but its reliable and valid measurement remains problematic. In this investigation, the measurement of relational reasoning unfolded in three stages. Stage 1 entailed the establishment of a research-based conceptualization of…
Descriptors: Cognitive Ability, Logical Thinking, Thinking Skills, Cognitive Processes
Moodie, Shannon; Daneri, Paula; Goldhagen, Samantha; Halle, Tamara; Green, Katie; LaMonte, Lauren – US Department of Health and Human Services, 2014
For children age birth to five, physical, cognitive, linguistic, and social-emotional growth and development occur at a rapid pace. While all children in this age range may not reach developmental milestones (e.g., smiling, saying first words, taking first steps) at the same time, development that does not happen within an expected timeframe can…
Descriptors: Young Children, Child Development, Screening Tests, Measurement Techniques
Ngo, Federick; Kwon, William W. – Research in Higher Education, 2015
Community college students are often placed in developmental math courses based on the results of a single placement test. However, concerns about accurate placement have recently led states and colleges across the country to consider using other measures to inform placement decisions. While the relationships between college outcomes and such…
Descriptors: Access to Education, Success, Community Colleges, Mathematics Education
Tseng, Jun-Jie – Computer Assisted Language Learning, 2016
Researchers have been keen to develop instruments for the assessment of teachers' self-perceived technological pedagogical content knowledge (TPACK); however, few studies have been conducted to validate such assessment tools through students' perspectives in the context of English as a foreign language (EFL). The purpose of this study was thus to…
Descriptors: English (Second Language), Second Language Learning, Pedagogical Content Knowledge, Educational Technology
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

