Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 10 |
Descriptor
| Student Evaluation | 19 |
| Test Reliability | 19 |
| Test Theory | 19 |
| Test Validity | 18 |
| Evaluation Methods | 10 |
| Psychometrics | 6 |
| Test Construction | 6 |
| Test Bias | 5 |
| Measures (Individuals) | 4 |
| Scientific Concepts | 4 |
| Standardized Tests | 4 |
| More ▼ | |
Source
Author
| Aktas, Mehtap | 1 |
| Asiret, Semih | 1 |
| Bachor, Dan G. | 1 |
| Barbera, Jack | 1 |
| Bers, Marina Umaschi | 1 |
| Bullock, Lyndal M. | 1 |
| Buxner, Sanlyn R. | 1 |
| Cason, Gerald J. | 1 |
| Cheng, Britte H. | 1 |
| Colker, Alexis M. | 1 |
| DeBarger, Angela | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 2 |
| Students | 1 |
| Teachers | 1 |
Location
| Australia | 1 |
| Turkey (Ankara) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H.? J. Lewandowski – Physical Review Physics Education Research, 2024
The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…
Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics
Simon, Molly N.; Prather, Edward E.; Buxner, Sanlyn R.; Impey, Chris D. – International Journal of Science Education, 2019
The discovery and characterisation of planets orbiting distant stars has shed light on the origin of our own Solar System. It is important that college-level introductory astronomy students have a general understanding of the planet formation process before they are able to draw parallels between extrasolar systems and our own Solar System. In…
Descriptors: Measures (Individuals), Test Validity, Test Reliability, Student Evaluation
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020
There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…
Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education
Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities
Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015
For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…
Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests
Barbera, Jack – Journal of Chemical Education, 2013
The Chemical Concepts Inventory (CCI) is a multiple-choice instrument
designed to assess the alternate conceptions of students in high school or first-semester college chemistry. The instrument was published in 2002 along with an analysis of its data from a test population. This study supports the initial analysis and expands on the psychometric…
Descriptors: Science Instruction, Secondary School Science, High Schools, College Science
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Keller, Christopher M.; Kros, John F. – Marketing Education Review, 2011
Measures of survey reliability are commonly addressed in marketing courses. One statistic of reliability is "Cronbach's alpha." This paper presents an application of survey reliability as a reflexive application of multiple-choice exam validation. The application provides an interactive decision support system that incorporates survey item…
Descriptors: Test Validity, Marketing, Test Reliability, Multiple Choice Tests
Martone, Andrea; Sireci, Stephen G. – Review of Educational Research, 2009
The authors (a) discuss the importance of alignment for facilitating proper assessment and instruction, (b) describe the three most common methods for evaluating the alignment between state content standards and assessments, (c) discuss the relative strengths and limitations of these methods, and (d) discuss examples of applications of each…
Descriptors: Teaching Methods, Alignment (Education), Student Evaluation, Curriculum Development
Herman, Geoffrey Lindsay – ProQuest LLC, 2011
Instructors in electrical and computer engineering and in computer science have developed innovative methods to teach digital logic circuits. These methods attempt to increase student learning, satisfaction, and retention. Although there are readily accessible and accepted means for measuring satisfaction and retention, there are no widely…
Descriptors: Grounded Theory, Delphi Technique, Concept Formation, Misconceptions
Peer reviewedWard, James Gordon – Peabody Journal of Education, 1981
Teachers need valid information to judge the types of programs, instruction, and colleges best suited to students. Teachers appear to support the use of standardized tests to provides some of that information. Abolishing such tests may lead to dependence on more subjective measures, resulting in inequities in placement and selection. (FG)
Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, Standardized Tests
Peer reviewedDouglas, Dan – Annual Review of Applied Linguistics, 1995
Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…
Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests
Bullock, Lyndal M.; And Others – 1988
Prompted by the increased use of behavior rating instruments in educational environments and evidence of confusion over the interpretation of labels designating behavior clusters, the present two-phase study analyzed 410 specific items contained in seven behavior rating instruments and investigated whether these items could be intuitively sorted…
Descriptors: Behavior Disorders, Behavior Rating Scales, Classroom Observation Techniques, Elementary Secondary Education
Peer reviewedBachor, Dan G. – Alberta Journal of Educational Research, 1990
Reviews medical and psychoeducational assessment models, illustrating need to expand assessment procedure database resulting from the measurement errors in traditional assessment practices. Introduces assessment model emphasizing collection of classroom-based information over time and evaluating data more critically. Suggests model facilitates…
Descriptors: Classroom Observation Techniques, Evaluation Methods, Evaluation Problems, Instructional Development
Cason, Gerald J.; And Others – 1983
Prior research in a single clinical training setting has shown Cason and Cason's (1981) simplified model of their performance rating theory can improve rating reliability and validity through statistical control of rater stringency error. Here, the model was applied to clinical performance ratings of 14 cohorts (about 250 students and 200 raters)…
Descriptors: Clinical Experience, Error of Measurement, Evaluation Methods, Higher Education
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
