Prevost, Luanna B.; Smith, Michelle K.; Knight, Jennifer K. – CBE - Life Sciences Education, 2016
Previous work has shown that students have persistent difficulties in understanding how central dogma processes can be affected by a stop codon mutation. To explore these difficulties, we modified two multiple-choice questions from the Genetics Concept Assessment into three open-ended questions that asked students to write about how a stop codon…
Descriptors: Science Instruction, Genetics, Scientific Concepts, Scoring
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability
Huang, Chiungjung – Educational and Psychological Measurement, 2009
This study examined the percentage of task-sampling variability in performance assessment via a meta-analysis. In total, 50 studies containing 130 independent data sets were analyzed. Overall results indicate that the percentage of variance for (a) differential difficulty of task was roughly 12% and (b) examinee's differential performance of the…
Descriptors: Test Bias, Research Design, Performance Based Assessment, Performance Tests

