NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 5 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Peer reviewed Peer reviewed
Direct linkDirect link
Paige E. Cervantes; Robert D. Gibbons; Lawrence A. Palinkas; Greta R. Conlon; Sarah M. Horwitz – Journal of Developmental and Physical Disabilities, 2025
Because autistic youth experience increased suicide risk and there are no suicide risk screening tools for this population, existing measures need to be evaluated and then modified with input from the autism community. This pilot study obtained feedback from autistic youth, caregivers, and autism specialist clinicians (N = 14) on the applicability…
Descriptors: Autism Spectrum Disorders, Suicide, Risk, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Takunori Terasawa; So Sudo; Takeshi Kajigaya; Ryosuke Aoyama; Ryuko Kubota – Current Issues in Language Planning, 2025
This paper examines recent reforms in English-language testing in Japan using a policy distraction framework. We identify the term 'washback (effect)' and other related discourses as major distractors and investigate how 'washback' discourses have functioned as political slogans or catchphrases in policy deliberation processes and how they have…
Descriptors: Testing Problems, English (Second Language), Second Language Learning, Second Language Instruction