Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
Descriptor
| Test Format | 5 |
| Test Reliability | 5 |
| Test Validity | 4 |
| Psychometrics | 3 |
| Foreign Countries | 2 |
| Item Response Theory | 2 |
| Language Tests | 2 |
| Test Items | 2 |
| Accountability | 1 |
| Alignment (Education) | 1 |
| Allied Health Occupations… | 1 |
| More ▼ | |
Source
| Anatomical Sciences Education | 1 |
| Discover Education | 1 |
| International Journal of… | 1 |
| Journal of Research in… | 1 |
| Language Assessment Quarterly | 1 |
Author
| Adam B. Wilson | 1 |
| Andrew S. Cale | 1 |
| Bin Tan | 1 |
| Brenda Kucha Anak Ganeng | 1 |
| Cecelia J. Vetter | 1 |
| Duyen Thi Bich Nguyen | 1 |
| Elisabetta Mazzullo | 1 |
| Elizabeth R. Agosto | 1 |
| Hung Tan Ha | 1 |
| Judy R. Wilkerson | 1 |
| Kyle A. Robertson | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 5 |
| Reports - Research | 3 |
| Information Analyses | 2 |
| Reports - Evaluative | 1 |
Education Level
| Higher Education | 3 |
| Postsecondary Education | 3 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025
This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…
Descriptors: Artificial Intelligence, Test Items, Automation, Test Format
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Andrew S. Cale; Elizabeth R. Agosto; Brenda Kucha Anak Ganeng; Megan E. Kruskie; Margaret A. McNulty; Kyle A. Robertson; Cecelia J. Vetter; Sabrina C. Woods; Md. Nazmul Karim; Adam B. Wilson – Anatomical Sciences Education, 2025
To keep pace with medicine's unpredictable changes, medical trainees must learn to accurately monitor and evaluate themselves via metacognition (i.e., thinking about thinking). The Metacognitive Awareness Inventory (MAI) can assess and guide the metacognitive development of trainees. This study summarizes existing psychometric evidence and…
Descriptors: Meta Analysis, Psychometrics, Metacognition, Measures (Individuals)
Judy R. Wilkerson; W. Steve Lang; LaSonya Moore – Journal of Research in Education, 2025
The DAATS (Dispositions Assessments Aligned with Teacher Standards) battery is a series of five instruments of different item types that measure teachers' consistency with the critical dispositions embedded in the InTASC Standards. The purpose of this study was to continue a 20-year research project on the development and implementation of…
Descriptors: Educational Assessment, National Standards, Teacher Evaluation, Teacher Competencies
Muhammed Parviz; Masoud Azizi – Discover Education, 2025
This article offers a critical review of the Ministry of Science, Research, and Technology English Proficiency Test (MSRT), a high-stakes exam required for postgraduate graduation, scholarships, and certain employment positions in Iran. Despite its widespread use, the design and implementation of the MSRT raise concerns about its validity and…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning

Peer reviewed
Direct link
