Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 8 |
Descriptor
| Simulation | 8 |
| Test Reliability | 8 |
| Test Construction | 4 |
| Accuracy | 3 |
| Evaluation Methods | 3 |
| Error of Measurement | 2 |
| Foreign Countries | 2 |
| Language Proficiency | 2 |
| Test Format | 2 |
| Test Length | 2 |
| Test Validity | 2 |
| More ▼ | |
Source
| Applied Measurement in… | 1 |
| Education and Information… | 1 |
| European Journal of Education | 1 |
| Journal of Educational… | 1 |
| Journal of Occupational… | 1 |
| Measurement:… | 1 |
| Psychology Learning and… | 1 |
| Research Synthesis Methods | 1 |
Author
| Afsoon Hassani Mehraban | 1 |
| Akram Azad | 1 |
| Azaan Vhora | 1 |
| Clark, Amy K. | 1 |
| Cristina Semaca | 1 |
| Cui, Zhongmin | 1 |
| Delphine Franco | 1 |
| Gelbal, Selahattin | 1 |
| Gerta Rücker | 1 |
| Guido Schwarzer | 1 |
| Hau, Kit-Tai | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 8 |
| Reports - Research | 8 |
| Information Analyses | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 2 |
| Postsecondary Education | 2 |
Audience
Location
| Australia | 1 |
| Germany | 1 |
| Iran | 1 |
| North America | 1 |
| Sweden | 1 |
| United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024
The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…
Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods
Azaan Vhora; Ryan L. Davies; Kylie Rice – Psychology Learning and Teaching, 2024
Background: Objective Structured Clinical Examinations (OSCEs) are a simulation-based assessment tool used extensively in medical education for evaluating clinical competence. OSCEs are widely regarded as more valid, reliable, and valuable compared to traditional assessment measures, and are now emerging within professional psychology training…
Descriptors: Psychology, Higher Education, Psychometrics, Objective Tests
Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023
As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…
Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy
Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design
Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023
Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…
Descriptors: Test Format, Equated Scores, Best Practices, Test Construction
Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025
Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…
Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression
Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023
We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…
Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Marzieh Pashmdarfard; Afsoon Hassani Mehraban; Narges Shafaroodi; Kamran Soltani Arabshahi; Soroor Parvizy; Akram Azad; Samaneh Karamali Esmaeili – Journal of Occupational Therapy Education, 2022
Fieldwork education is an integral part of the educational process in occupational therapy and assessing student competency at the end of fieldwork is important. The aim of this study was to design and conduct an Objective Structured Clinical Examination (OSCE) based on the Occupational Therapy Practice Framework (OTPF) for occupational therapy…
Descriptors: Occupational Therapy, Allied Health Occupations Education, Test Construction, Test Validity

Peer reviewed
Direct link
