NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 46 to 60 of 5,127 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Cole, Ki; Paek, Insu – Measurement: Interdisciplinary Research and Perspectives, 2022
Statistical Analysis Software (SAS) is a widely used tool for data management analysis across a variety of fields. The procedure for item response theory (PROC IRT) is one to perform unidimensional and multidimensional item response theory (IRT) analysis for dichotomous and polytomous data. This review provides a summary of the features of PROC…
Descriptors: Item Response Theory, Computer Software, Item Analysis, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025
This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…
Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism
Peer reviewed Peer reviewed
Direct linkDirect link
Semere Kiros Bitew; Amir Hadifar; Lucas Sterckx; Johannes Deleu; Chris Develder; Thomas Demeester – IEEE Transactions on Learning Technologies, 2024
Multiple-choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, owing to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Test Construction, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Theodore E. G. Alivio; Claire E. Galloway; Blain Mamiya; Vickie M. Williamson – Journal of Science Education and Technology, 2024
The link between a student's math fluency and their success in general chemistry has been thoroughly documented in the literature. One diagnostic instrument that can be used to assess a student's arithmetic skills is the Math-Up Skills Test (MUST), a 20-question, free-response math test completed in 15 min. The MUST instrument assesses the…
Descriptors: Mathematics Tests, Test Items, Item Analysis, Early Intervention
Peer reviewed Peer reviewed
Direct linkDirect link
Kelly B. Beck; Lauren A. Terhorst; Carol M. Greco; Jamie L. Kulzer; Elizabeth R. Skidmore; Michael P. McCue – Journal of Autism and Developmental Disorders, 2024
Quality of life (QOL) and life satisfaction are important research priorities for autistic adults. As such, we saw a need to evaluate individual items of commonly used subjective QOL scales to understand how they are interpreted and perceived by autistic adults. This study used cognitive interviews and repeated sampling to evaluate the…
Descriptors: Quality of Life, Life Satisfaction, Measures (Individuals), Adults
Hannah Gadd Ardrey – ProQuest LLC, 2024
The purpose of the study was to investigate secondary choral music educators' and administrators' perceptions of the use of the Mississippi Professional Growth System (PGS) as an applicable tool for evaluating secondary choral music educators. While there is limited research regarding the evaluation of choral music educators, this study aimed to…
Descriptors: Secondary School Teachers, Music Teachers, Singing, Teacher Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Sen, Sedat; Cohen, Allan S. – Educational and Psychological Measurement, 2023
The purpose of this study was to examine the effects of different data conditions on item parameter recovery and classification accuracy of three dichotomous mixture item response theory (IRT) models: the Mix1PL, Mix2PL, and Mix3PL. Manipulated factors in the simulation included the sample size (11 different sample sizes from 100 to 5000), test…
Descriptors: Sample Size, Item Response Theory, Accuracy, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Liang, Xinya; Cao, Chunhua – Journal of Experimental Education, 2023
To evaluate multidimensional factor structure, a popular method that combines features of confirmatory and exploratory factor analysis is Bayesian structural equation modeling with small-variance normal priors (BSEM-N). This simulation study evaluated BSEM-N as a variable selection and parameter estimation tool in factor analysis with sparse…
Descriptors: Factor Analysis, Bayesian Statistics, Structural Equation Models, Simulation
Jungsun Go – ProQuest LLC, 2023
The purpose of this study was to investigate the effectiveness of four different models (bifactor, CTC(M-1), CTCU and unidimensional) as to optimal model selection when the wording effect associated with negatively worded items was present. A Monte Carlo simulation study was conducted to compare model-data fit and accuracy in parameter estimates…
Descriptors: Language Usage, Negative Attitudes, Models, Goodness of Fit
Peer reviewed Peer reviewed
Direct linkDirect link
Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2023
We aim to graphically analyze the depth of conceptual understanding behind the Force Concept Inventory (FCI) responses of students, focusing on three questions (questions 1, 15, and 28). In our study, we created and implemented subquestions to clarify and quantify the students' reasoning steps in reaching their responses to the original FCI…
Descriptors: Scientific Concepts, Concept Formation, Misconceptions, Visual Aids
Peer reviewed Peer reviewed
Direct linkDirect link
Vy Le; Jayson M. Nissen; Xiuxiu Tang; Yuxiao Zhang; Amirreza Mehrabi; Jason W. Morphew; Hua Hua Chang; Ben Van Dusen – Physical Review Physics Education Research, 2025
In physics education research, instructors and researchers often use research-based assessments (RBAs) to assess students' skills and knowledge. In this paper, we support the development of a mechanics cognitive diagnostic to test and implement effective and equitable pedagogies for physics instruction. Adaptive assessments using cognitive…
Descriptors: Physics, Science Education, Scientific Concepts, Diagnostic Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Xiao, Yue; Veldkamp, Bernard; Liu, Hongyun – Educational Measurement: Issues and Practice, 2022
The action sequences of respondents in problem-solving tasks reflect rich and detailed information about their performance, including differences in problem-solving ability, even if item scores are equal. It is therefore not sufficient to infer individual problem-solving skills based solely on item scores. This study is a preliminary attempt to…
Descriptors: Problem Solving, Item Response Theory, Scores, Item Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Cum, Sait – International Journal of Assessment Tools in Education, 2021
In this study, it was claimed that ROC analysis, which is used to determine to what extent medical diagnosis tests can be differentiated between patients and non-patients, can also be used to examine the discrimination of binary scored items in cognitive tests. In order to obtain various evidence for this claim, the 2x2 contingency table used in…
Descriptors: Test Items, Item Analysis, Discriminant Analysis, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Stephen Humphry; Paul Montuoro; Carolyn Maxwell – Journal of Psychoeducational Assessment, 2024
This article builds upon a proiminent definition of construct validity that focuses on variation in attributes causing variation in measurement outcomes. This article synthesizes the defintion and uses Rasch measurement modeling to explicate a modified conceptualization of construct validity for assessments of developmental attributes. If…
Descriptors: Construct Validity, Measurement Techniques, Developmental Stages, Item Analysis
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  342