Yamaguchi, Kazuhiro – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022
This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…
Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2018
This article critically reviews how diagnostic models have been conceptualized and how they compare to other approaches used in educational measurement. In particular, certain assumptions that have been taken for granted and used as defining characteristics of diagnostic models are reviewed and it is questioned whether these assumptions are the…
Descriptors: Criticism, Psychometrics, Diagnostic Tests, Educational Assessment
Wind, Stefanie A. – Educational and Psychological Measurement, 2017
Molenaar extended Mokken's original probabilistic-nonparametric scaling models for use with polytomous data. These polytomous extensions of Mokken's original scaling procedure have facilitated the use of Mokken scale analysis as an approach to exploring fundamental measurement properties across a variety of domains in which polytomous ratings are…
Descriptors: Nonparametric Statistics, Scaling, Models, Item Response Theory
Rupp, André A.; van Rijn, Peter W. – Measurement: Interdisciplinary Research and Perspectives, 2018
We review the GDINA and CDM packages in R for fitting cognitive diagnosis/diagnostic classification models. We first provide a summary of their core capabilities and then use both simulated and real data to compare their functionalities in practice. We found that the most relevant routines in the two packages appear to be more similar than…
Descriptors: Educational Assessment, Cognitive Measurement, Measurement, Computer Software
Haberman, Shelby J.; Liu, Yang; Lee, Yi-Hsuan – ETS Research Report Series, 2019
Distractor analyses are routinely conducted in educational assessments with multiple-choice items. In this research report, we focus on three item response models for distractors: (a) the traditional nominal response (NR) model, (b) a combination of a two-parameter logistic model for item scores and a NR model for selections of incorrect…
Descriptors: Multiple Choice Tests, Scores, Test Reliability, High Stakes Tests
Fulmer, Gavin W.; Polikoff, Morgan S. – Educational Assessment, Evaluation and Accountability, 2014
An essential component in school accountability efforts is for assessments to be well-aligned with the standards or curriculum they are intended to measure. However, relatively little prior research has explored methods to determine statistical significance of alignment or misalignment. This study explores analyses of alignment as a special case…
Descriptors: Alignment (Education), Educational Assessment, Academic Standards, Regression (Statistics)
Crawford, Aaron – ProQuest LLC, 2014
This simulation study compared the utility of various discrepancy measures within a posterior predictive model checking (PPMC) framework for detecting different types of data-model misfit in multidimensional Bayesian network (BN) models. The investigated conditions were motivated by an applied research program utilizing an operational complex…
Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit
Crosson, Pat; Orcutt, Bonnie – Change: The Magazine of Higher Learning, 2014
This article describes efforts in Massachusetts and the Multi-State Collaborative to Advance Learning Outcomes Assessment (MSC) to develop a statewide system for learning outcomes assessment that does not rely on standardized testing and that is designed to transcend the traditional tensions and boundaries between campus-based formative and…
Descriptors: Educational Assessment, Higher Education, Formative Evaluation, Accountability
Koziol, Natalie A. – Applied Measurement in Education, 2016
Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…
Descriptors: Classification, Accuracy, Comparative Analysis, Models
Jacob, Robin T.; Goddard, Roger D.; Kim, Eun Sook – Educational Evaluation and Policy Analysis, 2014
It is often difficult and costly to obtain individual-level student achievement data, yet researchers are frequently reluctant to use school-level achievement data that are widely available from state websites. We argue that public-use aggregate school-level achievement data are, in fact, sufficient to address a wide range of evaluation questions…
Descriptors: Academic Achievement, Data, Information Utilization, Educational Assessment
Rhoades, Jeffrey T. – ProQuest LLC, 2014
The community college baccalaureate and the university-center baccalaureate models are gaining traction in the state of California as alternative ways to address the need for greater access to baccalaureate degree programs and to increase the baccalaureate-educated workforce. Little is known about the characteristics and factors associated with the…
Descriptors: Community Colleges, Bachelors Degrees, Case Studies, Undergraduate Study
Bulut, Okan – ProQuest LLC, 2013
The importance of subscores in educational and psychological assessments is undeniable. Subscores yield diagnostic information that can be used for determining how each examinee's abilities/skills vary over different content domains. One of the most common criticisms about reporting and using subscores is insufficient reliability of subscores.…
Descriptors: Item Response Theory, Simulation, Correlation, Reliability
Lin, Pei-Ying; Lin, Yu-Cheng – International Journal of Inclusive Education, 2015
To identify teacher candidates' needs for training in inclusive classroom assessment, the present study investigated teacher candidates' beliefs about inclusive classroom assessments for all students educated in regular classrooms, including those with special needs and English language learners. An innovative theoretical assessment model,…
Descriptors: Foreign Countries, Preservice Teachers, Teacher Education, Inclusion
Razzaq, Jamila; Forde, Christine – School Leadership & Management, 2014
This article argues that although there are increasing similarities in priorities across different national education systems, contextual differences raise questions about the replication of sets of change strategies based on particular understandings of the nature of educational change across these different systems. This article begins with an…
Descriptors: Foreign Countries, Educational Change, Developing Nations, Administrators