Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 19 |
Descriptor
| Educational Testing | 25 |
| Evaluation Methods | 25 |
| Test Items | 25 |
| Educational Assessment | 13 |
| Test Construction | 13 |
| Student Evaluation | 12 |
| Item Response Theory | 10 |
| Measurement | 8 |
| Foreign Countries | 7 |
| Psychometrics | 7 |
| Computer Assisted Testing | 6 |
| More ▼ | |
Source
Author
| Frey, Andreas | 2 |
| Ainley, John | 1 |
| Ban, Jae-Chun | 1 |
| Bertling, Jonas P. | 1 |
| Bricker, Diane | 1 |
| Brookhart, Susan M. | 1 |
| Capt, Betty | 1 |
| Carstensen, Claus H. | 1 |
| Check, John F. | 1 |
| Chen, Deng-Jyi | 1 |
| Chen, Shu-Ling | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 6 |
| Elementary Education | 3 |
| Postsecondary Education | 3 |
| Grade 6 | 2 |
| Higher Education | 2 |
| Secondary Education | 2 |
| Adult Education | 1 |
| Grade 8 | 1 |
| Grade 9 | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| More ▼ | |
Audience
| Researchers | 1 |
| Teachers | 1 |
Location
| Australia | 2 |
| Canada | 2 |
| Taiwan | 2 |
| Asia | 1 |
| China | 1 |
| Germany | 1 |
| Hong Kong | 1 |
| India | 1 |
| Japan | 1 |
| Nebraska | 1 |
| South Africa | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| ACT Assessment | 1 |
| Program for International… | 1 |
What Works Clearinghouse Rating
Nebraska Department of Education, 2024
The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…
Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students
Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015
The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…
Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping
Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015
How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…
Descriptors: English, Language Skills, English Language Learners, Scores
Emenogu, Barnabas C.; Falenchuk, Olesya; Childs, Ruth A. – Alberta Journal of Educational Research, 2010
Most implementations of the Mantel-Haenszel differential item functioning procedure delete records with missing responses or replace missing responses with scores of 0. These treatments of missing data make strong assumptions about the causes of the missing data. Such assumptions may be particularly problematic when groups differ in their patterns…
Descriptors: Foreign Countries, Test Bias, Test Items, Educational Testing
Brookhart, Susan M. – ASCD, 2010
Don't settle for assessing recall and comprehension only when you can use this guide to create assessments for higher-order thinking skills. Assessment expert Susan M. Brookhart brings you up to speed on how to develop and use test questions and other assessments that reveal how well your students can analyze, reason, solve problems, and think…
Descriptors: Test Items, Performance Based Assessment, Thinking Skills, Cognitive Processes
Koon, Sharon – ProQuest LLC, 2010
This study examined the effectiveness of the odds-ratio method (Penfield, 2008) and the multinomial logistic regression method (Kato, Moen, & Thurlow, 2009) for measuring differential distractor functioning (DDF) effects in comparison to the standardized distractor analysis approach (Schmitt & Bleistein, 1987). Students classified as participating…
Descriptors: Test Bias, Test Items, Reference Groups, Lunch Programs
Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010
Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…
Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques
Fan, Ya-Ching; Wang, Tzu-Hua; Wang, Kuo-Hua – Computers & Education, 2011
This research investigates the effect of a web-based model, named "Practicing, Reflecting, and Revising with Web-based Assessment and Test Analysis system (P2R-WATA) Assessment Literacy Development Model," on enhancing assessment knowledge and perspectives of secondary in-service teachers, and adopts a single group experimental research…
Descriptors: Research Design, Test Items, Summer Programs, Prior Learning
Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina – Studies in Educational Evaluation, 2009
Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…
Descriptors: Word Problems (Mathematics), Probability, Automation, College Students
Glas, Cees A. W.; Geerlings, Hanneke – Studies in Educational Evaluation, 2009
Pupil monitoring systems support the teacher in tailoring teaching to the individual level of a student and in comparing the progress and results of teaching with national standards. The systems are based on the availability of an item bank calibrated using item response theory. The assessment of the students' progress and results can be further…
Descriptors: Item Banks, Adaptive Testing, National Standards, Psychometrics
Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009
The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…
Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods
Penfield, Randall D. – Applied Measurement in Education, 2007
A widely used approach for categorizing the level of differential item functioning (DIF) in dichotomous items is the scheme proposed by Educational Testing Service (ETS) based on a transformation of the Mantel-Haeszel common odds ratio. In this article two classification schemes for DIF in polytomous items (referred to as the P1 and P2 schemes)…
Descriptors: Simulation, Educational Testing, Test Bias, Evaluation Methods
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…
Descriptors: Test Items, Classification, Psychometrics, Item Response Theory
Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009
On a general level, the objective of diagnostic classifications models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…
Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics
Previous Page | Next Page ยป
Pages: 1 | 2
Peer reviewed
Direct link
