ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	6

Source

Applied Measurement in…

Author

Abbakumov, Dmitry	1
Anne Corinne Huggins-Manley	1
Canivez, Gary L.	1
Daniel Katz	1
Desmet, Piet	1
Diao, Hongyu	1
Haladyna, Thomas M.	1
Keller, Lisa	1
Rodriguez, Michael C.	1
Sinharay, Sandip	1
Stevens, Craig	1
Van den Noortgate, Wim	1
Walter Leite	1
Youngstrom, Eric A.	1
More ▼

Publication Type

Journal Articles	6
Reports - Research	5
Reports - Evaluative	1

Education Level

Higher Education	3
Postsecondary Education	3
Elementary Secondary Education	1

Audience

Location

Florida

Laws, Policies, & Programs

Assessments and Surveys

Wechsler Intelligence Scale…	1
Woodcock Johnson Tests of…	1

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Personalized Online Learning, Test Fairness, and Educational Measurement: Considering Differential Content Exposure Prior to a High Stakes End of Course Exam

Peer reviewed

Direct link

Daniel Katz; Anne Corinne Huggins-Manley; Walter Leite – Applied Measurement in Education, 2022

According to the "Standards for Educational and Psychological Testing" (2014), one aspect of test fairness concerns examinees having comparable opportunities to learn prior to taking tests. Meanwhile, many researchers are developing platforms enhanced by artificial intelligence (AI) that can personalize curriculum to individual student…

Descriptors: High Stakes Tests, Test Bias, Testing Problems, Prior Learning

Rasch Model Extensions for Enhanced Formative Assessments in MOOCs

Peer reviewed

Direct link

Abbakumov, Dmitry; Desmet, Piet; Van den Noortgate, Wim – Applied Measurement in Education, 2020

Formative assessments are an important component of massive open online courses (MOOCs), online courses with open access and unlimited student participation. Accurate conclusions on students' proficiency via formative, however, face several challenges: (a) students are typically allowed to make several attempts; and (b) student performance might…

Descriptors: Item Response Theory, Formative Evaluation, Online Courses, Response Style (Tests)

Challenges to the Cattell-Horn-Carroll Theory: Empirical, Clinical, and Policy Implications

Peer reviewed

Direct link

Canivez, Gary L.; Youngstrom, Eric A. – Applied Measurement in Education, 2019

The Cattell-Horn-Carroll (CHC) taxonomy of cognitive abilities married John Horn and Raymond Cattell's Extended Gf-Gc theory with John Carroll's Three-Stratum Theory. While there are some similarities in arrangements or classifications of tasks (observed variables) within similar broad or narrow dimensions, other salient theoretical features and…

Descriptors: Taxonomy, Cognitive Ability, Intelligence, Cognitive Tests

Investigating Repeater Effects on Small Sample Equating: Include or Exclude?

Peer reviewed

Direct link

Diao, Hongyu; Keller, Lisa – Applied Measurement in Education, 2020

Examinees who attempt the same test multiple times are often referred to as "repeaters." Previous studies suggested that repeaters should be excluded from the total sample before equating because repeater groups are distinguishable from non-repeater groups. In addition, repeaters might memorize anchor items, causing item drift under a…

Descriptors: Licensing Examinations (Professions), College Entrance Examinations, Repetition, Testing Problems

Are Multiple-Choice Items Too Fat?

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019

The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…

Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Direct link

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

Testing Problems	6
Item Response Theory	3
College Entrance Examinations	2
Evaluation Criteria	2
Licensing Examinations…	2
Achievement Tests	1
Algebra	1
Artificial Intelligence	1
Cognitive Ability	1
Cognitive Tests	1
Comparative Analysis	1
Computation	1
Elementary Secondary Education	1
Equated Scores	1
Evaluation Problems	1
Factor Structure	1
Formative Evaluation	1
Goodness of Fit	1
Guessing (Tests)	1
High Stakes Tests	1
Individualized Instruction	1
Intelligence	1
Intelligence Tests	1
Mathematics Tests	1
Multiple Choice Tests	1
More ▼