Publication Date
| In 2026 | 0 |
| Since 2025 | 11 |
| Since 2022 (last 5 years) | 66 |
| Since 2017 (last 10 years) | 144 |
| Since 2007 (last 20 years) | 255 |
Descriptor
| Difficulty Level | 492 |
| Item Analysis | 492 |
| Test Items | 377 |
| Test Construction | 153 |
| Foreign Countries | 118 |
| Multiple Choice Tests | 103 |
| Test Validity | 95 |
| Item Response Theory | 91 |
| Test Reliability | 89 |
| Comparative Analysis | 80 |
| Statistical Analysis | 79 |
| More ▼ | |
Source
Author
| Reckase, Mark D. | 6 |
| Lord, Frederic M. | 5 |
| Roid, Gale | 4 |
| Bratfisch, Oswald | 3 |
| Cahen, Leonard S. | 3 |
| Dorans, Neil J. | 3 |
| Dunne, Tim | 3 |
| Facon, Bruno | 3 |
| Hambleton, Ronald K. | 3 |
| Huck, Schuyler W. | 3 |
| Kostin, Irene | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 34 |
| Practitioners | 4 |
| Teachers | 2 |
Location
| Indonesia | 8 |
| Nigeria | 8 |
| Turkey | 8 |
| Germany | 7 |
| Taiwan | 7 |
| South Africa | 6 |
| United States | 6 |
| Canada | 5 |
| India | 5 |
| China | 4 |
| Florida | 4 |
| More ▼ | |
Laws, Policies, & Programs
| Education Consolidation… | 1 |
| Elementary and Secondary… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Ames, Allison; Smith, Elizabeth – Journal of Educational Measurement, 2018
Bayesian methods incorporate model parameter information prior to data collection. Eliciting information from content experts is an option, but has seen little implementation in Bayesian item response theory (IRT) modeling. This study aims to use ethical reasoning content experts to elicit prior information and incorporate this information into…
Descriptors: Item Response Theory, Bayesian Statistics, Ethics, Specialists
Hong, Jon-Chao; Hwang, Ming-Yueh; Tai, Kai-Hsin; Lin, Pei-Hsin – Computer Assisted Language Learning, 2021
Remote association requires players to use their mental transformation to identify objects' relationships by activating knowledge application. A Chinese Remote Association game was designed (example question: [characters omitted], where the answer is [character omitted]) to explore learners' cognitive and affective effects, and then eighth grade…
Descriptors: Computer Games, Grade 8, Telecommunications, Handheld Devices
Gu, Lin; Ling, Guangming; Liu, Ou Lydia; Yang, Zhitong; Li, Guirong; Kardanova, Elena; Loyalka, Prashant – Assessment & Evaluation in Higher Education, 2021
We examine the effects of computer-based versus paper-based assessment of critical thinking skills, adapted from English (in the U.S.) to Chinese. Using data collected based on a random assignment between the two modes in multiple Chinese colleges, we investigate mode effects from multiple perspectives: mean scores, measurement precision, item…
Descriptors: Critical Thinking, Tests, Test Format, Computer Assisted Testing
Yeager, Rebecca; Meyer, Zachary – International Journal of Listening, 2022
This study investigates the effects of adding stem preview to an English for Academic Purposes (EAP) multiple-choice listening assessment. In stem preview, listeners may view the item stems, but not response options, before listening. Previous research indicates that adding preview to an exam typically decreases difficulty, but raises concerns…
Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Teaching Methods
Smith, Tamarah; Smith, Samantha – International Journal of Teaching and Learning in Higher Education, 2018
The Research Methods Skills Assessment (RMSA) was created to measure psychology majors' statistics knowledge and skills. The American Psychological Association's Guidelines for the Undergraduate Major in Psychology (APA, 2007, 2013) served as a framework for development. Results from a Rasch analysis with data from n = 330 undergraduates showed…
Descriptors: Psychology, Statistics, Undergraduate Students, Item Response Theory
McIntosh, James – Scandinavian Journal of Educational Research, 2019
This article examines whether the way that PISA models item outcomes in mathematics affects the validity of its country rankings. As an alternative to PISA methodology a two-parameter model is applied to PISA mathematics item data from Canada and Finland for the year 2012. In the estimation procedure item difficulty and dispersion parameters are…
Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, International Assessment
Wyse, Adam E. – Applied Measurement in Education, 2018
This article discusses regression effects that are commonly observed in Angoff ratings where panelists tend to think that hard items are easier than they are and easy items are more difficult than they are in comparison to estimated item difficulties. Analyses of data from two credentialing exams illustrate these regression effects and the…
Descriptors: Regression (Statistics), Test Items, Difficulty Level, Licensing Examinations (Professions)
Bourdeaud'Hui, Heleen; Aesaert, Koen; van Braak, Johan – Language Assessment Quarterly, 2021
Effective listening comprehension skills are an important prerequisite for the academic success of primary school students. However, the assessment of listening skills in the instructional language appears to have received only scant attention in the literature. Therefore, the goal of the present study was twofold. Firstly, a comprehensive…
Descriptors: Native Language, Indo European Languages, Second Language Learning, Test Items
Beauchamp, David; Constantinou, Filio – Research Matters, 2020
Assessment is a useful process as it provides various stakeholders (e.g., teachers, parents, government, employers) with information about students' competence in a particular subject area. However, for the information generated by assessment to be useful, it needs to support valid inferences. One factor that can undermine the validity of…
Descriptors: Computational Linguistics, Inferences, Validity, Language Usage
Zufferey, Sandrine; Gygax, Pascal – Discourse Processes: A Multidisciplinary Journal, 2020
Understanding discourse connectives is an important step to achieving effective verbal communication. Yet, the ability of adult native speakers to understand the broad range of connectives found in most Indo-European languages has seldom been assessed. In this article we demonstrate that some adults have difficulties recognizing correct and…
Descriptors: Language Usage, Form Classes (Languages), Discourse Analysis, Adults
Blotenberg, Iris; Schmidt-Atzert, Lothar – Journal of Intelligence, 2019
The present study set out to explore the locus of the poorly understood but frequently reported and comparatively large practice effect in sustained attention tests. Drawing on a recently proposed process model of sustained attention tests, several cognitive tasks were administered twice in order to examine which specific component of test…
Descriptors: Attention Control, Tests, Models, Test Items
Perkins, Kyle; Frank, Eva – Online Submission, 2018
This paper presents item-analysis data to illustrate how to identify a set of internally consistent test items that differentiate or discriminate among examinees who are highly proficient and nonproficient on the construct of interest. Suggestions for analyzing the quality of test items are offered as well as a pedagogical approach to augment the…
Descriptors: Item Analysis, Test Items, Test Reliability, Kinetics
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Pettersen, Andreas; Braeken, Johan – International Journal of Science and Mathematics Education, 2019
The implementation of mathematical competencies in school curricula requires assessment instruments to be aligned with this new view on mathematical mastery. However, there are concerns over whether existing assessments capture the wide variety of cognitive skills and abilities that constitute mathematical competence. The current study applied an…
Descriptors: Mathematics Instruction, Mathematics Skills, Mathematics Tests, Cognitive Ability
Chen, Pei-Hua; Fu, Jen-Tso – Language Assessment Quarterly, 2018
The Revised Preschool Language Assessment (RPLA) is a standardized measure for examining the language status of and determining potential language difficulties among preschoolers between 3 and 6 years old. To facilitate the applicability of the RPLA for use with Mandarin-speaking children, the present study adopted exploratory factor analysis…
Descriptors: Mandarin Chinese, Preschool Children, Language Tests, Standardized Tests

Peer reviewed
Direct link
