Alicia A. Stoltenberg – ProQuest LLC, 2024
Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…
Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences
Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023
The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…
Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis
Chun Wang; Ruoyi Zhu; Gongjun Xu – Grantee Submission, 2022
Differential item functioning (DIF) analysis refers to procedures that evaluate whether an item's characteristic differs for different groups of persons after controlling for overall differences in performance. DIF is routinely evaluated as a screening step to ensure items behave the same across groups. Currently, the majority of DIF studies focus…
Descriptors: Models, Item Response Theory, Item Analysis, Comparative Analysis
Clark McKown; Nicole Russo-Ponsaran; Matthew Wronski; Ashley Karls – Grantee Submission, 2025
This study describes the rationale, design, development, and technical properties of SELweb MS, a direct assessment of social and emotional competencies in middle school students. Assessment and item design were iteratively developed with input from youth and experts to measure five domains: Self-Awareness, Self-Management, Social Awareness,…
Descriptors: Psychometrics, Social Emotional Learning, Middle School Students, Correlation
Hosseinzadeh, Mostafa – ProQuest LLC, 2021
In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple structure, multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…
Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods
Smith, Trevor I.; Bendjilali, Nasrine – Physical Review Physics Education Research, 2022
Several recent studies have employed item response theory (IRT) to rank incorrect responses to commonly used research-based multiple-choice assessments. These studies use Bock's nominal response model (NRM) for applying IRT to categorical (nondichotomous) data, but the response rankings only utilize half of the parameters estimated by the model.…
Descriptors: Item Response Theory, Test Items, Multiple Choice Tests, Science Tests
Koyuncu, Ilhan; Kilic, Abdullah Faruk – International Journal of Assessment Tools in Education, 2021
In exploratory factor analysis, researchers decide which items belong to which factors by considering statistical results, but these decisions can be subjective when items have similar factor loadings and complex factor structures. The aim of this study was to examine the validity of classifying items into…
Descriptors: Classification, Graphs, Factor Analysis, Decision Making
Tu, Thuy Thi Minh – ProQuest LLC, 2023
The study aimed to elicit information from Vietnamese EFL university instructors about their knowledge and skills regarding the principles, theory, and practices of language assessment by means of revision and validation of the Language Assessment Literacy--Revised Vietnam (LAL-RV), which was previously developed by Kremmel and Harding (2020). A…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, College Faculty
Atmoko, Adi; Hambali, IM.; Barida, Muya – Pegem Journal of Education and Instruction, 2022
Religious morals are important to explore in each student participating in learning in the new normal era. The purpose of this study is to develop a religious motivation scale instrument that can be used to capture students' religious motivation in this new normal era. The research applies the ADDIE research and development design. Participants…
Descriptors: Religious Factors, Religious Education, Islam, Moral Values
Patel, Nirmal; Sharma, Aditya; Shah, Tirth; Lomas, Derek – Journal of Educational Data Mining, 2021
Process Analysis is an emerging approach to discover meaningful knowledge from temporal educational data. The study presented in this paper shows how we used Process Analysis methods on the National Assessment of Educational Progress (NAEP) test data for modeling and predicting student test-taking behavior. Our process-oriented data exploration…
Descriptors: Learning Analytics, National Competency Tests, Evaluation Methods, Prediction
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments generated from a large test-item database maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity, it is important that all instances of an assessment that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022
Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…
Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction
Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019
Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…
Descriptors: Item Response Theory, Models, Scores, Comparative Analysis
Boxuan Ma; Sora Fukui; Yuji Ando; Shinichi Konomi – Journal of Educational Data Mining, 2024
Language proficiency diagnosis is essential to extract fine-grained information about the linguistic knowledge states and skill mastery levels of test takers based on their performance on language tests. Unlike comprehensive standardized tests, many language learning apps revolve around word-level questions. Therefore, knowledge…
Descriptors: Language Proficiency, Brain Hemisphere Functions, Language Processing, Task Analysis
