Publication Date
In 2025 | 1 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 19 |
Since 2016 (last 10 years) | 47 |
Since 2006 (last 20 years) | 68 |
Descriptor
Comparative Analysis | 92 |
Correlation | 92 |
Item Analysis | 92 |
Foreign Countries | 33 |
Test Items | 27 |
Factor Analysis | 23 |
Statistical Analysis | 23 |
Scores | 22 |
Student Attitudes | 16 |
Difficulty Level | 15 |
Measures (Individuals) | 14 |
More ▼ |
Source
Author
Allan S. Cohen | 2 |
Bratfisch, Oswald | 2 |
Herman, Keith C. | 2 |
Kilgus, Stephen P. | 2 |
Riley-Tillman, T. Chris | 2 |
Sinclair, James S. | 2 |
Van Wie, Michael P. | 2 |
Acar, Tülin | 1 |
Airasian, Peter W. | 1 |
Akhtar, Hanif | 1 |
Algina, James | 1 |
More ▼ |
Publication Type
Reports - Research | 68 |
Journal Articles | 60 |
Speeches/Meeting Papers | 10 |
Reports - Evaluative | 7 |
Tests/Questionnaires | 7 |
Dissertations/Theses -… | 3 |
Information Analyses | 3 |
Numerical/Quantitative Data | 2 |
Books | 1 |
Opinion Papers | 1 |
Education Level
Audience
Researchers | 3 |
Practitioners | 1 |
Students | 1 |
Location
Turkey | 7 |
Canada | 4 |
South Korea | 4 |
United Kingdom (England) | 4 |
Germany | 2 |
Netherlands | 2 |
Canada (Toronto) | 1 |
Czech Republic | 1 |
Germany (Berlin) | 1 |
India | 1 |
Indonesia | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
Karl Schweizer; Andreas Gold; Dorothea Krampen; Stefan Troche – Educational and Psychological Measurement, 2024
Conceptualizing two-variable disturbances preventing good model fit in confirmatory factor analysis as item-level method effects instead of correlated residuals avoids violating the principle that residual variation is unique for each item. The possibility of representing such a disturbance by a method factor of a bifactor measurement model was…
Descriptors: Correlation, Factor Analysis, Measurement Techniques, Item Analysis
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Celen, Umit; Aybek, Eren Can – International Journal of Assessment Tools in Education, 2022
Item analysis is performed by developers as an integral part of the scale development process. Thus, items are excluded from the scale depending on the item analysis prior to the factor analysis. Existing item discrimination indices are calculated based on correlation, yet items with different response patterns are likely to have a similar item…
Descriptors: Likert Scales, Factor Analysis, Item Analysis, Correlation
Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024
A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…
Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Robie, Chet; Meade, Adam W.; Risavy, Stephen D.; Rasheed, Sabah – Educational and Psychological Measurement, 2022
The effects of different response option orders on survey responses have been studied extensively. The typical research design involves examining the differences in response characteristics between conditions with the same item stems and response option orders that differ in valence--either incrementally arranged (e.g., strongly disagree to…
Descriptors: Likert Scales, Psychometrics, Surveys, Responses
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Phusee-orn, Songsak; Pongteerawut, Sasipat – Journal of Educational Issues, 2022
The research aimed at studying and comparing the futuristic thinking of Grade 9 students studying in schools of different sizes. The samples of the research were Grade 9 students of semester 2 in academic year 2020 in Sisaket Province, Thailand. The multi-stage random sampling technique was employed for the selection of 860 students from 12…
Descriptors: School Size, Futures (of Society), Correlation, Likert Scales
Yildirim, Osman Gazi; Ozdener, Nesrin – International Journal of Computer Science Education in Schools, 2022
The main goal of the current study is to develop a reliable instrument to measure programming anxiety in university students. A pool of 33 items based on extensive literature review and experts' opinions were created by researchers. The draft scale comprised three factors applied to 392 university students from two different universities in Turkey…
Descriptors: Anxiety, Undergraduate Students, Student Attitudes, Factor Analysis
Akhtar, Hanif – International Association for Development of the Information Society, 2022
When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…
Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes
Zijlmans, Eva A. O.; Tijmstra, Jesper; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2018
Reliability is usually estimated for a total score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the repeatability of an individual item score in a group. Three methods to estimate item-score reliability are discussed, known as method MS, method [lambda][subscript 6], and method CA. The item-score…
Descriptors: Test Items, Test Reliability, Correlation, Comparative Analysis
PaaBen, Benjamin; Dywel, Malwina; Fleckenstein, Melanie; Pinkwart, Niels – International Educational Data Mining Society, 2022
Item response theory (IRT) is a popular method to infer student abilities and item difficulties from observed test responses. However, IRT struggles with two challenges: How to map items to skills if multiple skills are present? And how to infer the ability of new students that have not been part of the training data? Inspired by recent advances…
Descriptors: Item Response Theory, Test Items, Item Analysis, Inferences
Swit, Cara S. – Early Child Development and Care, 2021
The goal of the study was to examine preschool teachers' (N = 96) and parents' (N = 82) perceptions of seriousness, empathy, likelihood to intervene, and intervention responses for perpetrators and victims of hypothetical scenarios depicting relational and physical aggression. After establishing differential associations between relational and…
Descriptors: Aggression, Empathy, Intervention, Victims