Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 34 |
| Since 2017 (last 10 years) | 801 |
| Since 2007 (last 20 years) | 2609 |
Descriptor
| Statistical Analysis | 2935 |
| Feedback (Response) | 1143 |
| Foreign Countries | 1107 |
| Item Response Theory | 663 |
| Questionnaires | 604 |
| Student Attitudes | 510 |
| Comparative Analysis | 498 |
| Correlation | 476 |
| Emotional Response | 458 |
| Teaching Methods | 446 |
| College Students | 375 |
Author
| Sinharay, Sandip | 17 |
| Smolkowski, Keith | 13 |
| Fien, Hank | 12 |
| Clarke, Ben | 11 |
| Doabler, Christian T. | 11 |
| Alonzo, Julie | 10 |
| Tindal, Gerald | 10 |
| Baker, Scott K. | 9 |
| Haberman, Shelby J. | 9 |
| Cho, Sun-Joo | 7 |
| Lai, Cheng-Fei | 7 |
Location
| Australia | 97 |
| Turkey | 71 |
| Germany | 65 |
| Taiwan | 57 |
| Canada | 55 |
| Iran | 54 |
| United Kingdom | 54 |
| China | 51 |
| Netherlands | 46 |
| California | 33 |
| Japan | 33 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Man, Kaiwen; Harring, Jeffrey R. – Educational and Psychological Measurement, 2021
Many approaches have been proposed to jointly analyze item responses and response times to understand behavioral differences between normally and aberrantly behaved test-takers. Biometric information, such as data from eye trackers, can be used to better identify these deviant testing behaviors in addition to more conventional data types. Given…
Descriptors: Cheating, Item Response Theory, Reaction Time, Eye Movements
Su, Shiyang; Wang, Chun; Weiss, David J. – Educational and Psychological Measurement, 2021
S-X² is a popular item fit index that is available in commercial software packages such as flexMIRT. However, no research has systematically examined the performance of S-X² for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was…
Descriptors: Statistics, Goodness of Fit, Test Items, Models
Cladera, Magdalena – Educational Assessment, Evaluation and Accountability, 2021
Students' feedback is usually gathered in institutions of higher education to evaluate the teaching quality from the students' perspective, using questionnaires administered at the end of the courses. These evaluations are useful to pinpoint the course strengths, identify areas of improvement, and understand the factors that contribute to…
Descriptors: Feedback (Response), Student Evaluation of Teacher Performance, Higher Education, Student Satisfaction
Gelman, Andrew – Grantee Submission, 2022
I discuss a published paper in political science that made a claim that aroused skepticism. The reanalysis is an example of how we, as consumers as well as producers of science, can engage with published work. This can be viewed as a sort of collaboration performed implicitly between the authors of a published paper and later researchers who want…
Descriptors: Criticism, Political Science, Social Science Research, Authors
Luo, Jiaorong; Yang, Mingcheng; Wang, Ling – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2023
The increased Simon effect with increasing the ratio of congruent trials may be interpreted by both attention modulation and irrelevant stimulus-response (S-R) associations learning accounts, although the reversed Simon effect with increasing the ratio of incongruent trials provides evidence supporting the latter account. To investigate if…
Descriptors: Foreign Countries, Responses, Reaction Time, Accuracy
Lorette, Pernelle – Journal of Multilingual and Multicultural Development, 2023
Quantitative social scientists have adopted the positivist epistemology and methodology of natural sciences, seeking objectivity, generalisability, and neutrality. However, in social sciences -- unlike in natural sciences -- humans are both the investigators and the object of investigation, leading to intricate interconnections between researchers…
Descriptors: Social Sciences, Statistical Analysis, Research Problems, Interpersonal Relationship
Ryan, Tracii; Henderson, Michael – Assessment & Evaluation in Higher Education, 2018
Assessment feedback allows students to obtain valuable information about how they can improve their future performance and learning strategies. However, research indicates that students are more likely to reject or ignore comments if they evoke negative emotional responses. Despite the importance of this issue, there is a lack of research…
Descriptors: Foreign Countries, College Students, Feedback (Response), Foreign Students
Iannario, Maria; Manisera, Marica; Piccolo, Domenico; Zuccolotto, Paola – Sociological Methods & Research, 2020
In analyzing data from attitude surveys, it is common to consider the "don't know" responses as missing values. In this article, we present a statistical model commonly used for the analysis of responses/evaluations expressed on Likert scales and extended to take into account the presence of don't know responses. The main objective is to…
Descriptors: Response Style (Tests), Likert Scales, Statistical Analysis, Models
Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019
In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…
Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores
Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021
The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…
Descriptors: Bayesian Statistics, Computation, Learning, Testing
Diaz, Emily; Brooks, Gordon; Johanson, George – International Journal of Assessment Tools in Education, 2021
This Monte Carlo study assessed Type I error in differential item functioning analyses using Lord's chi-square (LC), Likelihood Ratio Test (LRT), and Mantel-Haenszel (MH) procedure. Two research interests were investigated: item response theory (IRT) model specification in LC and the LRT and continuity correction in the MH procedure. This study…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Comparative Analysis
Karadavut, Tugba – Applied Measurement in Education, 2021
Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…
Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics
Erturk, Zafer; Oyar, Esra – International Journal of Assessment Tools in Education, 2021
Studies aiming to make cross-cultural comparisons first should establish measurement invariance in the groups to be compared because results obtained from such comparisons may be artificial in the event that measurement invariance cannot be established. The purpose of this study is to investigate the measurement invariance of the data obtained…
Descriptors: International Assessment, Foreign Countries, Attitude Measures, Mathematics
Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019
We derive formulas for the differential item functioning (DIF) measures that two routinely used DIF statistics are designed to estimate. The DIF measures that match on observed scores are compared to DIF measures based on an unobserved ability (theta or true score) for items that are described by either the one-parameter logistic (1PL) or…
Descriptors: Scores, Test Bias, Statistical Analysis, Item Response Theory

