Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences
Smith, Trevor I.; Bendjilali, Nasrine – Physical Review Physics Education Research, 2022
Several recent studies have employed item response theory (IRT) to rank incorrect responses to commonly used research-based multiple-choice assessments. These studies use Bock's nominal response model (NRM) for applying IRT to categorical (nondichotomous) data, but the response rankings only utilize half of the parameters estimated by the model.…
Descriptors: Item Response Theory, Test Items, Multiple Choice Tests, Science Tests
Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023
Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…
Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines
Olanipekun, Oluwaseun L.; Zhao, JuLong; Wang, Rongdong; Sedory, Stephen A.; Singh, Sarjinder – Sociological Methods & Research, 2023
In carrying out surveys involving sensitive characteristics, randomized response models have been considered among the best techniques since they provide the maximum privacy protection to the respondents and procure honest responses. Over the years, researchers have carried out studies on the estimation of proportions of the population possessing…
Descriptors: Correlation, Smoking, Thinking Skills, Health Behavior
Yan, Zi; Chiu, Ming Ming – British Educational Research Journal, 2023
Despite the general consensus on the positive impact of formative assessment on student learning, researchers have not shown the underlying mechanisms between specific formative assessment strategies and academic performance on an international sample. This study examines the link between student and teacher reports of teachers' formative…
Descriptors: Formative Evaluation, Evaluation Methods, Reading Achievement, Correlation
Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
Manapat, Patrick D.; Edwards, Michael C. – Educational and Psychological Measurement, 2022
When fitting unidimensional item response theory (IRT) models, the population distribution of the latent trait (θ) is often assumed to be normally distributed. However, some psychological theories would suggest a nonnormal θ. For example, some clinical traits (e.g., alcoholism, depression) are believed to follow a positively skewed…
Descriptors: Robustness (Statistics), Computational Linguistics, Item Response Theory, Psychological Patterns
Jamie L. Thompson – ProQuest LLC, 2023
Research indicates that teacher performance is a critical focus for school districts, administrators, and teachers. Pre-service teacher preparation, teacher retention, job satisfaction, mentoring, continuous feedback, and onboarding support for new teachers are all factors that influence teacher performance (Carver-Thomas & Darling-Hammond,…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Evaluation Methods, Feedback (Response)
Yuanfang Liu; Mark H. C. Lai; Ben Kelcey – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Measurement invariance holds when a latent construct is measured in the same way across different levels of background variables (continuous or categorical) while controlling for the true value of that construct. Using Monte Carlo simulation, this paper compares the multiple indicators, multiple causes (MIMIC) model and MIMIC-interaction to a…
Descriptors: Classification, Accuracy, Error of Measurement, Correlation
Ben Stenhaug; Ben Domingue – Grantee Submission, 2022
The fit of an item response model is typically conceptualized as whether a given model could have generated the data. We advocate for an alternative view of fit, "predictive fit", based on the model's ability to predict new data. We derive two predictive fit metrics for item response models that assess how well an estimated item response…
Descriptors: Goodness of Fit, Item Response Theory, Prediction, Models
Zaher M. Kmail; Gordon Brobbey – Journal of the American Academy of Special Education Professionals, 2024
Teacher evaluation has been closely tied to professional development. In special education, professional development experiences are meant to promote special educator learning and implementation of high leverage practices. Yet, the connection between teacher evaluation outcomes and professional development decisions of special educators is largely…
Descriptors: Teacher Evaluation, Special Education Teachers, Teacher Attitudes, Faculty Development
D'Urso, E. Damiano; Tijmstra, Jesper; Vermunt, Jeroen K.; De Roover, Kim – Educational and Psychological Measurement, 2023
Assessing the measurement model (MM) of self-report scales is crucial to obtain valid measurements of individuals' latent psychological constructs. This entails evaluating the number of measured constructs and determining which construct is measured by which item. Exploratory factor analysis (EFA) is the most-used method to evaluate these…
Descriptors: Factor Analysis, Measurement Techniques, Self Evaluation (Individuals), Psychological Patterns
Hosseinzadeh, Mostafa – ProQuest LLC, 2021
In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple structure, multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…
Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods
Brittany Nicole Rogers – ProQuest LLC, 2021
Prior research demonstrates that school leaders can help develop teacher efficacy by creating positive school climates, providing opportunities for mentoring and purposeful feedback, and providing meaningful professional learning (Bressman, Winters, & Efron, 2017; Fry, 2009; Ross & Bruce, 2007). Yet despite significant prior research on…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Administrator Attitudes, School Districts
Chung, Seungwon; Houts, Carrie – Measurement: Interdisciplinary Research and Perspectives, 2020
Advanced modeling of item response data through the item response theory (IRT) or item factor analysis frameworks is becoming increasingly popular. In the social and behavioral sciences, the underlying structure of tests/assessments is often multidimensional (i.e., more than 1 latent variable/construct is represented in the items). This review…
Descriptors: Item Response Theory, Evaluation Methods, Models, Factor Analysis

