Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 22 |
| Since 2007 (last 20 years) | 57 |
Descriptor
| Models | 76 |
| Test Validity | 76 |
| Item Response Theory | 51 |
| Test Items | 31 |
| Test Reliability | 28 |
| Test Construction | 23 |
| Foreign Countries | 20 |
| Item Analysis | 18 |
| Psychometrics | 18 |
| Goodness of Fit | 11 |
| Factor Analysis | 10 |
| More ▼ | |
Source
Author
| Bejar, Isaac I. | 3 |
| Baghaei, Purya | 2 |
| Bennett, Randy Elliot | 2 |
| Graf, Edith Aurora | 2 |
| Khorramdel, Lale | 2 |
| Yocom, Peter | 2 |
| von Davier, Matthias | 2 |
| Abner, Kristin | 1 |
| Achituv, Michal | 1 |
| Ahmad, Mazalah | 1 |
| Ana Hernández-Dorado | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 58 |
| Reports - Research | 47 |
| Reports - Evaluative | 13 |
| Reports - Descriptive | 7 |
| Dissertations/Theses -… | 6 |
| Speeches/Meeting Papers | 4 |
| Tests/Questionnaires | 2 |
| Books | 1 |
| Collected Works - General | 1 |
| Opinion Papers | 1 |
Education Level
Audience
Location
| California | 2 |
| Germany | 2 |
| Iran | 2 |
| Spain | 2 |
| United Kingdom | 2 |
| Brazil | 1 |
| Idaho | 1 |
| Israel | 1 |
| Malaysia | 1 |
| Massachusetts | 1 |
| Missouri | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Pere J. Ferrando; Fabia Morales-Vives; Ana Hernández-Dorado – Educational and Psychological Measurement, 2024
In recent years, some models for binary and graded format responses have been proposed to assess unipolar variables or "quasi-traits." These studies have mainly focused on clinical variables that have traditionally been treated as bipolar traits. In the present study, we have made a proposal for unipolar traits measured with continuous…
Descriptors: Item Analysis, Goodness of Fit, Accuracy, Test Validity
Ge, Yuan – ProQuest LLC, 2022
My dissertation research explored responder behaviors (e.g., demonstrating response styles, carelessness, and possessing misconceptions) that compromise psychometric quality and impact the interpretation and use of assessment results. Identifying these behaviors can help researchers understand and minimize their potentially construct-irrelevant…
Descriptors: Test Wiseness, Response Style (Tests), Item Response Theory, Psychometrics
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Chen Zong; Mirian Howland Cummings; Carolyn Haug; Nancy L. Leech – Research in the Schools, 2025
Faculty at institutions of higher education work longer hours and many are burned out and experience low satisfaction and low coping ability. To investigate this phenomenon, this study validated the instrument of Core Self-Evaluations Scale with a sample of higher education faculty members in the U.S., and three different theoretical models (Gu et…
Descriptors: Teacher Attitudes, Beliefs, Self Concept, College Faculty
Oliva, Jose M.; Blanco, Ángel – European Journal of Science and Mathematics Education, 2023
A questionnaire was recently developed for the use with the Spanish-speaking, and evidence have been provided about the construct internal validity by means of structural equation modelling. In this paper, two research questions were considered: (i) What new evidence does application of the Rasch model provide regarding the validity of this…
Descriptors: Spanish Speaking, High School Students, College Students, Item Response Theory
Karadavut, Tugba – Applied Measurement in Education, 2021
Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…
Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics
Schulz, Andreas; Leuders, Timo; Rangel, Ulrike – Journal of Psychoeducational Assessment, 2020
We provide evidence of validity for a newly developed diagnostic competence model of operation sense, by both (a) describing the theoretically substantiated development of the competence model in close association with its use within a large-scale formative assessment and (b) providing empirical evidence for the theoretically described cognitive…
Descriptors: Diagnostic Tests, Models, Criterion Referenced Tests, Cognitive Measurement
Mateja Ploj Virtic; Andre Du Plessis; Andrej Šorgo – Center for Educational Policy Studies Journal, 2023
In the context of improving the quality of teacher education, the focus of the present work was to adapt the Mentoring for Effective Primary Science Teaching instrument to become more universal and have the potential to be used beyond the elementary science mentoring context. The adapted instrument was renamed the Mentoring for Effective Teaching…
Descriptors: Test Construction, Test Validity, Test Reliability, Measures (Individuals)
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Husain, Waqar – Health Education, 2022
Purpose: "Psychosocial health" is a new term to comprehend the already established factors involved in mental health and psychological well-being. The term has not been specifically defined and explained within the framework of psychology. Design/methodology/approach: The study proposed and validated a new model of psychosocial health.…
Descriptors: Mental Health, Psychological Patterns, Well Being, Models
Lúcio, Patrícia Silva; Vandekerckhove, Joachim; Polanczyk, Guilherme V.; Cogo-Moreira, Hugo – Journal of Psychoeducational Assessment, 2021
The present study compares the fit of two- and three-parameter logistic (2PL and 3PL) models of item response theory in the performance of preschool children on the Raven's Colored Progressive Matrices. The test of Raven is widely used for evaluating nonverbal intelligence of factor g. Studies comparing models with real data are scarce on the…
Descriptors: Guessing (Tests), Item Response Theory, Test Validity, Preschool Children
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
Warsono; Nursuhud, Puji Iman; Darma, Rio Sandhika; Supahar – International Journal of Instruction, 2020
The study was conducted to analyze the items about the ability of high school students diagram representation and obtain Item Curve Characteristic. Grid test instruments are compiled based on competencies and indicators of diagram representation which are then used to compile items. The test instrument consisted of five items and was validated by…
Descriptors: High School Students, Problem Solving, Visual Aids, Scoring

Peer reviewed
Direct link
