ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	10
Since 2017 (last 10 years)	22
Since 2007 (last 20 years)	57

Descriptor

Models	76
Test Validity	76
Item Response Theory	51
Test Items	31
Test Reliability	28
Test Construction	23
Foreign Countries	20
Item Analysis	18
Psychometrics	18
Goodness of Fit	11
Factor Analysis	10
Comparative Analysis	9
Measurement Techniques	9
Statistical Analysis	9
Difficulty Level	8
Response Style (Tests)	8
Scores	8
Cognitive Processes	7
Computer Assisted Testing	7
Evaluation Methods	7
Evidence	7
Mathematics Tests	7
Prediction	7
Questionnaires	7
Responses	7
More ▼

Publication Type

Journal Articles	58
Reports - Research	47
Reports - Evaluative	13
Reports - Descriptive	7
Dissertations/Theses -…	6
Speeches/Meeting Papers	4
Tests/Questionnaires	2
Books	1
Collected Works - General	1
Opinion Papers	1

Education Level

Higher Education	15
Postsecondary Education	12
Secondary Education	9
Elementary Education	8
Elementary Secondary Education	6
Intermediate Grades	4
Middle Schools	4
Grade 4	3
High Schools	3
Junior High Schools	3
Adult Education	2
Early Childhood Education	2
Two Year Colleges	2
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Preschool Education	1
More ▼

Audience

Location

California	2
Germany	2
Iran	2
Spain	2
United Kingdom	2
Brazil	1
Idaho	1
Israel	1
Malaysia	1
Massachusetts	1
Missouri	1
New Mexico	1
Oregon	1
Pakistan	1
Sri Lanka	1
Sweden	1
Taiwan	1
Washington	1
Yemen	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	3
Hidden Figures Test	2
Test of English as a Foreign…	2
Trends in International…	2
Armed Services Vocational…	1
Brief Symptom Inventory	1
Early Childhood Environment…	1
Early Childhood Longitudinal…	1
Home Observation for…	1
Raven Progressive Matrices	1
Wechsler Adult Intelligence…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 76 results Save | Export

Measuring Unipolar Traits with Continuous Response Items: Some Methodological and Substantive Developments

Peer reviewed

Direct link

Pere J. Ferrando; Fabia Morales-Vives; Ana Hernández-Dorado – Educational and Psychological Measurement, 2024

In recent years, some models for binary and graded format responses have been proposed to assess unipolar variables or "quasi-traits." These studies have mainly focused on clinical variables that have traditionally been treated as bipolar traits. In the present study, we have made a proposal for unipolar traits measured with continuous…

Descriptors: Item Analysis, Goodness of Fit, Accuracy, Test Validity

Identifying and Understanding Examinee Behaviors in Item Response Data That Compromise Psychometric Quality

Direct link

Ge, Yuan – ProQuest LLC, 2022

My dissertation research explored responder behaviors (e.g., demonstrating response styles, carelessness, and possessing misconceptions) that compromise psychometric quality and impact the interpretation and use of assessment results. Identifying these behaviors can help researchers understand and minimize their potentially construct-irrelevant…

Descriptors: Test Wiseness, Response Style (Tests), Item Response Theory, Psychometrics

Item Response Theory Models for Difference-in-Difference Estimates (And Whether They Are Worth the Trouble)

Peer reviewed

Direct link

James Soland – Journal of Research on Educational Effectiveness, 2024

When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…

Descriptors: Item Response Theory, Testing, Test Validity, Intervention

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Direct link

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Faculty Beliefs about Themselves, Their Self-Worth, and Their Capabilities: Analysis of the Core Self-Evaluations Scale

Peer reviewed

Direct link

Chen Zong; Mirian Howland Cummings; Carolyn Haug; Nancy L. Leech – Research in the Schools, 2025

Faculty at institutions of higher education work longer hours and many are burned out and experience low satisfaction and low coping ability. To investigate this phenomenon, this study validated the instrument of Core Self-Evaluations Scale with a sample of higher education faculty members in the U.S., and three different theoretical models (Gu et…

Descriptors: Teacher Attitudes, Beliefs, Self Concept, College Faculty

Rasch Analysis and Validity of the Construct Understanding of the Nature of Models in Spanish-Speaking Students

Peer reviewed
PDF on ERIC

Download full text

Oliva, Jose M.; Blanco, Ángel – European Journal of Science and Mathematics Education, 2023

A questionnaire was recently developed for the use with the Spanish-speaking, and evidence have been provided about the construct internal validity by means of structural equation modelling. In this paper, two research questions were considered: (i) What new evidence does application of the Rasch model provide regarding the validity of this…

Descriptors: Spanish Speaking, High School Students, College Students, Item Response Theory

Characterizing the Latent Classes in a Mixture IRT Model Using DIF

Peer reviewed

Direct link

Karadavut, Tugba – Applied Measurement in Education, 2021

Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…

Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics

The Use of a Diagnostic Competence Model about Children's Operation Sense for Criterion-Referenced Individual Feedback in a Large-Scale Formative Assessment

Peer reviewed

Direct link

Schulz, Andreas; Leuders, Timo; Rangel, Ulrike – Journal of Psychoeducational Assessment, 2020

We provide evidence of validity for a newly developed diagnostic competence model of operation sense, by both (a) describing the theoretically substantiated development of the competence model in close association with its use within a large-scale formative assessment and (b) providing empirical evidence for the theoretically described cognitive…

Descriptors: Diagnostic Tests, Models, Criterion Referenced Tests, Cognitive Measurement

Development and Validation of the 'Mentoring for Effective Teaching Practicum Instrument'

Peer reviewed
PDF on ERIC

Download full text

Mateja Ploj Virtic; Andre Du Plessis; Andrej Šorgo – Center for Educational Policy Studies Journal, 2023

In the context of improving the quality of teacher education, the focus of the present work was to adapt the Mentoring for Effective Primary Science Teaching instrument to become more universal and have the potential to be used beyond the elementary science mentoring context. The adapted instrument was renamed the Mentoring for Effective Teaching…

Descriptors: Test Construction, Test Validity, Test Reliability, Measures (Individuals)

Investigating Item Complexity as a Source of Cross-National DIF in TIMSS Math and Science

Peer reviewed

Direct link

Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024

Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…

Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity

Components of Psychosocial Health

Peer reviewed

Direct link

Husain, Waqar – Health Education, 2022

Purpose: "Psychosocial health" is a new term to comprehend the already established factors involved in mental health and psychological well-being. The term has not been specifically defined and explained within the framework of psychology. Design/methodology/approach: The study proposed and validated a new model of psychosocial health.…

Descriptors: Mental Health, Psychological Patterns, Well Being, Models

Is It Worthy to Take Account of the "Guessing" in the Performance of the Raven Test? Calling for the Principle of Parsimony for Test Validation

Peer reviewed

Direct link

Lúcio, Patrícia Silva; Vandekerckhove, Joachim; Polanczyk, Guilherme V.; Cogo-Moreira, Hugo – Journal of Psychoeducational Assessment, 2021

The present study compares the fit of two- and three-parameter logistic (2PL and 3PL) models of item response theory in the performance of preschool children on the Raven's Colored Progressive Matrices. The test of Raven is widely used for evaluating nonverbal intelligence of factor g. Studies comparing models with real data are scarce on the…

Descriptors: Guessing (Tests), Item Response Theory, Test Validity, Preschool Children

Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities

Peer reviewed

Direct link

von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…

Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory

Empirical Analysis of Diagramatic Representation Test Instruments Using Partial Credit Model in Realizing Learning Outcomes

Peer reviewed
PDF on ERIC

Download full text

Warsono; Nursuhud, Puji Iman; Darma, Rio Sandhika; Supahar – International Journal of Instruction, 2020

The study was conducted to analyze the items about the ability of high school students diagram representation and obtain Item Curve Characteristic. Grid test instruments are compiled based on competencies and indicators of diagram representation which are then used to compile items. The test instrument consisted of five items and was validated by…

Descriptors: High School Students, Problem Solving, Visual Aids, Scoring

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

ProQuest LLC	5
ETS Research Report Series	3
Journal of Educational…	3
Journal of Educational and…	3
Applied Psychological…	2
Educational and Psychological…	2
International Education…	2
Journal of Psychoeducational…	2
Language Testing	2
Practical Assessment,…	2
Psychological Assessment	2
Psychometrika	2
Applied Measurement in…	1
Asia-Pacific Education…	1
Canadian Journal of School…	1
Center for Educational Policy…	1
Child & Youth Care Forum	1
Cognitive Science	1
Developmental Psychology	1
Education Sciences	1
Educational Assessment	1
Educational Psychologist	1
Educational Research and…	1
Educational Technology &…	1
Elementary School Journal	1
More ▼

Bejar, Isaac I.	3
Baghaei, Purya	2
Bennett, Randy Elliot	2
Graf, Edith Aurora	2
Khorramdel, Lale	2
Yocom, Peter	2
von Davier, Matthias	2
Abner, Kristin	1
Achituv, Michal	1
Ahmad, Mazalah	1
Ana Hernández-Dorado	1
Andre Du Plessis	1
Andrej Šorgo	1
Arias, Benito	1
Bachar, Eytan	1
Bachman, Lyle F.	1
Barefah, Allaa	1
Bejar, Issac I.	1
Bell, Sherry Mee	1
Bergsma, Wicher P.	1
Bertenthal, Bennett I.	1
Blanco, Ángel	1
Boldt, Robert F.	1
Bolt, Daniel M.	1
More ▼