ERIC - Search Results

Publication Date

In 2026	0
Since 2025	15
Since 2022 (last 5 years)	43
Since 2017 (last 10 years)	133
Since 2007 (last 20 years)	194

Descriptor

Item Response Theory	219
Test Items	219
Test Reliability	219
Test Validity	110
Foreign Countries	76
Difficulty Level	72
Psychometrics	69
Test Construction	64
Scores	45
Goodness of Fit	35
Item Analysis	35
Scoring	32
Test Bias	29
Multiple Choice Tests	28
Mathematics Tests	25
Science Tests	25
High School Students	24
Computer Assisted Testing	23
Factor Analysis	23
Models	22
Comparative Analysis	21
Elementary School Students	21
Measures (Individuals)	21
Statistical Analysis	21
Undergraduate Students	21

Publication Type

Journal Articles	172
Reports - Research	164
Reports - Evaluative	26
Reports - Descriptive	13
Speeches/Meeting Papers	13
Dissertations/Theses -…	12
Tests/Questionnaires	11
Numerical/Quantitative Data	7
Information Analyses	2
Guides - General	1
Non-Print Media	1
Reference Materials - General	1

Education Level

Secondary Education	49
Higher Education	47
Postsecondary Education	39
Elementary Education	36
High Schools	29
Middle Schools	27
Junior High Schools	21
Early Childhood Education	13
Elementary Secondary Education	11
Intermediate Grades	11
Primary Education	10
Grade 8	9
Grade 5	8
Grade 6	7
Grade 7	6
Grade 1	5
Grade 2	5
Grade 3	5
Grade 4	5
Kindergarten	5
Grade 9	4
Grade 12	2
Adult Education	1
Preschool Education	1

Location

Indonesia	16
Florida	8
Turkey	7
Germany	6
United States	6
Taiwan	5
Australia	4
Iran	4
California	3
Canada	3
China	3
Malaysia	3
New Mexico	3
Nigeria	3
Singapore	3
South Korea	3
Alabama	2
France	2
Japan	2
Oregon	2
South Africa	2
Texas	2
Utah	2
Arizona	1
Asia	1

Laws, Policies, & Programs

Individuals with Disabilities…

Showing 1 to 15 of 219 results

Another Look at Yen's Q3: Is 0.2 an Appropriate Cut-Off?

Peer reviewed

Kelsey Nason; Christine DeMars – Journal of Educational Measurement, 2025

This study examined the widely used threshold of 0.2 for Yen's Q3, an index for violations of local independence. Specifically, a simulation was conducted to investigate whether Q3 values were related to the magnitude of bias in estimates of reliability, item parameters, and examinee ability. Results showed that Q3 values below the typical cut-off…

Descriptors: Item Response Theory, Statistical Bias, Test Reliability, Test Items

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

The Contribution of Rasch Modeling on Measuring Attitudes for Better Classroom Assessment

Peer reviewed

Akif Avcu – International Journal of Psychology and Educational Studies, 2025

This review explores the significant contributions of Rasch modeling in enhancing classroom assessment practices, particularly in measuring student attitudes. Classroom assessment has evolved from standardized testing to integrative practices that emphasize both academic and affective dimensions of student development. Accurate attitude…

Descriptors: Item Response Theory, Student Attitudes, Student Evaluation, Attitude Measures

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Psychometric Properties of the Academic Procrastination Scale in an Iranian Sample

Peer reviewed

Mahdi Ghorbankhani; Keyvan Salehi – SAGE Open, 2025

Academic procrastination, the tendency to delay academic tasks without reasonable justification, has significant implications for students' academic performance and overall well-being. To measure this construct, numerous scales have been developed, among which the Academic Procrastination Scale (APS) has shown promise in assessing academic…

Descriptors: Psychometrics, Measures (Individuals), Time Management, Foreign Countries

How Many Response Categories Are Sufficient for Likert Type Scales? An Empirical Study Based on the Item Response Theory

Peer reviewed

Aybek, Eren Can; Toraman, Cetin – International Journal of Assessment Tools in Education, 2022

The current study investigates the optimum number of response categories for Likert-type scales under item response theory (IRT). The data were collected from university students, mainly from the faculty of medicine and the faculty of education. A form of the "Social Gender Equity Scale" developed by Gozutok et al. (2017)…

Descriptors: Likert Scales, Item Response Theory, College Students, Test Reliability

Psychometric Analysis of the Resonance Concept Inventory

Peer reviewed

Grace C. Tetschner; Sachin Nedungadi – Chemistry Education Research and Practice, 2025

Many undergraduate chemistry students hold alternate conceptions related to resonance--an important and fundamental topic of organic chemistry. To help address these alternate conceptions, an organic chemistry instructor could administer the resonance concept inventory (RCI), which is a multiple-choice assessment that was designed to identify…

Descriptors: Scientific Concepts, Concept Formation, Item Response Theory, Scores

A Novel Examination of None-of-the-Above as It Influences Examinee Item Responses

Thompson, Kathryn N. – ProQuest LLC, 2023

It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…

Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores

Application of the Rasch Model in Streamlining an Instrument Measuring Depression among College Students

Peer reviewed

Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023

Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…

Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability

Validity and Reliability Analysis of a Socioscientific Issues-Based Critical Thinking Self-Assessment Instrument Using the Rasch Model

Peer reviewed

Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025

Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool are still questionable, so a more objective evaluation is needed. The objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…

Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity

Measuring Up: Rasch Analysis of English Reading Comprehension Test for Informal Education Learners

Peer reviewed

Arandha May Rachmawati; Agus Widyantoro – English Language Teaching Educational Journal, 2025

This study aims to evaluate the quality of English reading comprehension test instruments used in informal learning, especially as English literacy tests. With a quantitative approach, the analysis was carried out using the Rasch model through the Quest program on 30 multiple-choice questions given to 30 grade IX students from informal educational…

Descriptors: Item Response Theory, Reading Tests, Reading Comprehension, English (Second Language)

Item Response Theory Modeling of the Verb Naming Test

Peer reviewed

Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023

Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…

Descriptors: Item Response Theory, Psychometrics, Verbs, Naming

Source

ProQuest LLC	12
Educational and Psychological…	8
Grantee Submission	7
Journal of Educational…	7
Online Submission	7
SAGE Open	6
ETS Research Report Series	5
International Journal of…	5
Applied Psychological…	4
Journal of Baltic Science…	4
Journal of Psychoeducational…	4
International Journal of…	3
International Journal of…	3
International Journal of…	3
Journal of Intelligence	3
Journal of Speech, Language,…	3
Language Testing	3
Psychometrika	3
Applied Measurement in…	2
Assessment for Effective…	2
Chemistry Education Research…	2
College Board	2
Cypriot Journal of…	2
EURASIA Journal of…	2
Educational Measurement:…	2

Author

Schoen, Robert C.	6
Anderson, Daniel	4
Petscher, Yaacov	4
Bauduin, Charity	3
Boone, William J.	3
Paek, Insu	3
Yang, Xiaotong	3
Zhang, Jinming	3
Bichi, Ado Abdu	2
Brown, Ted	2
Dogan, Nuri	2
Edwards, Michael C.	2
Guo, Hongwen	2
Hartig, Johannes	2
Istiyono, Edi	2
Lee, Yi-Hsuan	2
Lee, Young-Sun	2
Liu, Sicong	2
Meijer, Rob R.	2
Mike Stieff	2
Myszkowski, Nils	2
Nicewander, W. Alan	2
Retnawati, Heri	2
Segall, Daniel O.	2

Assessments and Surveys

Graduate Record Examinations	5
SAT (College Admission Test)	4
ACT Assessment	2
Iowa Tests of Basic Skills	2
Stanford Achievement Tests	2
Test of English as a Foreign…	2
Trends in International…	2
Armed Forces Qualification…	1
Bruininks Oseretsky Test of…	1
Center for Epidemiologic…	1
Child Behavior Checklist	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Hidden Figures Test	1
Kaufman Test of Educational…	1
MacArthur Communicative…	1
Measures of Academic Progress	1
National Assessment of…	1
Peabody Picture Vocabulary…	1
Preliminary Scholastic…	1
Program for International…	1
Raven Progressive Matrices	1
Student Teacher Relationship…	1
Wechsler Adult Intelligence…	1