ERIC - Search Results

Publication Date

In 2025	3
Since 2024	12
Since 2021 (last 5 years)	42
Since 2016 (last 10 years)	82
Since 2006 (last 20 years)	118

Descriptor

Correlation	152
Item Analysis	152
Test Items	152
Foreign Countries	50
Difficulty Level	42
Item Response Theory	34
Scores	32
Test Construction	32
Statistical Analysis	29
Test Validity	29
Second Language Learning	28
Comparative Analysis	27
Language Tests	27
English (Second Language)	24
Test Reliability	23
Factor Analysis	22
Multiple Choice Tests	22
Test Format	18
Psychometrics	17
Undergraduate Students	17
Achievement Tests	13
Goodness of Fit	13
Second Language Instruction	13
Accuracy	12
College Students	12
More ▼

Publication Type

Reports - Research	132
Journal Articles	108
Speeches/Meeting Papers	15
Tests/Questionnaires	10
Reports - Evaluative	4
Dissertations/Theses -…	3
Reports - Descriptive	3
Books	1
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Higher Education	44
Postsecondary Education	38
Secondary Education	19
Elementary Education	11
High Schools	10
Middle Schools	5
Early Childhood Education	3
Junior High Schools	3
Primary Education	3
Adult Education	2
Grade 10	2
Grade 12	2
Grade 2	2
Grade 5	2
Grade 9	2
Intermediate Grades	2
Elementary Secondary Education	1
Grade 7	1
Grade 8	1
Kindergarten	1
Two Year Colleges	1
More ▼

Audience

Researchers	7
Practitioners	2
Students	1

Location

Turkey	7
Canada	6
Germany	3
Japan	3
South Korea	3
United Kingdom (England)	3
Indonesia	2
Iran	2
Switzerland	2
United States	2
China	1
Colombia	1
Czech Republic	1
Finland	1
France	1
Greece	1
Hong Kong	1
India	1
Ireland	1
Italy	1
Kazakhstan	1
Malaysia	1
Massachusetts	1
Michigan	1
Netherlands	1
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Showing 1 to 15 of 152 results Save | Export

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Direct link

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

Goodman-Kruskal Gamma and Dimension-Corrected Gamma in Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2021

Although Goodman-Kruskal gamma (G) is used relatively rarely it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…

Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Evaluating ChatGPT as a Self-Learning Tool in Medical Biochemistry: A Performance Assessment in Undergraduate Medical University Examination

Peer reviewed

Direct link

Krishna Mohan Surapaneni; Anusha Rajajagadeesan; Lakshmi Goudhaman; Shalini Lakshmanan; Saranya Sundaramoorthi; Dineshkumar Ravi; Kalaiselvi Rajendiran; Porchelvan Swaminathan – Biochemistry and Molecular Biology Education, 2024

The emergence of ChatGPT as one of the most advanced chatbots and its ability to generate diverse data has given room for numerous discussions worldwide regarding its utility, particularly in advancing medical education and research. This study seeks to assess the performance of ChatGPT in medical biochemistry to evaluate its potential as an…

Descriptors: Biochemistry, Science Instruction, Artificial Intelligence, Teaching Methods

To What Extent Are Item Discrimination Values Realistic? A New Index for Two-Dimensional Structures

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Uysal, Ibrahim – International Journal of Assessment Tools in Education, 2022

Most researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the…

Descriptors: Item Analysis, Correlation, Item Response Theory, Test Items

Can People with Higher versus Lower Scores on Impression Management or Self-Monitoring be Identified through Different Traces under Faking?

Peer reviewed

Direct link

Jessica Röhner; Philipp Thoss; Liad Uziel – Educational and Psychological Measurement, 2024

According to faking models, personality variables and faking are related. Most prominently, people's tendency to try to make an appropriate impression (impression management; IM) and their tendency to adjust the impression they make (self-monitoring; SM) have been suggested to be associated with faking. Nevertheless, empirical findings connecting…

Descriptors: Metacognition, Deception, Personality Traits, Scores

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure

Peer reviewed
PDF on ERIC

Download full text

Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022

This study aims to analyze the differential bundle functioning in multidimensional tests with a specific purpose to detect this effect through differentiating the location of the item with DIF in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…

Descriptors: Correlation, Sample Size, Test Items, Item Analysis

Examination of Differential Item Functioning in PISA through Univariate and Multivariate Matching Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Ahmet Yildirim; Nizamettin Koç – International Journal of Assessment Tools in Education, 2024

The present research aims to examine whether the questions in the Program for the International Student Assessment (PISA) 2009 reading literacy instrument display differential item functioning (DIF) among the Turkish, French, and American samples based on univariate and multivariate matching techniques before and after the total score, which is…

Descriptors: Test Items, Item Analysis, Correlation, Error of Measurement

Speed-Accuracy Trade-Off? Not so Fast: Marginal Changes in Speed Have Inconsistent Relationships with Accuracy in Real-World Settings

Peer reviewed
PDF on ERIC

Download full text

Direct link

Domingue, Benjamin W.; Kanopka, Klint; Stenhaug, Ben; Sulik, Michael J.; Beverly, Tanesia; Brinkhuis, Matthieu; Circi, Ruhan; Faul, Jessica; Liao, Dandan; McCandliss, Bruce; Obradovic, Jelena; Piech, Chris; Porter, Tenelle; Soland, James; Weeks, Jon; Wise, Steven L.; Yeatman, Jason – Journal of Educational and Behavioral Statistics, 2022

The speed-accuracy trade-off (SAT) suggests that time constraints reduce response accuracy. Its relevance in observational settings--where response time (RT) may not be constrained but respondent speed may still vary--is unclear. Using 29 data sets containing data from cognitive tasks, we use a flexible method for identification of the SAT (which…

Descriptors: Accuracy, Reaction Time, Task Analysis, College Entrance Examinations

Examining Attribute Relationship Using Diagnostic Classification Models: A Mini Review

Peer reviewed
PDF on ERIC

Download full text

Alallo, Hajir Mahmood Ibrahim; Mohammed, Aisha; Hamid, Zayad Khalaf; Hassan, Aalaa Yaseen; Kadhim, Qasim Khlaif – International Journal of Language Testing, 2023

Diagnostic classification models (DCMs) have recently become very popular both for research purposes and for real testing endeavors for student assessment. A plethora of DCM models give researchers and practitioners a wide range of options for student diagnosis and classification. One intriguing option that some DCM models offer is the possibility…

Descriptors: Language Tests, Diagnostic Tests, Classification, Clinical Diagnosis

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Direct link

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

Answer Changing Behaviors and Performance in a First-Year Medical Gross and Developmental Anatomy Course

Peer reviewed
PDF on ERIC

Download full text

Marli Crabtree; Kenneth L. Thompson; Ellen M. Robertson – HAPS Educator, 2024

Research has suggested that changing one's answer on multiple-choice examinations is more likely to lead to positive academic outcomes. This study aimed to further understand the relationship between changing answer selections and item attributes, student performance, and time within a population of 158 first-year medical students enrolled in a…

Descriptors: Anatomy, Science Tests, Medical Students, Medical Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

Educational and Psychological…	14
ETS Research Report Series	6
International Journal of…	5
International Journal of…	5
Journal of Educational…	4
Journal of Educational and…	4
International Journal of…	3
International Journal of…	3
Journal of Experimental…	3
ProQuest LLC	3
Applied Measurement in…	2
Applied Psychological…	2
CBE - Life Sciences Education	2
Educational Assessment	2
Eurasian Journal of…	2
Grantee Submission	2
International Journal of…	2
International Journal of…	2
Journal of Education and…	2
Language Assessment Quarterly	2
Language Testing	2
Language Testing in Asia	2
Physical Review Physics…	2
American Institutes for…	1
American Journal of…	1
More ▼

Aryadoust, Vahid	3
Reckase, Mark D.	3
Vegelius, Jan	3
Allan S. Cohen	2
Benjamin W. Domingue	2
Circi, Ruhan	2
Gierl, Mark J.	2
Hassan, Aalaa Yaseen	2
Joshua B. Gilbert	2
Kelecioglu, Hülya	2
Kobrin, Jennifer L.	2
Leighton, Jacqueline P.	2
Luke W. Miratrix	2
McKinley, Robert L.	2
McLean, Stuart	2
Metsämuuronen, Jari	2
Mohammed, Aisha	2
Mridul Joshi	2
Acar, Tülin	1
Afif, Al Khateeb Nashaat…	1
Ahmet Yildirim	1
Ahn, Soyeon	1
Ahonen, Timo	1
Akhtar, Hanif	1
More ▼

SAT (College Admission Test)	4
Program for International…	3
Graduate Record Examinations	2
National Assessment of…	2
Test of English as a Foreign…	2
Test of English for…	2
Beck Depression Inventory	1
California Achievement Tests	1
Communication and Symbolic…	1
Comprehensive Tests of Basic…	1
Digit Span Test	1
International English…	1
Iowa Tests of Basic Skills	1
Metropolitan Achievement Tests	1
Motivated Strategies for…	1
NEO Personality Inventory	1
Nelson Denny Reading Tests	1
Peabody Picture Vocabulary…	1
Program for the International…	1
Rosenberg Self Esteem Scale	1
SRA Achievement Series	1
Stanford Achievement Tests	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scale…	1
Woodcock Johnson Tests of…	1
More ▼