Publication Date

| Date range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 9 |
| Since 2022 (last 5 years) | 112 |
| Since 2017 (last 10 years) | 216 |
| Since 2007 (last 20 years) | 377 |
Descriptor

| Descriptor | Count |
| --- | --- |
| Comparative Analysis | 598 |
| Item Analysis | 598 |
| Test Items | 230 |
| Foreign Countries | 182 |
| Scores | 103 |
| Item Response Theory | 98 |
| Statistical Analysis | 97 |
| Correlation | 93 |
| Test Construction | 86 |
| Factor Analysis | 83 |
| Difficulty Level | 80 |
Author

| Author | Count |
| --- | --- |
| Hambleton, Ronald K. | 5 |
| Weiss, David J. | 4 |
| Bashaw, W. L. | 3 |
| Benson, Jeri | 3 |
| Blanton, Maria | 3 |
| Facon, Bruno | 3 |
| Gongjun Xu | 3 |
| Haladyna, Tom | 3 |
| Knuth, Eric | 3 |
| Lord, Frederic M. | 3 |
| Reckase, Mark D. | 3 |
Audience

| Audience | Count |
| --- | --- |
| Researchers | 15 |
| Practitioners | 4 |
| Teachers | 4 |
| Students | 2 |
| Policymakers | 1 |
Location

| Location | Count |
| --- | --- |
| Australia | 13 |
| China | 13 |
| Germany | 13 |
| Turkey | 13 |
| Canada | 8 |
| United Kingdom | 8 |
| United Kingdom (England) | 8 |
| United States | 8 |
| Indonesia | 7 |
| Iran | 7 |
| Japan | 7 |
Laws, Policies, & Programs

| Law/Program | Count |
| --- | --- |
| No Child Left Behind Act 2001 | 3 |
| Individuals with Disabilities… | 1 |
Seyda Aydin-Karaca; Mustafa Serdar Köksal; Bilkay Bi – Journal of Psychoeducational Assessment, 2024
This study aimed to develop a parent rating scale (PRSG) for screening children for giftedness as part of a further identification process. The participants were 255 parents of gifted and non-gifted students. The PRSG, consisting of 30 items, was created by consulting parents and reviewing existing instruments in the literature. As…
Descriptors: Rating Scales, Parent Attitudes, Scores, Comparative Analysis
Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024
To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…
Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement
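For orientation, here is a minimal sketch of classical (unweighted) characteristic-curve linking for the 2PL model; the information-weighted methods the abstract refers to modify the loss function below. All parameter values and the `haebara_loss` helper are illustrative assumptions, not the authors' code.

```python
# Sketch of classical Haebara characteristic-curve linking for the 2PL model.
import numpy as np
from scipy.optimize import minimize

def p2pl(theta, a, b):
    """2PL item response function."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def haebara_loss(AB, a_new, b_new, a_old, b_old, theta_grid):
    A, B = AB
    # Place the new-form parameter estimates on the old-form scale:
    # theta_old = A * theta_new + B implies a* = a/A and b* = A*b + B.
    a_t, b_t = a_new / A, A * b_new + B
    diff = p2pl(theta_grid[:, None], a_t, b_t) - p2pl(theta_grid[:, None], a_old, b_old)
    return np.sum(diff ** 2)

# Toy parameter estimates for common items on two forms (hypothetical).
a_old = np.array([1.0, 1.2, 0.8]); b_old = np.array([-0.5, 0.0, 0.7])
a_new = np.array([0.9, 1.1, 0.85]); b_new = np.array([-0.3, 0.2, 0.9])
grid = np.linspace(-4, 4, 81)

res = minimize(haebara_loss, x0=[1.0, 0.0], args=(a_new, b_new, a_old, b_old, grid))
A_hat, B_hat = res.x
print(f"Slope A = {A_hat:.3f}, intercept B = {B_hat:.3f}")
```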
Finch, Holmes – Applied Measurement in Education, 2022
Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous…
Descriptors: Comparative Analysis, Item Response Theory, Item Analysis, Simulation
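As a concrete illustration of DIF detection by conditioning on an observed proxy for the latent trait, below is a hedged Mantel-Haenszel sketch; this is a standard approach, not necessarily the method Finch evaluates, and the data and the `mantel_haenszel_dif` helper are invented.

```python
# Mantel-Haenszel DIF sketch: stratify by total score, compare item
# correctness across a reference and a focal group within each stratum.
import numpy as np

def mantel_haenszel_dif(item, total, group):
    """item: 0/1 responses; total: matching scores; group: 0=reference, 1=focal."""
    num, den = 0.0, 0.0
    for s in np.unique(total):
        m = total == s
        ref, foc = item[m & (group == 0)], item[m & (group == 1)]
        n = len(ref) + len(foc)
        if len(ref) == 0 or len(foc) == 0:
            continue
        A, B = ref.sum(), len(ref) - ref.sum()   # reference: right / wrong
        C, D = foc.sum(), len(foc) - foc.sum()   # focal: right / wrong
        num += A * D / n
        den += B * C / n
    return num / den  # common odds ratio; 1.0 indicates no DIF

rng = np.random.default_rng(0)
n = 2000
group = rng.integers(0, 2, n)
theta = rng.normal(0, 1, n)
items = (rng.random((n, 10)) < 1 / (1 + np.exp(-theta[:, None]))).astype(int)
total = items.sum(axis=1)
print("MH odds ratio for item 0:", mantel_haenszel_dif(items[:, 0], total, group))
```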
Carmen Batanero; Luis A. Hernandez-Solis; Maria M. Gea – Statistics Education Research Journal, 2023
We present an exploratory study of Costa Rican and Spanish students' (11-16 years old) competence in comparing probabilities in urn problems and comparing ratios in mixture problems. A sample of 704 students in Grades 6 through 10, 292 from Costa Rica and 412 from Spain, was given one of two forms of a questionnaire with three probability comparison…
Descriptors: Statistics Education, Comparative Analysis, Foreign Countries, Probability
Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022
This article examines the performance of item response theory (IRT) models when double ratings, rather than single ratings, are used as item scores in the presence of rater effects. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…
Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy
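A minimal sketch of the GPCM category probabilities underlying the study, assuming the usual parameterization with step difficulties; the numeric values are made up.

```python
# Generalized partial credit model (GPCM) category probabilities.
import numpy as np

def gpcm_probs(theta, a, b_steps):
    """P(X = k | theta) for k = 0..K, with step difficulties b_steps (length K)."""
    # Cumulative sums of a*(theta - b_k); category 0 has an empty sum (0).
    z = np.concatenate([[0.0], np.cumsum(a * (theta - np.asarray(b_steps)))])
    ez = np.exp(z - z.max())          # stabilized softmax
    return ez / ez.sum()

print(gpcm_probs(theta=0.5, a=1.2, b_steps=[-1.0, 0.0, 1.0]))
```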
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone for interpreting PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
Schaper, Marie Luisa; Kuhlmann, Beatrice G.; Bayen, Ute J. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2023
Item memory and source memory are different aspects of episodic remembering. To investigate metamemory differences between them, the authors assessed systematic differences between predictions of item memory via Judgments of Learning (JOLs) and predictions of source memory via Judgments of Source (JOSs). Schema-based expectations affect JOLs and JOSs…
Descriptors: Memory, Metacognition, Schemata (Cognition), Prediction
Schröder, Jette; Schmiedeberg, Claudia – Sociological Methods & Research, 2023
Although third parties are present during a substantial share of face-to-face interviews, bystander influence on respondents' response behavior is not yet fully understood. We use nine waves of the German Family Panel "pairfam" and apply fixed-effects panel regression models to analyze effects of third-party presence on…
Descriptors: Housework, Item Analysis, Interpersonal Relationship, Responses
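For readers unfamiliar with the method, a rough sketch of a fixed-effects (within) panel regression follows; the variable names and synthetic data are assumptions for illustration, not the pairfam variables.

```python
# Fixed-effects estimation via the within transformation: demean outcome and
# regressor by person, then run OLS on the demeaned data.
import numpy as np
import pandas as pd

rng = np.random.default_rng(1)
n_persons, n_waves = 200, 9
df = pd.DataFrame({
    "person": np.repeat(np.arange(n_persons), n_waves),
    "third_party_present": rng.integers(0, 2, n_persons * n_waves),
})
df["response"] = 0.3 * df["third_party_present"] + rng.normal(0, 1, len(df))

demeaned = df.groupby("person")[["response", "third_party_present"]].transform(
    lambda s: s - s.mean()
)
x = demeaned["third_party_present"].to_numpy()
y = demeaned["response"].to_numpy()
beta = (x @ y) / (x @ x)  # single-regressor OLS slope
print(f"FE estimate of third-party effect: {beta:.3f}")
```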
Wu, Tong – ProQuest LLC, 2023
This three-article dissertation addresses three methodological challenges to ensuring comparability in educational research: scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…
Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores
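One of the challenges named here, propensity score weighting, can be sketched with inverse-propensity weighting on synthetic data; everything below (covariates, effect size) is hypothetical.

```python
# Inverse-propensity weighting (IPW): estimate P(treatment | covariates),
# weight each unit by the inverse of its probability of its observed group.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(2)
n = 5000
x = rng.normal(size=(n, 3))                     # covariates
p_true = 1 / (1 + np.exp(-(x[:, 0] - 0.5 * x[:, 1])))
treat = rng.random(n) < p_true
y = 1.0 * treat + x[:, 0] + rng.normal(size=n)  # outcome with true effect 1.0

ps = LogisticRegression().fit(x, treat).predict_proba(x)[:, 1]
w = np.where(treat, 1 / ps, 1 / (1 - ps))       # IPW weights
ate = np.average(y[treat], weights=w[treat]) - np.average(y[~treat], weights=w[~treat])
print(f"IPW estimate of treatment effect: {ate:.3f} (true = 1.0)")
```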
Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. Their objective is to gain information about the latent semantic space of a set of related textual data, which contains the relationships between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
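A quick, hedged illustration of fitting a topic model, using scikit-learn's latent Dirichlet allocation on a toy corpus; the corpus and settings are placeholders, not the authors' setup.

```python
# Fit a two-topic LDA model to a tiny corpus and print the top words per topic.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "students answered the constructed response item",
    "raters scored the essay responses for the assessment",
    "the model estimates latent topics from word counts",
    "topic models summarize the semantic space of documents",
]
counts = CountVectorizer(stop_words="english").fit(docs)
X = counts.transform(docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)

terms = counts.get_feature_names_out()
for k, comp in enumerate(lda.components_):
    top = [terms[i] for i in comp.argsort()[-4:][::-1]]
    print(f"Topic {k}: {', '.join(top)}")
```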
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper employs the Many-Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). The participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
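The core of the Many-Facet Rasch Model for dichotomous scores can be sketched as an additive logit of ability, item difficulty, and rater severity; the parameter values below are invented for illustration.

```python
# Many-Facet Rasch Model (dichotomous case):
# logit P(correct) = ability - item difficulty - rater severity.
import numpy as np

def mfrm_prob(theta, item_difficulty, rater_severity):
    """Probability of an accepted response under a simple MFRM."""
    logit = theta - item_difficulty - rater_severity
    return 1 / (1 + np.exp(-logit))

# Same examinee and item, judged by a lenient vs. a severe rater.
print(mfrm_prob(0.8, 0.2, -0.5))  # lenient rater -> higher probability
print(mfrm_prob(0.8, 0.2, 0.7))   # severe rater -> lower probability
```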
Celen, Umit; Aybek, Eren Can – International Journal of Assessment Tools in Education, 2022
Item analysis is performed by developers as an integral part of the scale development process: items are excluded from the scale on the basis of item analysis before factor analysis is conducted. Existing item discrimination indices are calculated from correlations, yet items with different response patterns are likely to have a similar item…
Descriptors: Likert Scales, Factor Analysis, Item Analysis, Correlation
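A small sketch of the correlation-based discrimination index the abstract alludes to, the corrected item-total correlation, computed on simulated dichotomous responses; the data are synthetic.

```python
# Corrected item-total correlation: correlate each item with the total score
# computed from the remaining items, so the item does not inflate its own index.
import numpy as np

def corrected_item_total(items):
    """Correlation of each item with the total score excluding that item."""
    items = np.asarray(items, dtype=float)
    total = items.sum(axis=1)
    return np.array([
        np.corrcoef(items[:, j], total - items[:, j])[0, 1]
        for j in range(items.shape[1])
    ])

rng = np.random.default_rng(3)
theta = rng.normal(size=500)
items = (rng.random((500, 8)) < 1 / (1 + np.exp(-theta[:, None]))).astype(int)
print(np.round(corrected_item_total(items), 2))
```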
Erdem-Kara, Basak; Dogan, Nuri – International Journal of Assessment Tools in Education, 2022
Recently, adaptive test approaches have become a viable alternative to traditional fixed-item tests. The main advantage of adaptive tests is that they reach the desired measurement precision with fewer items. However, fewer items mean that each item has a greater effect on ability estimation, and therefore such tests are open to more…
Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Test Construction
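To make the trade-off concrete, here is a bare-bones sketch of maximum-information item selection, the standard mechanism behind adaptive tests; the item bank and ability estimate are invented.

```python
# Maximum-information item selection for a 2PL item bank: administer the item
# with the highest Fisher information at the current ability estimate.
import numpy as np

def info_2pl(theta, a, b):
    """Fisher information of 2PL items at ability theta."""
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return a**2 * p * (1 - p)

a = np.array([0.8, 1.5, 1.1, 2.0])   # discriminations
b = np.array([-1.0, 0.0, 0.4, 1.2])  # difficulties
theta_hat = 0.3                       # current ability estimate

next_item = int(np.argmax(info_2pl(theta_hat, a, b)))
print(f"Administer item {next_item} (highest information at theta = {theta_hat})")
```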
Sahin Kursad, Merve; Cokluk Bokeoglu, Omay; Cikrikci, Rahime Nukhet – International Journal of Assessment Tools in Education, 2022
Item parameter drift (IPD) is the systematic change of item parameter values over time due to various causes. If it occurs in computerized adaptive tests (CAT), it introduces errors into the estimation of item and ability parameters. Identifying the conditions under which IPD arises in CAT is important for estimating item and…
Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Error of Measurement
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality, and the quality of AI-generated MCIs is comparable to that of items written by human experts. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
