ERIC - Search Results

Publication Date

In 2025	37
Since 2024	160
Since 2021 (last 5 years)	583
Since 2016 (last 10 years)	1218
Since 2006 (last 20 years)	2724

Descriptor

Item Analysis	5124
Test Items	1478
Foreign Countries	1210
Test Construction	1166
Test Validity	945
Factor Analysis	918
Test Reliability	886
Psychometrics	759
Correlation	697
Statistical Analysis	645
Comparative Analysis	595
Measures (Individuals)	561
Item Response Theory	518
Scores	488
Difficulty Level	483
Questionnaires	478
Evaluation Methods	468
Student Attitudes	427
Achievement Tests	411
Higher Education	401
Multiple Choice Tests	385
Measurement Techniques	372
Validity	342
Models	338
Reliability	334
More ▼

Education Level

Higher Education	841
Postsecondary Education	566
Secondary Education	411
Elementary Education	305
Elementary Secondary Education	205
High Schools	187
Middle Schools	171
Junior High Schools	116
Adult Education	103
Early Childhood Education	98
Grade 8	65
Grade 4	62
Intermediate Grades	59
Grade 5	58
Grade 6	47
Primary Education	45
Grade 7	43
Preschool Education	43
Kindergarten	37
Grade 3	32
Grade 9	28
Grade 10	23
Grade 12	22
Grade 1	18
Grade 11	16
More ▼

Audience

Researchers	169
Practitioners	49
Teachers	32
Administrators	8
Policymakers	8
Counselors	4
Students	4
Media Staff	1

Location

Turkey	172
Australia	81
Canada	79
China	68
United States	55
Germany	43
Taiwan	43
Japan	40
United Kingdom	38
Iran	36
Spain	33
California	30
Indonesia	30
Netherlands	28
South Korea	28
United Kingdom (England)	28
Hong Kong	26
India	25
Florida	24
New York	24
Malaysia	23
Singapore	20
Israel	18
Nigeria	18
Pennsylvania	17
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	18
Individuals with Disabilities…	15
Elementary and Secondary…	5
Elementary and Secondary…	2
Every Student Succeeds Act…	2
Deferred Action for Childhood…	1
Education Amendments 1974	1
Education Consolidation…	1
Emergency School Aid Act 1972	1
Individuals with Disabilities…	1
National Defense Education Act	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1
Does not meet standards	1

Showing 91 to 105 of 5,124 results Save | Export

Assessment of Large Language Models' Performances and Hallucinations for Chinese Postgraduate Medical Entrance Examination

Peer reviewed

Direct link

Hongfei Ye; Jian Xu; Danqing Huang; Meng Xie; Jinming Guo; Junrui Yang; Haiwei Bao; Mingzhi Zhang; Ce Zheng – Discover Education, 2025

This study evaluates Large language models (LLMs)' performance on Chinese Postgraduate Medical Entrance Examination (CPGMEE) as well as the hallucinations produced by LLMs and investigate their implications for medical education. We curated 10 trials of mock CPGMEE to evaluate the performances of 4 LLMs (GPT-4.0, ChatGPT, QWen 2.1 and Ernie 4.0).…

Descriptors: College Entrance Examinations, Foreign Countries, Computational Linguistics, Graduate Medical Education

An Approach to Test Equating under the Latent "D"-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021

This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study under a 3 × 3 design by two conditions: group ability at three levels and test difficulty at three levels. The results for…

Descriptors: Equated Scores, Scoring, Test Items, Accuracy

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

Using Item Scores and Distractors to Detect Item Compromise and Preknowledge

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023

Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…

Descriptors: Scores, Test Validity, Test Items, Prior Learning

Examining the Psychometric Integrity of the Social Skills Improvement System Teacher Rating Scale Scores for a Sample of Preschool-Age Children

Peer reviewed

Direct link

Huang, Ke; Conroy, Maureen A.; Snyder, Patricia A.; Miller, David; Sutherland, Kevin S. – Assessment for Effective Intervention, 2023

The Social Skills Improvement System-Teacher Rating Scale (SSIS-TRS) has been widely used to measure the social skills and behaviors of children and adolescents that are challenging. Studies examining the psychometric properties of the SSIS-TRS have been conducted, but the dimensional structure and item properties of the SSIS-TRS have not been…

Descriptors: Psychometrics, Integrity, Interpersonal Competence, Rating Scales

A Comparison of Aggregation Rules for Selecting Anchor Items in Multigroup DIF Analysis

Peer reviewed

Direct link

Huelmann, Thorben; Debelak, Rudolf; Strobl, Carolin – Journal of Educational Measurement, 2020

This study addresses the topic of how anchoring methods for differential item functioning (DIF) analysis can be used in multigroup scenarios. The direct approach would be to combine anchoring methods developed for two-group scenarios with multigroup DIF-detection methods. Alternatively, multiple tests could be carried out. The results of these…

Descriptors: Test Items, Test Bias, Equated Scores, Item Analysis

Environmental Values and Education in Spanish Universities: A Questionnaire Validation

Peer reviewed

Direct link

Clara Margaça; José Carlos Sánchez-García; Brizeida Hernández Sánchez; Susana Lucas Mangas – International Journal of Sustainability in Higher Education, 2024

Purpose: To protect the environment and society, research on responsible behavior and personal values has increased. Values have been identified as important for understanding and predicting environmental preservation behaviors. The purpose of this study is to analyze the validity and reliability of the Environmental Portrait Value Questionnaire…

Descriptors: Universities, Conservation (Environment), Altruism, Self Concept

Adaptation and Development of Parent Rating Scale for Giftedness

Peer reviewed

Direct link

Seyda Aydin-Karaca; Mustafa Serdar Köksal; Bilkay Bi – Journal of Psychoeducational Assessment, 2024

This study aimed to develop a parent rating scale (PRSG) for screening children for further identification process in terms of giftedness. The participants of the study were 255 parents of gifted and non-gifted students. The PRSG, consisting of 30 items, was created by consulting parents and reviewing instruments existent in the literature. As…

Descriptors: Rating Scales, Parent Attitudes, Scores, Comparative Analysis

Development of Computer-Based Chemical Five-Tier Diagnostic Test Instruments: A Generalized Partial Credit Model

Peer reviewed
PDF on ERIC

Download full text

Achmad Rante Suparman; Eli Rohaeti; Sri Wening – Journal on Efficiency and Responsibility in Education and Science, 2024

This study focuses on developing a five-tier chemical diagnostic test based on a computer-based test with 11 assessment categories with an assessment score from 0 to 10. A total of 20 items produced were validated by education experts, material experts, measurement experts, and media experts, and an average index of the Aiken test > 0.70 was…

Descriptors: Chemistry, Diagnostic Tests, Computer Assisted Testing, Credits

An Improved Inferential Procedure to Evaluate Item Discriminations in a Conditional Maximum Likelihood Framework

Peer reviewed

Direct link

Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024

A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…

Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Does Acquiescence Disagree with Measurement Invariance Testing?

Peer reviewed

Direct link

E. Damiano D'Urso; Jesper Tijmstra; Jeroen K. Vermunt; Kim De Roover – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Measurement invariance (MI) is required for validly comparing latent constructs measured by multiple ordinal self-report items. Non-invariances may occur when disregarding (group differences in) an acquiescence response style (ARS; an agreeing tendency regardless of item content). If non-invariance results solely from neglecting ARS, one should…

Descriptors: Error of Measurement, Structural Equation Models, Construct Validity, Measurement Techniques

Community-Guided, Autism-Adapted Group Cognitive Behavioral Therapy for Depression in Autistic Youth (CBT-DAY): Preliminary Feasibility, Acceptability, and Efficacy

Peer reviewed

Direct link

Jessica M. Schwartzman; Marissa C. Roth; Ann V. Paterson; Alexandra X. Jacobs; Zachary J. Williams – Autism: The International Journal of Research and Practice, 2024

This study examined the preliminary feasibility, acceptability, and efficacy of an autism-adapted cognitive behavioral therapy for depression in autistic youth, CBT-DAY. Twenty-four autistic youth (11-17 years old) participated in the pilot non-randomized trial including 5 cisgender females, 14 cisgender males, and 5 non-binary youth. Youth…

Descriptors: Autism Spectrum Disorders, Youth, Depression (Psychology), Cognitive Restructuring

Are You…? Asking Questions on Sex with a Third Category in Germany

Peer reviewed

Direct link

Hadler, Patricia; Neuert, Cornelia E.; Ortmanns, Verena; Stiegler, Angelika – Field Methods, 2022

A question asking for respondents' sex is one of the standard sociodemographic characteristics collected in a survey. Until now, it typically consisted of a simple question (e.g., "Are you…?") with two answer categories ("male" and "female"). In 2019, Germany implemented the additional sex designation divers for…

Descriptors: Foreign Countries, Gender Differences, Sex, Surveys

Comparison of Methods for Identifying Differential Step Functioning with Polytomous Item Response Data

Peer reviewed

Direct link

Finch, Holmes – Applied Measurement in Education, 2022

Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous…

Descriptors: Comparative Analysis, Item Response Theory, Item Analysis, Simulation

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 342

Educational and Psychological…	274
Journal of Educational…	132
ProQuest LLC	125
Online Submission	72
Psychometrika	70
Applied Psychological…	60
Journal of Psychoeducational…	59
Language Testing	45
Applied Measurement in…	40
Educ Psychol Meas	34
ETS Research Report Series	33
Journal of Experimental…	33
Educational Measurement:…	31
Journal of Experimental…	31
Measurement and Evaluation in…	31
Journal of Consulting and…	29
Educational Sciences: Theory…	28
Journal of Educational and…	28
Grantee Submission	27
Psychological Assessment	27
International Journal of…	26
International Journal of…	24
Physical Review Physics…	24
Research on Social Work…	24
Multivariate Behavioral…	23
More ▼

Hambleton, Ronald K.	21
Reckase, Mark D.	20
Tindal, Gerald	15
Weiss, David J.	14
Lord, Frederic M.	13
Bart, William M.	12
Plake, Barbara S.	12
van der Linden, Wim J.	12
Alonzo, Julie	11
Dorans, Neil J.	11
Samejima, Fumiko	10
Dawis, Rene V.	9
Harnisch, Delwyn L.	9
McKinley, Robert L.	9
Rudner, Lawrence M.	9
Wainer, Howard	9
Wright, Benjamin D.	9
Angoff, William H.	8
Baker, Eva L.	8
Gierl, Mark J.	8
Matson, Johnny L.	8
Raykov, Tenko	8
Sireci, Stephen G.	8
More ▼

Reports - Research	3125
Journal Articles	3104
Reports - Evaluative	626
Speeches/Meeting Papers	429
Reports - Descriptive	298
Tests/Questionnaires	266
Dissertations/Theses -…	127
Information Analyses	85
Numerical/Quantitative Data	85
Guides - Non-Classroom	66
Opinion Papers	54
Books	16
Guides - Classroom - Teacher	15
Guides - General	13
Collected Works - General	11
Reports - General	7
Reference Materials -…	6
Non-Print Media	5
Collected Works - Serials	4
Computer Programs	4
Collected Works - Proceedings	3
Dissertations/Theses	3
Guides - Classroom - Learner	3
Historical Materials	3
Legal/Legislative/Regulatory…	3
More ▼

SAT (College Admission Test)	53
Program for International…	48
National Assessment of…	40
Trends in International…	31
Test of English as a Foreign…	30
Minnesota Multiphasic…	23
California Achievement Tests	22
Graduate Record Examinations	22
Stanford Achievement Tests	22
Iowa Tests of Basic Skills	21
Wechsler Intelligence Scale…	19
ACT Assessment	18
Peabody Picture Vocabulary…	18
Metropolitan Achievement Tests	15
Raven Progressive Matrices	14
Wechsler Adult Intelligence…	14
Armed Services Vocational…	13
Stanford Binet Intelligence…	13
Beck Depression Inventory	10
National Longitudinal Study…	9
Autism Diagnostic Observation…	8
Comprehensive Tests of Basic…	8
Eysenck Personality Inventory	7
International English…	7
Pennsylvania Educational…	7
More ▼