Showing 61 to 75 of 300 results
Peer reviewed
Carlson, Janet F.; Geisinger, Kurt F. – International Journal of Testing, 2012
The test review process used by the Buros Center for Testing is described as a series of 11 steps: (1) identifying tests to be reviewed, (2) obtaining tests and preparing test descriptions, (3) determining whether tests meet review criteria, (4) identifying appropriate reviewers, (5) selecting reviewers, (6) sending instructions and materials to…
Descriptors: Testing, Test Reviews, Evaluation Methods, Evaluation Criteria
Goldhaber, Dan; Chaplin, Duncan – Center for Education Data & Research, 2012
In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…
Descriptors: School Effectiveness, Teacher Effectiveness, Achievement Gains, Statistical Bias
Sawchuk, Stephen – Education Week, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Test Items, Federal Legislation, Scoring, Accountability
Peer reviewed
Chiavaroli, Neville; Familari, Mary – Bioscience Education, 2011
This paper outlines the use of item analysis to assist examiners in evaluating the quality and validity of their MCQ exam questions. The generation of item analysis, particularly the discrimination index, has long been an established practice in professional testing and credentialing organisations and in some disciplines in tertiary education, but its use…
Descriptors: Self Actualization, Time Management, Audiences, Museums
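The Chiavaroli and Familari entry above refers to the discrimination index used in classical item analysis of MCQ exams. As a hedged illustration only, not material from the paper itself, the Python sketch below computes two commonly reported discrimination statistics for dichotomous items: the item-restscore point-biserial correlation and the upper-lower (27%) difference index. The response matrix is hypothetical.

import numpy as np

def discrimination_indices(responses):
    # responses: 2-D 0/1 array, rows = examinees, columns = items (1 = correct)
    responses = np.asarray(responses, dtype=float)
    n_persons, n_items = responses.shape
    total = responses.sum(axis=1)
    k = max(1, int(round(0.27 * n_persons)))      # conventional 27% split
    order = np.argsort(total)
    lower, upper = order[:k], order[-k:]
    point_biserial = np.empty(n_items)
    upper_lower_d = np.empty(n_items)
    for j in range(n_items):
        rest = total - responses[:, j]             # restscore avoids item-total overlap
        point_biserial[j] = np.corrcoef(responses[:, j], rest)[0, 1]
        upper_lower_d[j] = responses[upper, j].mean() - responses[lower, j].mean()
    return point_biserial, upper_lower_d

# Hypothetical data: 6 examinees, 4 items.
demo = np.array([[1, 0, 1, 1],
                 [1, 1, 1, 0],
                 [0, 0, 1, 0],
                 [1, 1, 1, 1],
                 [0, 0, 0, 0],
                 [1, 0, 1, 1]])
pb, d = discrimination_indices(demo)
print("point-biserial vs restscore:", np.round(pb, 2))
print("upper-lower D (27% groups):", np.round(d, 2))

Items with values near zero (or negative) on either statistic are the ones item analysis would flag for review.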
Peer reviewed
Dyson, Ben; Placek, Judith H.; Graber, Kim C.; Fisette, Jennifer L.; Rink, Judy; Zhu, Weimo; Avery, Marybell; Franck, Marian; Fox, Connie; Raynes, De; Park, Youngsik – Measurement in Physical Education and Exercise Science, 2011
This article describes how assessments in PE Metrics were developed following six steps: (a) determining test blueprint, (b) writing assessment tasks and scoring rubrics, (c) establishing content validity, (d) piloting assessments, (e) conducting item analysis, and (f) modifying the assessments based on analysis and expert opinion. A task force,…
Descriptors: Expertise, Evidence, Physical Education, Elementary Education
Peer reviewed
Muniz, Jose; Fernandez-Hermida, Jose R.; Fonseca-Pedrero, Eduardo; Campillo-Alvarez, Angela; Pena-Suarez, Elsa – International Journal of Testing, 2012
The proper use of psychological tests requires that the measurement instruments have adequate psychometric properties, such as reliability and validity, and that the professionals who use the instruments have the necessary expertise. In this article, we present the first review of tests published in Spain, carried out with an assessment model…
Descriptors: Student Evaluation, Measurement, Foreign Countries, Psychometrics
Peer reviewed
Little, Mary E. – Educational Forum, 2012
The purpose of this article is to define and clarify the process of instructional problem-solving using assessment data within action research (AR) and Response to Intervention (RtI). Similarities between AR and RtI are defined and compared. Lastly, specific resources and examples of the instructional problem-solving process of AR within…
Descriptors: Intervention, Action Research, Problem Solving, Data Analysis
Peer reviewed
el-Guebaly, Nady; Violato, Claudio – Substance Abuse, 2011
The experience of the International Society of Addiction Medicine in setting up the first international certification of clinical knowledge is reported. The steps followed and the results of a psychometric analysis of the tests from the first 65 candidates are reported. Lessons learned in the first 5 years and challenges for the future are…
Descriptors: Psychometrics, Certification, Substance Abuse, Medicine
Peer reviewed
Wang, Wen-Chung; Huang, Sheng-Yun – Educational and Psychological Measurement, 2011
The one-parameter logistic model with ability-based guessing (1PL-AG) has recently been developed to account for the effect of ability on guessing behavior in multiple-choice items. In this study, the authors developed algorithms for computerized classification testing under the 1PL-AG and conducted a series of simulations to evaluate their…
Descriptors: Computer Assisted Testing, Classification, Item Analysis, Probability
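The Wang and Huang entry above concerns classification algorithms built on the 1PL-AG model. The sketch below is not the authors' procedure and substitutes a plain Rasch (1PL) item response function for the 1PL-AG, whose exact form is not reproduced here; it only illustrates the general sequential probability ratio test (SPRT) idea behind computerized classification testing. The cut region, error rates, item bank, and examinee are hypothetical.

import math
import random

def p_correct(theta, b):
    # Rasch (1PL) item response function; a stand-in for the 1PL-AG model
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def sprt_classify(responses, difficulties, theta0=-0.5, theta1=0.5,
                  alpha=0.05, beta=0.05):
    # Wald's sequential probability ratio test for a pass/fail decision;
    # theta0 and theta1 bracket the cut score (the indifference region).
    lower = math.log(beta / (1 - alpha))
    upper = math.log((1 - beta) / alpha)
    llr = 0.0
    for x, b in zip(responses, difficulties):
        p0, p1 = p_correct(theta0, b), p_correct(theta1, b)
        llr += math.log((p1 if x else 1 - p1) / (p0 if x else 1 - p0))
        if llr <= lower:
            return "fail"
        if llr >= upper:
            return "pass"
    return "undecided"

random.seed(1)
bank = [random.uniform(-2, 2) for _ in range(40)]                  # hypothetical item bank
responses = [int(random.random() < p_correct(0.8, b)) for b in bank]  # simulated examinee
print(sprt_classify(responses, bank))

Testing stops as soon as the log likelihood ratio crosses either bound, which is what makes classification testing shorter on average than full-length ability estimation.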
Foy, Pierre, Ed.; Arora, Alka, Ed.; Stanco, Gabrielle M., Ed. – International Association for the Evaluation of Educational Achievement, 2013
This supplement describes national adaptations made to the international version of the TIMSS 2011 background questionnaires. This information provides users with a guide to evaluate the availability of internationally comparable data for use in secondary analyses involving the TIMSS 2011 background variables. Background questionnaire adaptations…
Descriptors: Questionnaires, Technology Transfer, Adoption (Ideas), Media Adaptation
Peer reviewed
Forster, Kenneth I. – Journal of Memory and Language, 2008
It is commonly assumed that a significant item analysis (F2) provides assurance that the treatment effect generalizes to the population from which the items were drawn, which in turn implies that the effect is reasonably general across items. The latter implication is shown to be false, and it is argued that a new test of…
Descriptors: Item Analysis, Evaluation Methods
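Forster's entry above concerns the by-items analysis (F2) and what it does or does not license. Purely as an illustration of the standard by-subjects versus by-items contrast, and not of the new test the paper argues for, the sketch below runs both analyses for a two-condition design using paired t tests, the two-condition special case of F1 and F2. The reaction-time data are simulated.

import numpy as np
from scipy.stats import ttest_rel

rng = np.random.default_rng(3)
# Hypothetical reaction times: 12 subjects x 10 items; the same items
# appear in both conditions A and B.
cond_a = 600 + rng.normal(0, 40, size=(12, 10))
cond_b = 630 + rng.normal(0, 40, size=(12, 10))

# t1 (by subjects): average over items, treat subjects as the random factor.
t1 = ttest_rel(cond_a.mean(axis=1), cond_b.mean(axis=1))
# t2 (by items): average over subjects, treat items as the random factor.
t2 = ttest_rel(cond_a.mean(axis=0), cond_b.mean(axis=0))
print("by subjects:", t1.statistic, t1.pvalue)
print("by items:   ", t2.statistic, t2.pvalue)

Forster's point is that a significant t2 (or F2) of this kind does not by itself guarantee that the effect holds across most items, only that it is unlikely to be zero on average for the item population.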
Peer reviewed
Kreiner, Svend – Applied Psychological Measurement, 2011
To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
Descriptors: Item Analysis, Correlation, Item Response Theory, Models
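Kreiner's entry above discusses the informal use of biserial correlations between items and restscores when checking the Rasch model's equal-discrimination assumption. As an illustration of that informal check only, and not of the formal test the article develops, the sketch below converts each item's point-biserial item-restscore correlation into a biserial coefficient via the classical normal-ordinate formula; the responses are hypothetical.

import numpy as np
from scipy.stats import norm

def biserial_item_restscore(responses):
    # Biserial correlation between each 0/1 item and its restscore,
    # using r_bis = r_pb * sqrt(p * q) / phi(z_p).
    responses = np.asarray(responses, dtype=float)
    total = responses.sum(axis=1)
    coefs = []
    for j in range(responses.shape[1]):
        item = responses[:, j]
        rest = total - item
        p = item.mean()
        r_pb = np.corrcoef(item, rest)[0, 1]     # point-biserial
        y = norm.pdf(norm.ppf(p))                # normal ordinate at the split
        coefs.append(r_pb * np.sqrt(p * (1 - p)) / y)
    return np.array(coefs)

# Hypothetical 0/1 responses: 8 examinees, 5 items.
demo = np.array([[1, 0, 1, 1, 0],
                 [1, 1, 1, 0, 1],
                 [0, 0, 1, 0, 0],
                 [1, 1, 1, 1, 1],
                 [0, 0, 0, 1, 0],
                 [1, 0, 1, 1, 1],
                 [0, 1, 0, 0, 0],
                 [1, 1, 1, 1, 0]])
print(np.round(biserial_item_restscore(demo), 2))

Under the Rasch assumption of equal item discrimination these coefficients should be roughly homogeneous across items; the article's concern is that eyeballing such coefficients is informal, hence the case for a formal test.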
Peer reviewed
Pounder, Diana – Journal of Research on Leadership Education, 2012
This article addresses the leadership preparation line of inquiry developed in the past decade by the University Council for Educational Administration/Learning and Teaching in Educational Leadership Special Interest Group Taskforce on Evaluating Leadership Preparation Programs, and it particularly addresses the series of survey instruments…
Descriptors: Administrator Education, Educational Administration, Instructional Leadership, Program Evaluation
Peer reviewed
Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Jung, EunJu; Liu, Kimy; Geller, Josh – Educational Assessment, 2013
In this article, we highlight the need for a precisely defined construct in score-based validation and discuss the contribution of cognitive theories to accurately and comprehensively defining the construct. We propose a framework for integrating cognitively based theoretical and empirical evidence to specify and evaluate the construct. We apply…
Descriptors: Test Validity, Construct Validity, Scores, Evidence
Peer reviewed
Gonyea, Robert M.; Miller, Angie – New Directions for Institutional Research, 2011
Correlations between self-reported learning gains and direct, longitudinal measures that ostensibly correspond in content area are generally inadequate. This chapter clarifies that self-reported measures of learning are more properly used and interpreted as evidence of students' perceived learning and affective outcomes. In this context, the…
Descriptors: Evidence, College Students, Institutional Research, Social Desirability