Publication Date

| Period | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 9 |
| Since 2022 (last 5 years) | 50 |
| Since 2017 (last 10 years) | 103 |
| Since 2007 (last 20 years) | 160 |
Author

| Author | Count |
| --- | --- |
| Plake, Barbara S. | 7 |
| Huntley, Renee M. | 5 |
| Tollefson, Nona | 4 |
| Wainer, Howard | 4 |
| Baghaei, Purya | 3 |
| Bennett, Randy Elliot | 3 |
| Halpin, Glennelle | 3 |
| Katz, Irvin R. | 3 |
| Lunz, Mary E. | 3 |
| Allen, Nancy L. | 2 |
| Anderson, Paul S. | 2 |
Audience

| Audience | Count |
| --- | --- |
| Researchers | 8 |
| Policymakers | 1 |
| Practitioners | 1 |
| Teachers | 1 |
Location

| Location | Count |
| --- | --- |
| Germany | 8 |
| Turkey | 8 |
| Australia | 5 |
| China | 4 |
| Indonesia | 4 |
| Iran | 4 |
| United Kingdom (England) | 4 |
| Canada | 3 |
| Japan | 3 |
| Netherlands | 3 |
| Taiwan | 3 |
Laws, Policies, & Programs

| Program | Count |
| --- | --- |
| Pell Grant Program | 1 |
Kobrin, Jennifer L.; Kim, Rachel; Sackett, Paul – College Board, 2011
There is much debate on the merits and pitfalls of standardized tests for college admission, with questions regarding the format (multiple-choice versus constructed response), cognitive complexity, and content of these assessments (achievement versus aptitude) at the forefront of the discussion. This study addressed these questions by…
Descriptors: College Entrance Examinations, Mathematics Tests, Test Items, Predictive Validity
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – ETS Research Report Series, 2008
This study examined variations of a nonequivalent groups equating design used with constructed-response (CR) tests to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, the study investigated the use of anchor CR item rescoring in the context of classical…
Descriptors: Equated Scores, Comparative Analysis, Test Format, Responses
Pyle, Katie; Jones, Emily; Williams, Chris; Morrison, Jo – Educational Research, 2009
Background: All national curriculum tests in England are pre-tested as part of the development process. Differences in pupil performance between pre-test and live test are consistently found. This difference has been termed the pre-test effect. Understanding the pre-test effect is essential in the test development and selection processes and in…
Descriptors: Foreign Countries, Pretesting, Context Effect, National Curriculum
Allalouf, Avi; Rapp, Joel; Stoller, Reuven – International Journal of Testing, 2009
When a test is adapted from a source language (SL) into a target language (TL), the two forms are usually not psychometrically equivalent. If linking between test forms is necessary, those items that have had their psychometric characteristics altered by the translation (differential item functioning [DIF] items) should be eliminated from the…
Descriptors: Test Items, Test Format, Verbal Tests, Psychometrics
Nehm, Ross H.; Schonfeld, Irvin Sam – Journal of Research in Science Teaching, 2008
Growing recognition of the central importance of fostering an in-depth understanding of natural selection has, surprisingly, failed to stimulate work on the development and rigorous evaluation of instruments that measure knowledge of it. We used three different methodological tools, the Conceptual Inventory of Natural Selection (CINS), a modified…
Descriptors: Evolution, Science Education, Interviews, Measures (Individuals)
Peer reviewed: Plake, Barbara S. – Journal of Experimental Education, 1980
Three item orderings and two levels of knowledge of the ordering were used to study differences in test results, students' perceptions of the test's fairness and difficulty, and students' estimates of their test performance. No significant order effect was found. (Author/GK)
Descriptors: Difficulty Level, Higher Education, Scores, Test Format
Liu, Kimy; Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Tindal, Gerald – Behavioral Research and Teaching, 2008
BRT Math Screening Measures focus on students' mathematics performance in grade-level standards for students in grades 1-8. A total of 24 test forms are available with three test forms per grade corresponding to fall, winter, and spring testing periods. Each form contains computation problems and application problems. BRT Math Screening Measures…
Descriptors: Test Items, Test Format, Test Construction, Item Response Theory
Ascalon, M. Evelina; Meyers, Lawrence S.; Davis, Bruce W.; Smits, Niels – Applied Measurement in Education, 2007
This article examined two item-writing guidelines: the format of the item stem and homogeneity of the answer set. Answering the call of Haladyna, Downing, and Rodriguez (2002) for empirical tests of item writing guidelines and extending the work of Smith and Smith (1988) on differential use of item characteristics, a mock multiple-choice driver's…
Descriptors: Guidelines, Difficulty Level, Standard Setting, Driver Education
Huntley, Renee M.; Loyd, Brenda H. – 1982
The study investigated the effect of "item density" on item difficulty in passage-related language tests. Item density refers to the number and frequency of items in relation to clear text. The format used was a passage with underlinings to signal the language situations out of which the items were constructed. American College Testing…
Descriptors: Difficulty Level, Language Tests, Secondary Education, Test Construction
Liao, Chi-Wen; Livingston, Samuel A. – ETS Research Report Series, 2008
Randomly equivalent forms (REF) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…
Descriptors: Equated Scores, Item Analysis, Test Items, Difficulty Level
Peer reviewed: Albanese, Mark A. – Educational Measurement: Issues and Practice, 1993
A comprehensive review is given of evidence bearing on the recommendation to avoid complex multiple-choice (CMC) items. Avoiding Type K items (four primary responses and five secondary choices) seems warranted, but the evidence against CMC items in general is less clear. (SLD)
Descriptors: Cues, Difficulty Level, Multiple Choice Tests, Responses
Linacre, John Michael – 1991
A rating scale can be expressed as a chain of dichotomous items. The relationship between the dichotomies depends on the manner in which the rating scale is presented to the test taker. Three models for ordered scales are discussed. In the success model, which represents growth, the lowest or easiest category is presented first. If the test taker…
Descriptors: Difficulty Level, Equations (Mathematics), Mathematical Models, Rating Scales
Peer reviewed: Tollefson, Nona – Educational and Psychological Measurement, 1987
This study compared the item difficulty, item discrimination, and test reliability of three forms of multiple-choice items: (1) one correct answer; (2) "none of the above" as a foil; and (3) "none of the above" as the correct answer. Twelve items in the three formats were administered in a college statistics examination. (BS)
Descriptors: Difficulty Level, Higher Education, Item Analysis, Multiple Choice Tests
Woldbeck, Tanya – 1998
This paper summarizes some of the basic concepts in test equating. Various types of equating methods, as well as data collection designs, are outlined, with attempts to provide insight into preferred methods and techniques. Test equating describes a group of methods that enable test constructors and users to compare scores from two different forms…
Descriptors: Comparative Analysis, Data Collection, Difficulty Level, Equated Scores
Peer reviewed: Barker, Douglas; Ebel, Robert L. – Contemporary Educational Psychology, 1982
Two forms of an undergraduate examination were constructed. Tests varied with respect to item truth value (true, false) and method of phrasing (positive, negative). Negatively stated items were more difficult but not more discriminating than positively stated items. False items were not more difficult but were more discriminating than true items.…
Descriptors: Difficulty Level, Higher Education, Item Analysis, Response Style (Tests)
