ERIC - Search Results

Publication Date

In 2026	0
Since 2025	8
Since 2022 (last 5 years)	31
Since 2017 (last 10 years)	79
Since 2007 (last 20 years)	136

Descriptor

Test Items	226
Test Length	226
Item Response Theory	90
Sample Size	66
Test Construction	66
Computer Assisted Testing	54
Adaptive Testing	52
Simulation	51
Test Reliability	44
Error of Measurement	41
Comparative Analysis	40
Difficulty Level	40
Accuracy	38
Item Analysis	37
Test Format	37
Test Validity	32
Correlation	30
Computation	29
Statistical Analysis	29
Test Bias	29
Monte Carlo Methods	28
Models	27
Scores	27
Item Banks	26
Goodness of Fit	23
More ▼

Publication Type

Reports - Research	155
Journal Articles	138
Reports - Evaluative	41
Speeches/Meeting Papers	32
Dissertations/Theses -…	19
Reports - Descriptive	7
Numerical/Quantitative Data	6
Guides - Non-Classroom	4
Tests/Questionnaires	3
Information Analyses	2
Opinion Papers	2
Historical Materials	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	14
Postsecondary Education	13
Secondary Education	8
Elementary Education	6
Elementary Secondary Education	6
High Schools	4
Early Childhood Education	3
Grade 3	3
Middle Schools	3
Grade 6	2
Intermediate Grades	2
Primary Education	2
Grade 11	1
Grade 12	1
Junior High Schools	1
Preschool Education	1
More ▼

Audience

Researchers	9
Administrators	1
Community	1
Practitioners	1

Location

Turkey	2
Alabama	1
Asia	1
Australia	1
Germany	1
Illinois (Chicago)	1
Indiana	1
Iran	1
Israel	1
Japan	1
Netherlands	1
New Jersey	1
Peru	1
South Korea	1
Taiwan	1
Ukraine	1
More ▼

Laws, Policies, & Programs

Job Training Partnership Act…	1
Race to the Top	1

What Works Clearinghouse Rating

Test Items X

Showing 196 to 210 of 226 results Save | Export

A Study of the Effects of Variation of Short-Term Memory Load, Reading Response Length, and Processing Hierarchy on TOEFL Listening Comprehension Item Performance. Report 33.

Download full text

Henning, Grant – 1991

Criticisms of the Test of English as a Foreign Language (TOEFL) have included speculation that the listening test places too much burden on short-term memory as compared with comprehension, that a knowledge of reading is required to respond successfully, and that many items appear to require mere recall and matching rather than higher-order…

Descriptors: Adults, Auditory Stimuli, Cognitive Processes, Educational Assessment

Comparison of Difficulties and Reliabilities of Math-Completion and Multiple-Choice Item Formats.

Download full text

Oosterhof, Albert C.; Coats, Pamela K. – 1981

Instructors who develop classroom examinations that require students to provide a numerical response to a mathematical problem are often very concerned about the appropriateness of the multiple-choice format. The present study augments previous research relevant to this concern by comparing the difficulty and reliability of multiple-choice and…

Descriptors: Comparative Analysis, Difficulty Level, Grading, Higher Education

An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

Download full text

Wainer, Howard; And Others – 1990

The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing

Testing Practices of High-School Teachers. Bulletin, 1936, No. 9

Download full text

Lee, J. Murray; Segel, David – Office of Education, United States Department of the Interior, 1936

In order to make an intelligent advance in any school practice a knowledge of what schools are doing in that practice is almost indispensable, since a transition in procedures must be a growth from the one to the other. This bulletin gives this background of facts concerning the use of tests and examinations by the different subject departments in…

Descriptors: Testing, Teachers, Standardized Tests, Principals

Methods for Equating Mental Tests. Interim Report for Period March 1982-October 1984.

Download full text

Gialluca, Kathleen A.; And Others – 1984

In this study, simulated and actual Air Force test data were used to compare the different procedures for equating mental tests: conventional (equipercentile and linear), Item Response Theory (IRT), and strong true-score theory (STST); data collection designs used were single-group, equivalent-groups, and anchor test. Equating transformations were…

Descriptors: Adults, Cognitive Ability, Cognitive Tests, Comparative Analysis

Bias and Information of Bayesian Adaptive Testing. Research Report 83-2.

Download full text

Weiss, David J.; McBride, James R. – 1983

Monte Carlo simulation was used to investigate score bias and information characteristics of Owen's Bayesian adaptive testing strategy, and to examine possible causes of score bias. Factors investigated in three related studies included effects of item discrimination, effects of fixed vs. variable test length, and effects of an accurate prior…

Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing

Comparative Racial Analysis of Enlisted Advancement Exams: Item Differentiation. Final Report.

Download full text

Robertson, David W.; And Others – 1977

A comparative study of item analysis was conducted on the basis of race to determine whether alternative test construction or processing might increase the proportion of black enlisted personnel among those passing various military technical knowledge examinations. The study used data from six specialists at four grade levels and investigated item…

Descriptors: Difficulty Level, Enlisted Personnel, Item Analysis, Occupational Tests

Some Guidelines for Determining the Length of Objectives-Based Criterion-Referenced Tests.

Berk, Ronald A. – 1979

Four factors essential to determining how many items should be constructed or sampled for a set of objectives are examined: (1) importance and type of decisions to be made with the results; (2) importance and emphases assigned to the instructional and behavioral objectives; (3) number of objectives; (4) practical constraints, such as item writing…

Descriptors: Behavioral Objectives, Course Objectives, Criterion Referenced Tests, Decision Making

Optimal Item Selection with Credentialing Examinations.

Download full text

Hambleton, Ronald K.; And Others – 1987

The study compared two promising item response theory (IRT) item-selection methods, optimal and content-optimal, with two non-IRT item selection methods, random and classical, for use in fixed-length certification exams. The four methods were used to construct 20-item exams from a pool of approximately 250 items taken from a 1985 certification…

Descriptors: Comparative Analysis, Content Validity, Cutting Scores, Difficulty Level

The Effect of Keying All Options Correct on Equating Functions and Scores.

Download full text

Lenel, Julia C.; Gilmer, Jerry S. – 1986

In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…

Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)

Effect of the Guessing Parameter on the Estimation of the Item Discrimination and Difficulty Parameters When Three-Parameter Logistic Model Is Assumed.

Samejima, Fumiko – 1986

Item analysis data fitting the normal ogive model were simulated in order to investigate the problems encountered when applying the three-parameter logistic model. Binary item tests containing 10 and 35 items were created, and Monte Carlo methods simulated the responses of 2,000 and 500 examinees. Item parameters were obtained using Logist 5.…

Descriptors: Computer Simulation, Difficulty Level, Guessing (Tests), Item Analysis

Factors Influencing the Psychometric Characteristics of an Adaptive Testing Strategy for Test Batteries.

Download full text

Maurelli, Vincent A.; Weiss, David J. – 1981

A monte carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory

Optimal Number of Options: An Investigation of the Assumption of Proportionality.

Peer reviewed

Budescu, David V.; Nevo, Baruch – Journal of Educational Measurement, 1985

The proportionality model assumes that total testing time is proportional to the number of test items and the number of options per multiple choice test item. This assumption was examined, using test items having from two to five options. The model was not supported. (Author/GDC)

Descriptors: College Entrance Examinations, Foreign Countries, Higher Education, Item Analysis

The Second Century of Ability Testing: Some Predictions and Speculations

Peer reviewed

Direct link

Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004

The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…

Descriptors: Ability, Testing, Futures (of Society), Psychometrics

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Download full text

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)

« Previous Page | Next Page »

Pages: 1 | ... | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16

Educational and Psychological…	33
ProQuest LLC	19
Journal of Educational…	16
Applied Measurement in…	9
Applied Psychological…	9
ETS Research Report Series	9
International Journal of…	7
International Journal of…	7
Journal of Educational and…	5
Measurement:…	4
Journal of Psychoeducational…	3
Assessment & Evaluation in…	2
Education and Information…	2
Educational Sciences: Theory…	2
Eurasian Journal of…	2
Grantee Submission	2
Journal of Experimental…	2
Journal of Technology,…	2
Physical Review Physics…	2
ACT Education Corp.	1
AERA Online Paper Repository	1
Advanced Education	1
Anatomical Sciences Education	1
Asia Pacific Education Review	1
Assessment and Evaluation in…	1
More ▼

Wainer, Howard	6
Hambleton, Ronald K.	4
Wang, Wen-Chung	4
Berk, Ronald A.	3
Burton, Richard F.	3
Cohen, Allan S.	3
Huggins-Manley, Anne Corinne	3
Lee, Won-Chan	3
Lee, Yi-Hsuan	3
Pommerich, Mary	3
Reckase, Mark D.	3
Sijtsma, Klaas	3
Wang, Chun	3
Weiss, David J.	3
Zhang, Jinming	3
Bradshaw, Laine	2
Bulut, Okan	2
Chen, Shu-Ying	2
Cheng, Ying	2
Chernyshenko, Oleksandr S.	2
Cui, Ying	2
De Ayala, R. J.	2
Diao, Qi	2
Dogan, Nuri	2
More ▼

Program for International…	4
Test of English as a Foreign…	3
Trends in International…	3
SAT (College Admission Test)	2
ACT Assessment	1
Advanced Placement…	1
Armed Forces Qualification…	1
COMPASS (Computer Assisted…	1
Comprehensive Tests of Basic…	1
Force Concept Inventory	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Medical College Admission Test	1
National Longitudinal Study…	1
New Jersey College Basic…	1
Otis Lennon School Ability…	1
Raven Advanced Progressive…	1
School and College Ability…	1
Stanford Binet Intelligence…	1
Texas Assessment of Basic…	1
Texas Educational Assessment…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1
More ▼