Showing all 15 results
Peer reviewed
Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018
This study investigated the effects of linking potentially multidimensional test forms using fixed item parameter calibration. Forms had equal or unequal total test difficulty, with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…
Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level
Plake, Barbara S.; And Others – 1983
Differential test performance by undergraduate males and females enrolled in a developmental educational psychology course (n=167) was reported on a quantitative examination as a function of item arrangement. Males were expected to perform better than females on tests whose items were arranged easy to hard. Plake and Ansorge (1982) speculated this may…
Descriptors: Difficulty Level, Feedback, Higher Education, Scoring
Lee, Jo Ann; And Others – 1984
The difficulty of test items administered by paper and pencil was compared with the difficulty of the same items administered by computer. The study was conducted to determine whether an interaction exists between mode of test administration and ability. An arithmetic reasoning test was constructed for this study. All examinees had taken the Armed…
Descriptors: Adults, Comparative Analysis, Computer Assisted Testing, Difficulty Level
Rubin, Lois S.; Mott, David E. W. – 1984
The effect of an item's position within a test on its difficulty value was investigated. Using a 60-item operational test composed of 5 subtests, 60 items were placed as experimental items on a number of spiralled test forms in three different positions (first, middle, last) within the subtest composed of like items.…
Descriptors: Difficulty Level, Item Analysis, Minimum Competency Testing, Reading Tests
Peer reviewed
Wainer, Howard; And Others – Journal of Educational Measurement, 1994
The comparability of scores on test forms that are constructed through examinee item choice is examined in an item response theory framework. The approach is illustrated with data from the College Board's Advanced Placement Test in Chemistry taken by over 18,000 examinees. (SLD)
Descriptors: Advanced Placement, Chemistry, Comparative Analysis, Constructed Response
Bowman, Robert W., Jr.; Frary, Robert B. – 1983
College teachers often use norm-referenced classroom tests that are too easy to distinguish adequately among levels of student achievement, yet they are reluctant to adopt more difficult tests. We explored the basis for current practices concerning test difficulty through informal interviews and questionnaires completed by faculty members and…
Descriptors: Achievement Tests, Difficulty Level, Higher Education, Measurement Techniques
Oosterhof, Albert C.; Coats, Pamela K. – 1981
Instructors who develop classroom examinations that require students to provide a numerical response to a mathematical problem are often concerned about the appropriateness of the multiple-choice format. The present study augments previous research relevant to this concern by comparing the difficulty and reliability of multiple-choice and…
Descriptors: Comparative Analysis, Difficulty Level, Grading, Higher Education
Burton, Nancy W.; And Others – 1976
Assessment exercises (items) in three different formats--multiple-choice with an "I don't know" (IDK) option, multiple-choice without the IDK, and open-ended--were placed at the beginning, middle and end of 45-minute assessment packages (instruments). A balanced incomplete blocks analysis of variance was computed to determine the biasing…
Descriptors: Age Differences, Difficulty Level, Educational Assessment, Guessing (Tests)
van Roosmalen, Willem M. M. – 1983
The construction of objective tests for native language reading comprehension is described. The tests were designed for the early secondary school years in several kinds of schools, vocational and non-vocational. The description focuses on the use of the Rasch model in test development, to develop a large pool of homogeneous items and establish…
Descriptors: Ability Grouping, Difficulty Level, Foreign Countries, Item Banks
Miller, George A. – 1986
In assessing the quality of science teaching for an effort such as the National Assessment of Educational Progress (NAEP), it is important to understand what is meant by scientific thinking--the search for explanations. Instruction should involve higher-order cognitive skill development, but it is difficult to measure reasoning and understanding…
Descriptors: Cognitive Processes, Difficulty Level, Educational Assessment, Educational Testing
Wainer, Howard; Kiely, Gerard L. – 1986
Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns principally involve having the computer construct the test from a precalibrated item pool, substituting statistical characteristics for the test developer's skills. Problems with…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity
Oaster, T. R. F.; And Others – 1986
This study hypothesized that items in the one-question-per-passage format would be less easily answered when administered without their associated contexts than conventional reading comprehension items are. A total of 256 seventh and eighth grade students were administered both Forms 3A and 3B of the Sequential Tests of Educational Progress (STEP 11).…
Descriptors: Context Effect, Difficulty Level, Grade 7, Grade 8
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores
Lunz, Mary E.; And Others – 1989
A method for understanding and controlling the multiple facets of an oral examination (OE) or other judge-intermediated examination is presented and illustrated. This study focused on determining the extent to which the facets model (FM) analysis constructs meaningful variables for each facet of an OE involving protocols, examiners, and…
Descriptors: Computer Software, Difficulty Level, Evaluators, Examiners
van Weeren, J., Ed. – 1983
Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…
Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level