Peer reviewed: Haladyna, Thomas A. – Applied Measurement in Education, 1992
Several multiple-choice item formats are examined in the current climate of test reform. The reform movement is discussed as it affects use of the following formats: (1) complex multiple-choice; (2) alternate choice; (3) true-false; (4) multiple true-false; and (5) the context-dependent item set. (SLD)
Descriptors: Cognitive Psychology, Comparative Testing, Context Effect, Educational Change
Peer reviewed: Johnson, Eugene G. – Journal of Educational Measurement, 1992
Features of the design of the National Assessment of Educational Progress (NAEP) are discussed, with emphasis on the design of the 1992 assessment. Student sample designs for the NAEP and the Trial State Assessment are described, and the focused-balanced incomplete block spiraling method of item sampling is discussed. (SLD)
Descriptors: Academic Achievement, Educational Assessment, Educational Change, Elementary Secondary Education
Peer reviewed: Dorans, Neil J.; And Others – Journal of Educational Measurement, 1992
The standardization approach to comprehensive differential item functioning is described and contrasted with the log-linear approach to differential distractor functioning and the item-response-theory-based approach to differential alternative functioning. Data from an edition of the Scholastic Aptitude Test illustrate application of the approach…
Descriptors: Black Students, College Entrance Examinations, Comparative Testing, Distractors (Tests)
Peer reviewed: Samejima, Fumiko – Applied Psychological Measurement, 1994
The Level-11 vocabulary subtest of the Iowa Tests of Basic Skills was analyzed using a two-stage latent trait approach and a data set of 2,356 examinees, approximately 11 years of age. It is concluded that the nonparametric approach leads to efficient estimation of the latent trait. (SLD)
Descriptors: Achievement Tests, Distractors (Tests), Elementary Education, Elementary School Students
Peer reviewed: Bejar, Isaac I.; Yocom, Peter – Applied Psychological Measurement, 1991
An approach to test modeling is illustrated that encompasses both response consistency and response difficulty. This generative approach makes validation an ongoing process. An analysis of hidden figure items with 60 high school students supports the feasibility of the method. (SLD)
Descriptors: Construct Validity, Difficulty Level, Evaluation Methods, High School Students
Peer reviewed: Chipman, Susan F.; And Others – American Educational Research Journal, 1991
The effects of problem content on mathematics word problem performance were explored for 128 male and 128 female college students solving problems with masculine, feminine, and neutral (familiar and unfamiliar) cover stories. No effect of sex typing was found, and a small, but highly significant, effect was found for familiarity. (SLD)
Descriptors: College Students, Comparative Testing, Familiarity, Females
Peer reviewed: Camilli, Gregory – Applied Psychological Measurement, 1992
A mathematical model is proposed to describe how group differences in distributions of abilities, which are distinct from the target ability, influence the probability of a correct item response. In the multidimensional approach, differential item functioning is considered a function of the educational histories of the examinees. (SLD)
Descriptors: Ability, Comparative Analysis, Equations (Mathematics), Factor Analysis
Peer reviewed: Harris, Abigail M.; Carlton, Sydell T. – Applied Measurement in Education, 1993
Differential item functioning on 6 forms of the Scholastic Aptitude Test was examined for 181,228 male and 198,668 female students, focusing on the points tested, the test format, and the subject matter in which items are embedded. Implications of the identifiable differences are discussed. (SLD)
Descriptors: College Entrance Examinations, Comparative Analysis, Females, High School Students
Peer reviewed: Trachimowicz, Ruth A.; Lee, David Y. – Optometric Education, 1999
At Illinois College of Optometry a peer-review committee evaluates the exams of both basic science and clinical courses to provide an objective means of evaluating a faculty member's teaching ability. Five aspects of the exam (content, construction, statistical analysis, adjusted questions, instructor's rationale) are evaluated to determine the…
Descriptors: Allied Health Occupations Education, College Faculty, College Instruction, Course Evaluation
Peer reviewed: Coniam, David – Hong Kong Journal of Applied Linguistics, 1998
Examines the process of developing an English-as-a-Second-Language (ESL) cloze test by computer based on a language corpus. Two such tests developed and pilot-tested in Hong Kong were found to have test items that were not as good as they would have been if designed by a competent human, but better than expected. (Author/MSE)
Descriptors: Applied Linguistics, Cloze Procedure, Computer Assisted Testing, Computer Software Evaluation
Peer reviewed: DeMars, Christine E. – Applied Measurement in Education, 2000
Studied the effects of test consequences, response formats, gender, and ethnicity on the mathematics and science sections of the Michigan High School Proficiency Test. Results for more than 11,000 students show that students taking constructed response and multiple choice formats performed better under high stakes conditions. Discusses gender and…
Descriptors: Constructed Response, Ethnicity, High School Students, High Schools
Pae, Hye Kyeong; Wise, Justin C.; Cirino, Paul T.; Sevcik, Rose A.; Lovett, Maureen W.; Wolf, Maryanne; Morris, Robin D. – Assessment, 2005
This study examined the magnitude of differences in standard scores, convergent validity, and concurrent validity when an individual's performance was gauged using the revised and the normative update (Woodcock, 1998) editions of the Woodcock Reading Mastery Test in which the actual test items remained identical but norms have been updated. From…
Descriptors: Grade 3, Test Items, Intervention, Grade 1
Alimi, Modoupe M.; Ellece, Sibonile – Language, Culture and Curriculum, 2003
The paper discusses the relationship between learning objectives and testing procedures in the Department of English at the University of Botswana, based on an examination of some course outlines and examination papers. Observations show that course designs are deficient in the articulation of learner outcomes. This deficiency is reflected in the…
Descriptors: English (Second Language), Second Language Learning, Test Items, Scholarship
Capella, Michele E.; Turner, Ronna C. – Rehabilitation Counseling Bulletin, 2004
Although state agencies are required by law to assess their consumers' satisfaction with vocational rehabilitation (VR), each state uses its own instrument to measure satisfaction. This not only makes comparisons across states impossible but also means that the quality of these instruments varies widely from state to state. As with other…
Descriptors: Vocational Rehabilitation, State Agencies, Test Construction, Satisfaction
Clarke, Sophie; Lindsay, Katharine; McKenna, Chris; New, Steve – ALT-J: Research in Learning Technology, 2004
There has been a wealth of investigation into the use of online multiple-choice questions as a means of summative assessment; however, research into the use of formative MCQs by the same mode of delivery remains patchy. Similarly, research and implementation have been largely concentrated within the Sciences and Medicine rather than the…
Descriptors: Summative Evaluation, Computer Assisted Testing, Online Systems, Multiple Choice Tests

