Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Bassler, Otto C.; Caulkins, Thomas G. – 1984
A model for summarizing test scores and using them to modify instructional programs is presented. The proposed model consists of two types of summaries of the data gathered through standardized tests. The first summary contains individual and single class results. Information in a "Class Item Response Record" chart provides individual student…
Descriptors: Elementary Secondary Education, Instructional Improvement, Models, Scores
Furst, Edward J. – 1983
Enough evidence has accumulated on Bloom's "Taxonomy of Educational Objectives" for the cognitive domain to justify a review of its communicability. This article covers both published and unpublished studies as well as certain informal reports that bear on this property. It also examines possibilities for improving agreement among…
Descriptors: Achievement Tests, Classification, Cognitive Processes, Diffusion (Communication)
Edwards, John; McCombie, Randy – 1983
The major purpose of the three studies reported here was to investigate possible differences in agreement/disagreement with attitude statements as a function of their type (with regard to positivity/negativity) and personalism. In the first study, 90 students completed scales on energy conservation and on having good study habits. Agreement varied…
Descriptors: Attitude Measures, Higher Education, Response Style (Tests), Semantic Differential
Dorans, Neil J.; Zeller, Karin – ETS Research Report Series, 2004
In the Spring 2003 issue of "Harvard Educational Review," Roy Freedle stated that the SAT® is both culturally and statistically biased, and he proposed a solution to ameliorate this bias. His claims, which garnered national attention, were based on serious errors in his analysis. We begin our analyses by assessing the psychometric…
Descriptors: Test Bias, Statistical Bias, Psychometrics, College Entrance Examinations
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics of the tests being equated. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…
Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level
Seong, Tae-Je – 1990
The similarity of item and ability parameter estimations was investigated using two numerical analysis techniques via marginal maximum likelihood estimation (MMLE) with a large simulated data set (n=1,000 examinees) and changing the number of quadrature points. MMLE estimation uses a numerical analysis technique to integrate examinees' abilities…
Descriptors: Comparative Analysis, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Zeng, Lingjia; Bashaw, Wilbur L. – 1990
A joint maximum likelihood estimation algorithm, based on the partial compensatory multidimensional logistic model (PCML) proposed by L. Zeng (1989), is presented. The algorithm simultaneously estimates item difficulty parameters, the strength of each dimension, and individuals' abilities on each of the dimensions involved in arriving at a correct…
Descriptors: Ability Identification, Algorithms, Computer Simulation, Difficulty Level
Clariana, Roy B. – 1990
Research has shown that multiple-choice questions formed by transforming or paraphrasing a reading passage provide a measure of student comprehension. It is argued that similar transformation and paraphrasing of lesson questions is an appropriate way to form parallel multiple-choice items to be used as a posttest measure of student comprehension.…
Descriptors: Comprehension, Computer Assisted Testing, Difficulty Level, Measurement Techniques
Micceri, Theodore; And Others – 1987
Several issues relating to agreement estimates for different types of data from performance evaluations are considered. New indices of agreement are presented for ordinal level items and for summative scores produced by nominal or ordinal level items. Two sets of empirical data illustrate the performance of the two formulas derived to estimate…
Descriptors: Correlation, Data Analysis, Educational Research, Estimation (Mathematics)
Alderson, J. Charles – 1990
Language testing is an area of applied linguistics that combines the exercise of professional judgment about language, learning, and the nature of the achievement of language learning with empirical data about student performance and, by inference, their abilities. The relationship between judgments and empirical data in language testing is…
Descriptors: Comparative Analysis, Difficulty Level, Evaluative Thinking, Item Analysis
Linacre, John M.; Wright, Benjamin D. – 1987
The Mantel-Haenszel (MH) procedure attempts to identify and quantify differential item performance (item bias). This paper summarizes the MH statistics, and identifies the parameters they estimate. An equivalent procedure based on the Rasch model is described. The theoretical properties of the two approaches are compared and shown to require the…
Descriptors: Algorithms, Estimation (Mathematics), Item Analysis, Measurement Techniques
Haladyna, Thomas M.; And Others – 1987
This paper discusses the development and use of "item shells" in constructing multiple-choice tests. An item shell is a "hollow" item that contains the syntactic structure and context of an item without specific content. Item shells are empirically developed from successfully used items selected from an existing item pool. Use…
Descriptors: Difficulty Level, Health Personnel, Item Banks, Multiple Choice Tests
Perkins, Kyle; Duncan, Ann – 1988
An item discriminability study of the Iowa Tests of Basic Skills Language Skills tests identified test items that are robust discriminators, psychometrically capable of separating low scorers from higher scorers in the language tests battery. The analysis was conducted by calculating a point-biserial correlation for each item on the four language…
Descriptors: Correlation, Item Analysis, Language Skills, Language Tests
Jelden, D. L. – 1987
A study of 696 undergraduates at the University of Northern Colorado was undertaken to determine the effects of computerized unit test item feedback on final examination scores. The study, which employed the PHOENIX computer managed instruction system, included students at all undergraduate levels enrolled in an Oceanography course. To determine…
Descriptors: College Students, Computer Assisted Instruction, Computer Assisted Testing, Feedback
Doolittle, Allen E. – 1984
The definition of differential item performance (DIP), often referred to as item bias, is discussed. DIP is suggested as a comprehensive term to encompass item bias (item invalidity which is unfair to certain population subgroups) and instructional bias (a valid reflection of group differences in instruction or background). This study investigated…
Descriptors: College Entrance Examinations, Higher Education, Item Analysis, Mathematics Achievement

Peer reviewed
