Publication Date
| In 2026 | 0 |
| Since 2025 | 200 |
| Since 2022 (last 5 years) | 1070 |
| Since 2017 (last 10 years) | 2580 |
| Since 2007 (last 20 years) | 4941 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedZacharias, Jerrold R. – National Elementary Principal, 1979
Outlines a method for constructing minimum competency tests that are legitimate, answerable, and useful. At the heart of such tests is an item bank that is open to all to view. (Author/IRT)
Descriptors: Academic Standards, Elementary Secondary Education, Item Banks, Minimum Competency Testing
Peer reviewedQuereshi, M. Y.; Fisher, Thomas L. – Educational and Psychological Measurement, 1977
Logical estimates of item difficulty made by judges were compared to empirical estimates derived from a test administration. Results indicated substantial correspondence between logical and empirical estimates, and substantial variation among judges. Further, the more elaborate the system used by judges to make estimates, the more accurate the…
Descriptors: Court Judges, Difficulty Level, Evaluation Methods, Item Analysis
Peer reviewedMartinez, Michael E.; Katz, Irvin R. – Educational Assessment, 1996
Item level differences between a type of constructed response item (figural response) and comparable multiple choice items in the domain of architecture were studied. Data from 120 architects and architecture students show that item level differences in difficulty correspond to differences in cognitive processing requirements and that relations…
Descriptors: Architects, Architecture, Cognitive Processes, Constructed Response
Peer reviewedMeel, David E. – Educational Assessment, 1997
Four issues associated with calculator-available assessments are discussed: (1) benefits and problems of calculator use; (2) components of various types of calculator-available assessments; (3) construction of calculator-available assessments; and (4) the need for additional examination of items included on calculator-available assessments. (SLD)
Descriptors: Calculators, Elementary Secondary Education, Higher Education, Mathematics Tests
Peer reviewedFitzpatrick, Anne R.; And Others – Journal of Educational Measurement, 1996
One-parameter (1PPC) and two-parameter partial credit (2PPC) models were compared using real and simulated data with constructed response items present. Results suggest that the more flexible three-parameter logistic-2PPC model combination produces better model fit than the combination of the one-parameter logistic and the 1PPC models. (SLD)
Descriptors: Comparative Analysis, Constructed Response, Goodness of Fit, Performance Based Assessment
Peer reviewedO Cinneide, Barra – Journal of European Industrial Training, 1997
In a theory/lecture course with limited use of case method, it is possible to get students to choose case study questions on examinations if topics are interesting, attention directing, and well understood. Case study test questions give students opportunities for critical analysis and self-expression. (SK)
Descriptors: Academic Achievement, Business Education, Case Studies, Graduate Students
Peer reviewedHoran, Patrick M.; DiStefano, Christine; Motl, Robert W. – Structural Equation Modeling, 2003
Examined the analysis and interpretation of wording effects associated with positively and negatively worded survey items using data from a national sample of secondary school students (National Education Longitudinal Study 1988-1994). Results for 14,374 students support the interpretation of wording effects in terms of personality traits with…
Descriptors: Personality Traits, Secondary Education, Secondary School Students, Self Esteem
Peer reviewedMeijer, Rob R. – Journal of Educational Measurement, 2002
Used empirical data from a certification test to study methods from statistical process control that have been proposed to classify an item score pattern as fitting or misfitting the underlying item response theory model in computerized adaptive testing. Results for 1,392 examinees show that different types of misfit can be distinguished. (SLD)
Descriptors: Certification, Classification, Goodness of Fit, Item Response Theory
Peer reviewedHaladyna, Thomas M.; Downing, Steven M.; Rodriguez, Michael C. – Applied Measurement in Education, 2002
Validated a taxonomy of 31 multiple-choice item-writing guidelines through a logical process that included reviewing 27 textbooks on educational testing and the results of 27 studies and reviews published since 1990. Presents the taxonomy, which is intended for classroom assessment. (SLD)
Descriptors: Classification, Literature Reviews, Multiple Choice Tests, Student Evaluation
Peer reviewedBerry, David T. R.; And Others – Psychological Assessment, 1997
The impact of varying levels of item omissions on Minnesota Multiphasic Personality Inventory-2 (MMPI-2) two-point code types was studied with MMPI-2 results from 100 psychological outpatients. Results suggest that defined code types are relatively robust for up to 30 omitted items. (SLD)
Descriptors: Clinical Diagnosis, Coding, Diagnostic Tests, Mental Disorders
Peer reviewedFrary, Robert B.; Tideman, T. Nicholaus – Educational and Psychological Measurement, 1997
Comparison of two indices of answer copying, one using only wrong responses and the other using right and wrong responses for six tests taken by from 910 to 1,154 college students suggests that indices of copying may perform differentially well according to the size of scores of examinee pairs evaluated. (SLD)
Descriptors: Cheating, College Students, Comparative Analysis, Higher Education
Peer reviewedRoussos, Louis; Stout, William – Applied Psychological Measurement, 1996
A multidimensionality-based differential item functioning (DIF) analysis paradigm is presented that unifies substantive and statistical DIF analysis approaches by linking both to a theoretically sound and mathematically rigorous multidimensional DIF conceptualization. This approach results in the potential for DIF analysis more closely integrated…
Descriptors: Cluster Analysis, Estimation (Mathematics), Hypothesis Testing, Identification
Peer reviewedPotenza, Maria T.; Stocking, Martha L. – Journal of Educational Measurement, 1997
Common strategies for dealing with flawed items in conventional testing, grounded in principles of fairness to examinees, are re-examined in the context of adaptive testing. The additional strategy of retesting from a pool cleansed of flawed items is found, through a Monte Carlo study, to bring about no practical improvement. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Monte Carlo Methods
Peer reviewedSinar, Evan F.; Zickar, Michael J. – Applied Psychological Measurement, 2002
Examined the influence of deviant scale items on item parameter estimates of focal scale items and person parameter estimates through a comparison of item response theory (IRT) and classical test theory (CTT) models. Used Monte Carlo methods to explore results from a pilot investigation of job attitude data. Discusses implications for researchers…
Descriptors: Attitudes, Estimation (Mathematics), Monte Carlo Methods, Robustness (Statistics)
Peer reviewedLawson, Alexandra; Bordignon, Catherine; Nagy, Philip – Studies in Educational Evaluation, 2002
Studied the match between the Ontario (Canada) eighth grade curriculum for 1997 and the item pool of the Third International Mathematics and Science Study (TIMSS) and analyzed the matching process itself. Findings show that the 1997 curriculum is a better match to the TIMSS item pool, achieving the better match by enlarging the curriculum and…
Descriptors: Academic Achievement, Curriculum, Foreign Countries, Grade 8


