Miller, M. David – Journal of Educational Measurement, 1986
An index of student patterns of item response, when aggregated to the class level, was shown to identify classes that have a poor match between test content and instructional coverage. The mean caution index for a class is best interpreted when the within-class standard deviation of the index is known. (Author/LMO)
Descriptors: Classes (Groups of Students), Elementary Education, Error Patterns, Goodness of Fit

Morgenstern, Carol Faltin; Renner, John W. – Journal of Research in Science Teaching, 1984
Determined which of 10 rational thinking powers are measured by 12 commercially available standardized science tests. One result reported is that 90 percent of the test items required only recall. The conclusion was drawn that the producers of standardized tests are not concerned with measuring student achievement of the rational powers.…
Descriptors: Biology, Chemistry, Cognitive Processes, Earth Science

Kyllonen, Patrick C.; And Others – Journal of Educational Psychology, 1984
Using 146 high school students, this research assessed the effects of aptitude, strategy training, and item characteristics on the strategic processes employed in the performance of spatial visualization tasks. Treatment effects were shown to depend on the subject's aptitude profile and on the characteristics of items. (BS)
Descriptors: Aptitude Treatment Interaction, High Schools, Item Analysis, Latent Trait Theory

Choppin, Bruce – Evaluation in Education: An International Review Series, 1985
The potential value of developing item banks that would both guarantee the quality of teacher-made examinations and allow for national comparison of student achievement is described. Use of Rasch model methods, rather than classical item analysis techniques, to calibrate the items for these banks is illustrated. (BS)
Descriptors: Achievement Tests, Computer Assisted Testing, Educational Assessment, Elementary Secondary Education

Fuhrman, Susan H. – Consortium for Policy Research in Education, 2003
To assist in the redesign of accountability systems, the Consortium for Policy Research in Education (CPRE) and the Center for Research on Evaluation, Student Standards, and Testing (CRESST) sought to assemble knowledge from new research on emerging accountability systems. A book, "Redesigning Accountability Systems for Education," edited by Susan…
Descriptors: Accountability, Educational Quality, Quality Control, Systems Analysis

DeSimone, Janet R. – Online Submission, 2004
The purpose of this study was to investigate middle school general education mathematics teachers' beliefs and knowledge of inclusive instruction and to assess whether or not teachers' classroom practices reflect their beliefs and knowledge. Administrative support and higher education teacher preparation programs were also examined. Data were…
Descriptors: Measures (Individuals), General Education, Teacher Collaboration, Mathematics Instruction

Altepeter, Tom – School Psychology Review, 1983
A critical review of the Expressive One-Word Picture Vocabulary Test (Gardner) is offered. The reviewer feels that the instrument cannot be recommended in its present form. Further research concerning the manual and theoretical issues (particularly test-retest stability) is strongly recommended. (Author/PN)
Descriptors: Error of Measurement, Intelligence Tests, Item Analysis, Pictorial Stimuli

Martin, Gary L.; Newman, Ian M. – Journal of School Health, 1982
A sample of 38 ninth-grade students was tested with a randomized response questionnaire designed to ask respondents sensitive questions with the assurance that responses would remain anonymous. Results of the investigation indicate that this technique can be used effectively with ninth-grade students and with large groups of individuals to obtain…
Descriptors: Adolescents, Confidentiality, Grade 9, Health Behavior

Linn, Robert L.; Slinde, Jeffrey A. – Applied Psychological Measurement, 1979
This study investigated the adequacy of the Rasch model in equating existing standardized tests with groups of examinees not widely separated in ability. With the exception of one test pair and one grade level, the Rasch model using the anchor test procedure provided a reasonably satisfactory means of equating. (Author/CTM)
Descriptors: Equated Scores, Goodness of Fit, Intermediate Grades, Item Analysis

Gabel, Dorothy L.; Sherwood, Robert D. – Journal of Chemical Education, 1979
Presented are the combined test results of chemistry students from seven Central Indiana schools on the ACS-NSTA achievement exam, Form 1975, Part 1. Additional variables investigated include sex, grade level, Longeot classification, and an item analysis of the test items. (BT)
Descriptors: Achievement Tests, Chemistry, Evaluation, Item Analysis

Preece, P. F. W. – School Science Review, 1979
Outlines some recent developments in test-item analysis which are based upon a model of educational measurement developed by the Danish mathematician Georg Rasch. Illustrations of how science teachers in the United Kingdom can use the Rasch model are also included. (HM)
Descriptors: Academic Achievement, Cognitive Measurement, Elementary Secondary Education, Evaluation Methods

Helmes, Edward – Multivariate Behavioral Research, 1989
Objective criteria for evaluating the Eysenck Personality Inventory's internal structure are discussed. An approach based on targeted rotations and the test's scoring key is proposed as a means of providing common criteria. Data from earlier structure and test results for 195 undergraduates support the utility of the 3 criteria developed. (SLD)
Descriptors: Comparative Analysis, Evaluation Criteria, Evaluation Methods, Factor Structure

Hsu, Tse-chi; Yu, Lifa – Educational Measurement: Issues and Practice, 1989
How computers are used to analyze item data is reviewed, and the information that existing item-analysis programs provide is described. Summaries of studies comparing the performance of some of these packages reveal some of their current limitations. Emphasis is on the usefulness to educational practice of these packages. (SLD)
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Computer Uses in Education

Gohmann, Stephan F.; Spector, Lee C. – Journal of Economic Education, 1989
Compares the effect of content ordering and scrambled ordering on examinations in courses, such as economics, that require quantitative skills. Empirical results suggest that students do no better if they are given a content-ordered rather than a scrambled examination, as student performance is not adversely affected by scrambled ordered…
Descriptors: Cheating, Economics Education, Educational Research, Grading

McKinley, Robert L. – Journal of Educational Measurement, 1988
Six procedures for combining sets of item response theory (IRT) item parameter estimates from different samples were evaluated using real and simulated response data. Results support use of covariance matrix-weighted averaging and a procedure using sample-size-weighted averaging of estimated item characteristic curves at the center of the ability…
Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Estimation (Mathematics)