Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Maurelli, Vincent A.; Weiss, David J. – 1981
A Monte Carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory
Taylor, Hugh; And Others – 1978
This guide is organized into five chapters: (1) an approach to testing--decisions made from test results, types of achievement tests, types and levels of objectives, and test validity, reliability, and practicality; (2) classroom test construction--planning, item banks, item writing, assembly, administration, and scoring; (3) test analysis--item…
Descriptors: Achievement Tests, Cutting Scores, Decision Making, Educational Objectives
Eignor, Daniel R.; And Others – 1993
The extensive computer simulation work done in developing the computer adaptive versions of the Graduate Record Examinations (GRE) Board General Test and the College Board Admissions Testing Program (ATP) Scholastic Aptitude Test (SAT) is described in this report. Both the GRE General and SAT computer adaptive tests (CATs), which are fixed length…
Descriptors: Adaptive Testing, Algorithms, Case Studies, College Entrance Examinations
Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004
The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…
Descriptors: Ability, Testing, Futures (of Society), Psychometrics
Mazzeo, John; And Others – 1993
This report describes three exploratory studies of the performance of males and females on the multiple-choice and constructed-response sections of four Advanced Placement Examinations: United States History, Biology, Chemistry, and English Language and Composition. Analyses were carried out for each racial or ethnic group with a sample size of at…
Descriptors: Advanced Placement, College Entrance Examinations, Constructed Response, Ethnic Groups
Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.
Henning, Grant – 1993
This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…
Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)
Fitzpatrick, Steven J.; And Others – 1994
In 1991 the Measurement and Evaluation Center of the University of Texas at Austin was asked to develop a test for credit by examination in four lower division courses in Japanese. The test (in Japanese) was constructed from locally developed items provided by instructors of Japanese. The developed test consisted of 80 items distributed among…
Descriptors: College Students, Cutting Scores, Equivalency Tests, Higher Education
Lindner, Reinhard W.; And Others – 1996
The development of an inventory to measure self-regulated learning is reported. The first step involved the generation of an item pool based on a literature review. A pool of items was developed based on the five identified factors of metacognition, learning strategies, motivation, contextual sensitivity, and environmental utilization and control.…
Descriptors: Grade Point Average, Graduate Students, Higher Education, Item Banks
Gadalla, Tahany – 1995
The Second International Mathematics Study was conducted in 20 countries under the sponsorship of the International Association for Evaluation of Educational Achievement (IEA). Among the instruments used in this study was a questionnaire investigating student attitudes about school, instruction, and mathematics. The fit of an a priori model that…
Descriptors: Comparative Analysis, Cross Cultural Studies, Factor Structure, Foreign Countries
Daniel, Larry G.; King, Debra A. – 1994
This study offers field estimates of the factor validity and internal consistency reliability of the Self-Esteem Index (SEI) using SEI data from 208 regular and special education students in grades 3, 4, and 5. Exploratory factor analytic results support the existence of four factors as anticipated; however, various inconsistencies are noted…
Descriptors: Elementary Education, Elementary School Students, Estimation (Mathematics), Factor Structure
Trevisan, Michael S.; Sax, Gilbert – 1991
The purpose of this study was to compare the reliabilities of two-, three-, four-, and five-choice tests using an incremental option paradigm. Test forms were created incrementally, a method approximating actual test construction procedures. Participants were 154 12th-grade students from the Portland (Oregon) area. A 45-item test with two options…
Descriptors: Comparative Testing, Distractors (Tests), Estimation (Mathematics), Grade 12
Dietz, A. Steven; And Others – 1992
A study focused on the development and pilot testing of a student satisfaction instrument, the results of which may be used to identify weaknesses and strengths of nontraditional degree programs. The literature review demonstrated little empirical evidence that could support positive or negative conclusions regarding student satisfaction within…
Descriptors: College Graduates, Higher Education, Item Banks, Models
Meld, Andrea – 1990
Surveys used for program and institutional evaluation, such as self-studies conducted for accreditation review, are discussed. Frequently, these evaluations take the form of faculty surveys and student surveys. This paper explores the following general considerations associated with mail surveys and other surveys: avoidance of response bias;…
Descriptors: Accreditation (Institutions), Comparative Analysis, Higher Education, Mail Surveys
Sherman, Thomas F.; Merschman, Jane – 1987
The development and assessment of the Woodcock Reading Mastery Test (WRMT) is described in this paper. The first section, after a brief description of the test, outlines the development of the test, including its purpose, how it was tested and calibrated, its administration and scoring, use and interpretation of scores obtained, and statistical…
Descriptors: Criterion Referenced Tests, Diagnostic Tests, Elementary Secondary Education, Reading Diagnosis
O'Brien, Michael; Hampilos, John P. – 1984
The feasibility of creating an item bank from a teacher-made test was examined in two comparable sections of a graduate-level introductory measurement course. The 67-item midterm examination contained multiple-choice and master matching items, which required higher level cognitive processes such as application and analysis. The feasibility of…
Descriptors: Computer Assisted Testing, Criterion Referenced Tests, Difficulty Level, Higher Education

