Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 33 |
| Since 2007 (last 20 years) | 51 |
Descriptor
| Test Length | 133 |
| Test Reliability | 133 |
| Test Validity | 63 |
| Test Items | 44 |
| Test Construction | 42 |
| Scores | 24 |
| Test Format | 23 |
| Computer Assisted Testing | 21 |
| Error of Measurement | 20 |
| Foreign Countries | 20 |
| Item Response Theory | 19 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 12 |
| Postsecondary Education | 11 |
| Elementary Education | 9 |
| Secondary Education | 6 |
| Early Childhood Education | 4 |
| Grade 6 | 4 |
| Intermediate Grades | 4 |
| Middle Schools | 4 |
| Primary Education | 4 |
| Grade 3 | 3 |
| Grade 5 | 3 |
| More ▼ | |
Audience
| Researchers | 4 |
| Practitioners | 2 |
| Community | 1 |
| Support Staff | 1 |
Location
| China | 4 |
| Turkey | 3 |
| Australia | 2 |
| Canada | 2 |
| Ireland | 2 |
| Netherlands | 2 |
| Singapore | 2 |
| United Kingdom | 2 |
| Alabama | 1 |
| California | 1 |
| Germany | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Job Training Partnership Act… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedHambleton, Ronald K.; And Others – Review of Educational Research, 1978
Reviewing psychometric and statistical developments in criterion- referenced testing, this paper presents six sections: uses of criterion- referenced test scores, reliability of criterion-referenced test scores, determination of test length, determination of cut-off scores, test development and validation, and summary and suggestions for further…
Descriptors: Criterion Referenced Tests, Cutting Scores, Mastery Tests, Mathematical Models
Hopper, Margaret F. – 2001
This paper provides an overview of the types of testing accommodations used for students with disabilities and presents arguments for and against their use. It begins by discussing student participation in educational assessments and federal requirements concerning the participation of students with disabilities. The types of accommodations are…
Descriptors: Academic Accommodations (Disabilities), Academic Standards, Disabilities, Educational Assessment
Freedman, Sarah Warshauer – 1991
Writing teachers and educators can add to information from large-scale testing and teachers can strengthen classroom assessment by creating a tight fit between large-scale testing and classroom assessment. Across the years, large-scale testing programs have struggled with a difficult problem: how to evaluate student writing reliably and…
Descriptors: Elementary Secondary Education, Foreign Countries, Informal Assessment, Portfolios (Background Materials)
Schaefer, Mary M.; Gross, Susan K. – 1983
Viewing the reliability for criterion-referenced tests as that of mastery classification decisions, three models for determining reliability were examined using two test administrations so that two estimates could be compared to a standard. A major purpose of the research was to determine how several reliability coefficients (coefficient kappa, an…
Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores
Subkoviak, Michael J. – 1977
Four different procedures were used for estimating the proportion of persons who would be classified consistently as either passing both of two parallel tests or failing both. These four methods were applied at each of four different mastery level scores for each of three different length tests. Data were based on 50 replications of each procedure…
Descriptors: Criterion Referenced Tests, Cutting Scores, Data Analysis, Data Collection
Robertson, David W.; And Others – 1977
A comparative study of item analysis was conducted on the basis of race to determine whether alternative test construction or processing might increase the proportion of black enlisted personnel among those passing various military technical knowledge examinations. The study used data from six specialists at four grade levels and investigated item…
Descriptors: Difficulty Level, Enlisted Personnel, Item Analysis, Occupational Tests
Woelfel, John C.; And Others – 1976
To measure the sex role attitudes of Army personnel, an initial set of 174 items was developed. These items were administered to 721 soldiers at three Army installations; the sample consisted of 540 men and 181 women--401 of these were officers and 320 were enlisted personnel. Factor analysis of these 174 items indicated one strong…
Descriptors: Adults, Attitude Measures, Factor Structure, Females
Steele, Joe M. – 1979
The College Outcome Measures Project/American College Testing Program (COMP/ACT) Writing Assessment is described, and issues of validity and reliability in the assessment of writing samples using qualitative rating scales are explored. COMP/ACT is composed of three role-playing tasks in the social sciences, natural sciences, and arts, which are…
Descriptors: Adults, Essay Tests, Evaluators, Higher Education
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Hambleton, Ronald K. – 1986
The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…
Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics
Maurelli, Vincent A.; Weiss, David J. – 1981
A monte carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory
Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004
The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…
Descriptors: Ability, Testing, Futures (of Society), Psychometrics
Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.
Henning, Grant – 1993
This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…
Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)
Lunz, Mary E.; And Others – 1990
This study explores the test-retest consistency of computer adaptive tests of varying lengths. The testing model used was designed as a mastery model to determine whether an examinee's estimated ability level is above or below a pre-established criterion expressed in the metric (logits) of the calibrated item pool scale. The Rasch model was used…
Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing
McKinley, Robert L.; Reckase, Mark D. – 1981
A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…
Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis

Direct link
