ERIC - Search Results

Publication Date

In 2026	0
Since 2025	5
Since 2022 (last 5 years)	10
Since 2017 (last 10 years)	33
Since 2007 (last 20 years)	51

Descriptor

Test Length	133
Test Reliability	133
Test Validity	63
Test Items	44
Test Construction	42
Scores	24
Test Format	23
Computer Assisted Testing	21
Error of Measurement	20
Foreign Countries	20
Item Response Theory	19
Comparative Analysis	16
Statistical Analysis	16
Psychometrics	15
Difficulty Level	14
Item Analysis	14
Adaptive Testing	13
Language Tests	13
Testing Problems	13
Correlation	12
Higher Education	12
Mathematical Models	12
Testing	12
Mastery Tests	11
Cutting Scores	10
More ▼

Publication Type

Reports - Research	91
Journal Articles	74
Speeches/Meeting Papers	18
Reports - Evaluative	16
Reports - Descriptive	6
Tests/Questionnaires	4
Guides - Non-Classroom	3
Information Analyses	2
Opinion Papers	2
Reference Materials -…	2
Collected Works - Serials	1
Guides - General	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Higher Education	12
Postsecondary Education	11
Elementary Education	9
Secondary Education	6
Early Childhood Education	4
Grade 6	4
Intermediate Grades	4
Middle Schools	4
Primary Education	4
Grade 3	3
Grade 5	3
Grade 7	3
Junior High Schools	3
Elementary Secondary Education	2
Grade 2	2
Grade 4	2
Grade 8	2
High Schools	2
Grade 1	1
Grade 9	1
Kindergarten	1
More ▼

Audience

Researchers	4
Practitioners	2
Community	1
Support Staff	1

Location

China	4
Turkey	3
Australia	2
Canada	2
Ireland	2
Netherlands	2
Singapore	2
United Kingdom	2
Alabama	1
California	1
Germany	1
Illinois (Chicago)	1
Indiana	1
Japan	1
Kenya	1
Maryland	1
New Jersey	1
New Zealand	1
Pennsylvania	1
Peru	1
Poland	1
Portugal	1
South Korea	1
Spain	1
Taiwan	1
More ▼

Laws, Policies, & Programs

Job Training Partnership Act…

What Works Clearinghouse Rating

Test Reliability X

Showing 106 to 120 of 133 results Save | Export

Criterion-Referenced Testing and Measurement: A Review of Technical Issues and Developments

Peer reviewed

Hambleton, Ronald K.; And Others – Review of Educational Research, 1978

Reviewing psychometric and statistical developments in criterion- referenced testing, this paper presents six sections: uses of criterion- referenced test scores, reliability of criterion-referenced test scores, determination of test length, determination of cut-off scores, test development and validation, and summary and suggestions for further…

Descriptors: Criterion Referenced Tests, Cutting Scores, Mastery Tests, Mathematical Models

The Implications of Accommodations in Testing Students with Disabilities.

Download full text

Hopper, Margaret F. – 2001

This paper provides an overview of the types of testing accommodations used for students with disabilities and presents arguments for and against their use. It begins by discussing student participation in educational assessments and federal requirements concerning the participation of students with disabilities. The types of accommodations are…

Descriptors: Academic Accommodations (Disabilities), Academic Standards, Disabilities, Educational Assessment

Evaluating Writing: Linking Large-Scale Testing and Classroom Assessment. Occasional Paper No. 27.

Download full text

Freedman, Sarah Warshauer – 1991

Writing teachers and educators can add to information from large-scale testing and teachers can strengthen classroom assessment by creating a tight fit between large-scale testing and classroom assessment. Across the years, large-scale testing programs have struggled with a difficult problem: how to evaluate student writing reliably and…

Descriptors: Elementary Secondary Education, Foreign Countries, Informal Assessment, Portfolios (Background Materials)

A Comparison of Reliability Estimates from Single and Double Administrations of Criterion-Referenced Tests.

Schaefer, Mary M.; Gross, Susan K. – 1983

Viewing the reliability for criterion-referenced tests as that of mastery classification decisions, three models for determining reliability were examined using two test administrations so that two estimates could be compared to a standard. A major purpose of the research was to determine how several reliability coefficients (coefficient kappa, an…

Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores

Evaluation of Criterion-Referenced Reliability Coefficients. Final Report.

Download full text

Subkoviak, Michael J. – 1977

Four different procedures were used for estimating the proportion of persons who would be classified consistently as either passing both of two parallel tests or failing both. These four methods were applied at each of four different mastery level scores for each of three different length tests. Data were based on 50 replications of each procedure…

Descriptors: Criterion Referenced Tests, Cutting Scores, Data Analysis, Data Collection

Comparative Racial Analysis of Enlisted Advancement Exams: Item Differentiation. Final Report.

Download full text

Robertson, David W.; And Others – 1977

A comparative study of item analysis was conducted on the basis of race to determine whether alternative test construction or processing might increase the proportion of black enlisted personnel among those passing various military technical knowledge examinations. The study used data from six specialists at four grade levels and investigated item…

Descriptors: Difficulty Level, Enlisted Personnel, Item Analysis, Occupational Tests

A Preliminary Version of a Scale to Measure Sex-Role Attitudes in the Army. Research Memorandum 76-3.

Download full text

Woelfel, John C.; And Others – 1976

To measure the sex role attitudes of Army personnel, an initial set of 174 items was developed. These items were administered to 721 soldiers at three Army installations; the sample consisted of 540 men and 181 women--401 of these were officers and 320 were enlisted personnel. Factor analysis of these 174 items indicated one strong…

Descriptors: Adults, Attitude Measures, Factor Structure, Females

The Assessment of Writing Proficiency via Qualitative Ratings of Writing Samples.

Steele, Joe M. – 1979

The College Outcome Measures Project/American College Testing Program (COMP/ACT) Writing Assessment is described, and issues of validity and reliability in the assessment of writing samples using qualitative rating scales are explored. COMP/ACT is composed of three role-playing tasks in the social sciences, natural sciences, and arts, which are…

Descriptors: Adults, Essay Tests, Evaluators, Higher Education

The Effect of Keying All Options Correct on Equating Functions and Scores.

Download full text

Lenel, Julia C.; Gilmer, Jerry S. – 1986

In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…

Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)

Determining Optimal Test Lengths with a Fixed Total Testing Time.

Download full text

Hambleton, Ronald K. – 1986

The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…

Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics

Factors Influencing the Psychometric Characteristics of an Adaptive Testing Strategy for Test Batteries.

Download full text

Maurelli, Vincent A.; Weiss, David J. – 1981

A monte carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory

The Second Century of Ability Testing: Some Predictions and Speculations

Peer reviewed

Direct link

Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004

The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…

Descriptors: Ability, Testing, Futures (of Society), Psychometrics

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Download full text

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)

Test-Retest Consistency of Computer Adaptive Tests.

Lunz, Mary E.; And Others – 1990

This study explores the test-retest consistency of computer adaptive tests of varying lengths. The testing model used was designed as a mastery model to determine whether an examinee's estimated ability level is above or below a pre-established criterion expressed in the metric (logits) of the calibrated item pool scale. The Rasch model was used…

Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing

A Comparison of a Bayesian and a Maximum Likelihood Tailored Testing Procedure.

Download full text

McKinley, Robert L.; Reckase, Mark D. – 1981

A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…

Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Educational and Psychological…	13
Journal of Psychoeducational…	8
Applied Psychological…	5
Journal of Educational…	5
Psychometrika	4
Applied Measurement in…	3
Language Testing	3
Assessment & Evaluation in…	2
ETS Research Report Series	2
International Journal of…	2
Journal of Personality…	2
Psychological Assessment	2
Research Matters	2
ACT Education Corp.	1
AERA Online Paper Repository	1
African Educational Research…	1
Anatomical Sciences Education	1
Assessment	1
Assessment and Evaluation in…	1
College Student Journal	1
Contemporary Educational…	1
Education and Information…	1
Educational Research and…	1
Educational Sciences: Theory…	1
Eurasian Journal of…	1
More ▼

Hambleton, Ronald K.	4
Burton, Richard F.	3
Cliff, Norman	2
Gilmer, Jerry S.	2
Huynh, Huynh	2
Lee, Yi-Hsuan	2
Leite, Walter L.	2
Livingston, Samuel A.	2
Marcoulides, Katerina M.	2
Raborn, Anthony W.	2
Reckase, Mark D.	2
Wilcox, Rand R.	2
Yao, Lihua	2
Zhang, Jinming	2
de Jong, John H. A. L.	2
Abrams, Matthew	1
Allison, Paul A.	1
Almeida, Leandro S.	1
Anderson, Judith A.	1
Andrea Fuster	1
Andy Rick Sánchez-Villena	1
Anthony, Christopher J.	1
Anthony, Christopher James	1
Arens, A. Katrin	1
More ▼

Wechsler Adult Intelligence…	3
McCarthy Scales of Childrens…	2
Peabody Picture Vocabulary…	2
Test of English as a Foreign…	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Adaptive Behavior Scale	1
Armed Forces Qualification…	1
Comprehensive Tests of Basic…	1
Developmental Indicators for…	1
Draw a Person Test	1
Fennema Sherman Mathematics…	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Matching Familiar Figures Test	1
Measures of Academic Progress	1
Medical College Admission Test	1
Minnesota Multiphasic…	1
Multidimensional…	1
National Assessment of…	1
Positive and Negative Affect…	1
School and College Ability…	1
Self Description Questionnaire	1
Stanford Binet Intelligence…	1
More ▼