ERIC - Search Results

Descriptor: Test Reliability

Showing 31 to 45 of 52 results

An Empirical Examination of a Modified Matrix Sampling Procedure as an Evaluation Tool for Grades 7 to 12 in a Midwestern School District

Peer reviewed

Liang, Xin – Evaluation and Research in Education, 2003

Multiple matrix sampling is a data collection technique that ensures accuracy and efficiency in group performance. It has been widely used in large-scale curriculum evaluation since the 1980s. However, the design does not always fully embrace the dynamics of local evaluation demands. The purpose of this study is to introduce a modified matrix…

Descriptors: Curriculum Evaluation, Item Sampling, Matrices, Statistical Studies

Aspects and Applications of Criterion-Referenced Tests

Kriewall, Thomas E. – Illinois School Research, 1972

The author discusses and defines criterion-referenced tests in the context of the classroom needs that have generated much of the current interest in the theory. The primary source of that interest is the growing implementation of individualized curricula. (Author/CB)

Descriptors: Criterion Referenced Tests, Difficulty Level, Individualized Instruction, Item Analysis

The Development and Interpretation of Criterion-Referenced Tests.

Kriewall, Thomas E.; Hirsch, Edward – 1969

As an alternative to a classical test theory basis for criterion-referenced test construction, it is proposed that a strict item-sampling model be used. The computer's role in such a model is outlined. The assumptions of the model are carefully defined and its properties reviewed. The relationship between mastery criteria and such sampling plans…

Descriptors: Arithmetic, Behavioral Objectives, Computer Assisted Instruction, Criterion Referenced Tests

Some Item Analysis and Test Theory for a System of Computer-Assisted Test Construction for Individualized Instruction

Peer reviewed

Lord, Frederic M. – Applied Psychological Measurement, 1977

Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusions are drawn from domain-referenced test…

Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms

Construction and Use of Criterion-Referenced Tests in Program Evaluation Studies. Laboratory of Psychometric and Evaluation Research Report No. 102.

Gifford, Janice A.; Hambleton, Ronald K. – 1980

Technical considerations associated with item selection and reliability assessment are considered in relation to criterion-referenced tests constructed to provide group information. The purpose is to emphasize test building and the evaluation of test scores in program evaluation studies. It is stressed that an evaluator employ a performance or…

Descriptors: Criterion Referenced Tests, Group Testing, Item Sampling, Models

Scale-Score Reporting of National Assessment Data (Final Report).

Mislevy, Robert J.; And Others – 1982

An approach was developed based on item-response models defined at the level of salient subject groups rather than at the level of individuals, designed for use with multiple-matrix sampling designs. In each of three National Assessment of Educational Progress (NAEP) mathematics subtopics, Reiser's group-effects latent trait model was fitted to…

Descriptors: Educational Assessment, Item Analysis, Item Sampling, Latent Trait Theory

The Generalizability of District Means Using Multiple Matrix Sampling.

Pandey, Tej N. – 1978

The concept under investigation was the reliability of estimates of mean scores of groups under various assumptions of multiple-matrix sampling when reliabilities are computed according to procedures based on generalizability theory. Four different cases were compared with respect to the generalizability coefficients depending upon whether pupils…

Descriptors: Achievement Tests, Analysis of Variance, Basic Skills, Elementary Secondary Education

Achievement Test Items--Methods of Study. CSE Monograph Series in Evaluation, 6.

Harris, Chester W.; And Others – 1977

The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…

Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies

The Generalizability of Elementary School Student Ratings of Attitudes Toward School Subjects.

Carloni, John A.; Kolen, Michael J. – 1980

Generalizability theory was used to analyze the dependability of elementary school student ratings of attitudes toward school subjects. The rating scales under investigation have been developed to measure the attitudes of students toward four school subjects at both the primary and intermediate levels. Two generalizability coefficients, differing…

Descriptors: Attitude Measures, Comparative Analysis, Elementary Education, Elementary School Mathematics

Riding the Rasch Tiger. Part 1: Laying the Item Bank Foundation (Paul Volker Would Approve).

Forster, Fred – 1987

Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…

Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis

Aspects and Applications of Criterion-Referenced Tests.

Kriewall, Thomas E. – 1972

The measurement information generated by CRT's is designed for use in instructional management systems where classifications of pupils for treatment are to be decided on the basis of minimal data consistent with predetermined limits for the errors of misclassification. The measures obtained are content specific estimates of proficiency useful for…

Descriptors: Ability Grouping, Academic Achievement, Criterion Referenced Tests, Decision Making

A Cloze Is a Cloze Is a Cloze?

Brown, James Dean – 1983

This study attempted to determine the effectiveness of cloze procedures as norm-referenced instruments by comparing the differential responses of four groups of college students of English as a second language on two identical cloze passages. The responses were scored using both exact-answer and acceptable-word methods. The results indicate that…

Descriptors: Cloze Procedure, College Students, Comparative Analysis, English (Second Language)

Characteristics of Samples and Linking Items Affecting a Partial Pre-Calibrations Design.

Cook, Linda L.; And Others – 1987

This study tests several explanations for discrepant results in an earlier study (Cook et al., 1985) which presented a partial pre-calibration method for equating new editions of the Scholastic Aptitude Test (SAT) to the same scale as older editions. In contrast to full pre-calibration, which seeks to equate all items from two or more editions,…

Descriptors: College Entrance Examinations, Concurrent Validity, Equated Scores, Estimation (Mathematics)

An Approach to Measuring the Achievement or Proficiency of an Examinee.

Wilcox, Rand R. – 1979

Mastery tests are analyzed in terms of the number of skills to be mastered and the number of items per skill, in order that correct decisions of mastery or nonmastery will be made to a desired degree of probability. It is assumed that a random sample of skills will be selected for measurement, that each skill will be measured by the same number of…

Descriptors: Achievement Tests, Cutting Scores, Decision Making, Equivalency Tests

An Introduction to Generalizability Theory as a Contributor to Evaluation Research.

Gillmore, Gerald M. – 1979

It is argued in this paper that generalizability theory provides a uniquely useful framework for defining and quantifying the dependability of data for decision making. It does so by requiring careful specification of the conditions of measurement and the anticipated sources of variation in the results of the measurement procedure. A distinction…

Descriptors: Analysis of Variance, Criterion Referenced Tests, Decision Making, Educational Assessment
