ERIC - Search Results

Publication Date

In 2026	0
Since 2025	5
Since 2022 (last 5 years)	10
Since 2017 (last 10 years)	33
Since 2007 (last 20 years)	51

Descriptor

Test Length	133
Test Reliability	133
Test Validity	63
Test Items	44
Test Construction	42
Scores	24
Test Format	23
Computer Assisted Testing	21
Error of Measurement	20
Foreign Countries	20
Item Response Theory	19
Comparative Analysis	16
Statistical Analysis	16
Psychometrics	15
Difficulty Level	14
Item Analysis	14
Adaptive Testing	13
Language Tests	13
Testing Problems	13
Correlation	12
Higher Education	12
Mathematical Models	12
Testing	12
Mastery Tests	11
Cutting Scores	10
More ▼

Publication Type

Reports - Research	91
Journal Articles	74
Speeches/Meeting Papers	18
Reports - Evaluative	16
Reports - Descriptive	6
Tests/Questionnaires	4
Guides - Non-Classroom	3
Information Analyses	2
Opinion Papers	2
Reference Materials -…	2
Collected Works - Serials	1
Guides - General	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Higher Education	12
Postsecondary Education	11
Elementary Education	9
Secondary Education	6
Early Childhood Education	4
Grade 6	4
Intermediate Grades	4
Middle Schools	4
Primary Education	4
Grade 3	3
Grade 5	3
Grade 7	3
Junior High Schools	3
Elementary Secondary Education	2
Grade 2	2
Grade 4	2
Grade 8	2
High Schools	2
Grade 1	1
Grade 9	1
Kindergarten	1
More ▼

Audience

Researchers	4
Practitioners	2
Community	1
Support Staff	1

Location

China	4
Turkey	3
Australia	2
Canada	2
Ireland	2
Netherlands	2
Singapore	2
United Kingdom	2
Alabama	1
California	1
Germany	1
Illinois (Chicago)	1
Indiana	1
Japan	1
Kenya	1
Maryland	1
New Jersey	1
New Zealand	1
Pennsylvania	1
Peru	1
Poland	1
Portugal	1
South Korea	1
Spain	1
Taiwan	1
More ▼

Laws, Policies, & Programs

Job Training Partnership Act…

What Works Clearinghouse Rating

Test Reliability X

Showing 76 to 90 of 133 results Save | Export

Determining the Lengths for Criterion-Referenced Tests.

Peer reviewed

Hambleton, Ronald K.; And Others – Journal of Educational Measurement, 1983

A new method was developed to assist in the selection of a test length by utilizing computer simulation procedures and item response theory. A demonstration of the method presents results which address the influences of item pool heterogeneity matched to the objectives of interest and the method of item selection. (Author/PN)

Descriptors: Computer Programs, Criterion Referenced Tests, Item Banks, Latent Trait Theory

Multiple Choice and True-False: Reliability and Validity Compared.

Peer reviewed

Green, Kathy – Journal of Experimental Education, 1979

Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)

Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format

Short-Forms of the Schedule for Nonadaptive and Adaptive Personality (SNAP) for Self- and Collateral Ratings: Development, Reliability, and Validity.

Peer reviewed

Harlan, Elena; Clark, Lee Anna – Assessment, 1999

Reports the development of a paragraph-descriptor short form of the Schedule for Nonadaptive and Adaptive Personality (SNAP); (L. Clark, 1993) with self- and other versions. Data from 294 college students, with parental ratings for 94 students, support the reliability and validity of the measure. (SLD)

Descriptors: Adjustment (to Environment), College Students, Higher Education, Parents

Development of a Shortened Form of the Fennema-Sherman Mathematics Attitudes Scales.

Peer reviewed

Mulhern, Fiona; Rae, Gordon – Educational and Psychological Measurement, 1998

Data from 196 Irish school children were analyzed and used to develop a shortened version of the Fennema-Sherman Mathematics Attitudes Scales (E. Fennema and J. Sherman, 1976). Internal consistency estimates of the reliability of scores on the whole scale and each of the subscales of the original and short form were favorable. (SLD)

Descriptors: Attitude Measures, Elementary Education, Elementary School Students, Foreign Countries

Estimating the Reliability of Classifications Based on Composite Scores.

Download full text

Livingston, Samuel A. – 1984

Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…

Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models

Contributions to Criterion-Referenced Testing Technology.

Peer reviewed

Hambleton, Ronald K., Ed. – Applied Psychological Measurement, 1980

This special issue covers recent technical developments in the field of criterion-referenced testing. An introduction, six papers, and two commentaries dealing with test development, test score uses, and evaluation of scores review relevant literature, offer new models and/or results, and suggest directions for additional research. (SLD)

Descriptors: Criterion Referenced Tests, Mastery Tests, Measurement Techniques, Standard Setting (Scoring)

Concurrent Validity and Reliability of the Kaufman Version of the McCarthy Scales Short Form for a Sample of Mexican-American Children.

Peer reviewed

Valencia, Richard R.; Rankin, Richard J. – Educational and Psychological Measurement, 1983

The concurrent validity and reliability of Kaufman's short-form version of the McCarthy Scales of Children's Abilities were examined for a sample of 342 Mexican-American preschool and kindergarten age children. The results showed that generally the positive psychometric properties of the Kaufman short form were also noted for the children in this…

Descriptors: High Risk Students, Mexican Americans, Preschool Education, Preschool Tests

On the Theory of a Set of Tests Which Differ Only in Length

Peer reviewed

Kristof, Walter – Psychometrika, 1971

Descriptors: Cognitive Measurement, Error of Measurement, Mathematical Models, Psychological Testing

Multiple-Choice and True/False Tests: Myths and Misapprehensions

Peer reviewed

Direct link

Burton, Richard F. – Assessment and Evaluation in Higher Education, 2005

Examiners seeking guidance on multiple-choice and true/false tests are likely to encounter various faulty or questionable ideas. Twelve of these are discussed in detail, having to do mainly with the effects on test reliability of test length, guessing and scoring method (i.e. number-right scoring or negative marking). Some misunderstandings could…

Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Test Reliability

The Standard Errors of the Feldt-Gilmer Congeneric Reliability Coefficients: Iowa Testing Programs Occasional Papers. Number 31.

PDF pending restoration

Gilmer, Jerry S.; Feldt, Leonard S. – 1982

The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…

Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models

The Development of More Efficient Measures for Evaluating Language Impairments in Aphasic Patients.

Download full text

Phillips, Phyllis P.; Halpin, Gerald – 1975

Because it generally took over an hour to administer the Porch Index of Communicative Ability (PICA), a shorter but comparable version of the test was developed. The original test was designed to quantify aphasic patients' ability level on common communicative tasks and consisted of 18 ten-item subtests. Each item resulted in a proficiency rating,…

Descriptors: Adults, Aphasia, Equated Scores, Language Handicaps

The Generalizability of the Matching Familiar Figures Test.

Watkins, John M.; And Others – 1978

Generalizability theory was applied to the Matching Familiar Figures Test (MFF), an instrument commonly employed to assess reflection-impulsivity in children, in order to analyze the dependability of the MFF at four grade levels: second, third, fourth, and fifth. The MFF was individually administered to 114 boys. A completely crossed, two-facet…

Descriptors: Age Differences, Cognitive Development, Cognitive Tests, Conceptual Tempo

Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004

The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…

Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items

A Comparison of Two Item Selection Procedures for Building Criterion-Referenced Tests.

Download full text

Haladyna, Tom; Roid, Gale – 1981

Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…

Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction

Test Length and Validity: An Application of Test Theory to a Finite World.

Myers, Charles T. – 1978

The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…

Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Educational and Psychological…	13
Journal of Psychoeducational…	8
Applied Psychological…	5
Journal of Educational…	5
Psychometrika	4
Applied Measurement in…	3
Language Testing	3
Assessment & Evaluation in…	2
ETS Research Report Series	2
International Journal of…	2
Journal of Personality…	2
Psychological Assessment	2
Research Matters	2
ACT Education Corp.	1
AERA Online Paper Repository	1
African Educational Research…	1
Anatomical Sciences Education	1
Assessment	1
Assessment and Evaluation in…	1
College Student Journal	1
Contemporary Educational…	1
Education and Information…	1
Educational Research and…	1
Educational Sciences: Theory…	1
Eurasian Journal of…	1
More ▼

Hambleton, Ronald K.	4
Burton, Richard F.	3
Cliff, Norman	2
Gilmer, Jerry S.	2
Huynh, Huynh	2
Lee, Yi-Hsuan	2
Leite, Walter L.	2
Livingston, Samuel A.	2
Marcoulides, Katerina M.	2
Raborn, Anthony W.	2
Reckase, Mark D.	2
Wilcox, Rand R.	2
Yao, Lihua	2
Zhang, Jinming	2
de Jong, John H. A. L.	2
Abrams, Matthew	1
Allison, Paul A.	1
Almeida, Leandro S.	1
Anderson, Judith A.	1
Andrea Fuster	1
Andy Rick Sánchez-Villena	1
Anthony, Christopher J.	1
Anthony, Christopher James	1
Arens, A. Katrin	1
More ▼

Wechsler Adult Intelligence…	3
McCarthy Scales of Childrens…	2
Peabody Picture Vocabulary…	2
Test of English as a Foreign…	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Adaptive Behavior Scale	1
Armed Forces Qualification…	1
Comprehensive Tests of Basic…	1
Developmental Indicators for…	1
Draw a Person Test	1
Fennema Sherman Mathematics…	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Matching Familiar Figures Test	1
Measures of Academic Progress	1
Medical College Admission Test	1
Minnesota Multiphasic…	1
Multidimensional…	1
National Assessment of…	1
Positive and Negative Affect…	1
School and College Ability…	1
Self Description Questionnaire	1
Stanford Binet Intelligence…	1
More ▼