Guilliams, Clark I. – 1975
Chicano and Amerindian vocabulary scale responses from the Stanford-Binet (LM) and Wechsler Intelligence Scale for Children were item-analyzed for 1,009 subjects. The response patterns differed both by ethnic group and test, as well as by age. The most common, and recurring, pattern found was "level-of-difficulty" gradient…
Descriptors: American Indians, Correlation, Disadvantaged, Elementary Education
Swezey, Robert W.; Pearlstein, Richard B. – 1975
This manual outlines the rationale for using the Criterion Referenced Test (CRT) approach and suggests specific guidelines for test developers to use in constructing test items. Methods for assessing the adequacy of a CRT are also provided. (Author/RC)
Descriptors: Behavioral Objectives, Check Lists, Comparative Analysis, Criterion Referenced Tests
Ross, Steven J.; Okabe, Junko – International Journal of Testing, 2006
Test validity is predicated on there being a lack of bias in tasks, items, or test content. It is well-known that factors such as test candidates' mother tongue, life experiences, and socialization practices of the wider community may serve to inject subtle interactions between individuals' background and the test content. When the gender of the…
Descriptors: Gender Bias, Language Tests, Test Validity, Reading Comprehension
Byrd-Bredbenner, Carol; Wheatley, Virginia; Schaffner, Donald; Bruhn, Christine; Blalock, Lydia; Maurer, Jaclyn – Journal of Food Science Education, 2007
Little is known about the food safety knowledge of young adults. In addition, few knowledge questionnaires and no comprehensive, criterion-referenced measure that assesses the full range of food safety knowledge could be identified. Without appropriate, valid, and reliable measures and baseline data, it is difficult to develop and implement…
Descriptors: Safety, Food Standards, Diseases, Criterion Referenced Tests
Clark, Burton A. – 1990
This first test and technical manual explains the development, purpose, use, validity, and reliability of the criterion-referenced final examination for the Chemistry of Hazardous Materials course of the National Fire Academy in Emmitsburg (Maryland). Because of possible diversity in the testing knowledge and background of readers, the manual…
Descriptors: Chemistry, Criterion Referenced Tests, Cutting Scores, Fire Fighters
Oaster, T. R. F.; And Others – 1986
This study hypothesized that items in the one-question-per-passage format would be less easily answered when administered without their associated contexts than conventional reading comprehension items. A total of 256 seventh- and eighth-grade students were administered both Forms 3A and 3B of the Sequential Tests of Educational Progress (STEP II).…
Descriptors: Context Effect, Difficulty Level, Grade 7, Grade 8
Crowe, Kevin; Snow, Mary – 1986
The Nevada Department of Education implemented a standard setting study of the Pre-Professional Skills Tests. The tests constitute a secure test battery developed by the Educational Testing Service to assess basic proficiency in reading, writing, and mathematics via three multiple-choice tests and a written essay test. The Nevada study, in the…
Descriptors: College Entrance Examinations, Competency Based Teacher Education, Cutting Scores, Essay Tests
Ridley, Dennis R. – 1986
This report presents an analysis of a student evaluation survey, the Uniform Student Evaluation Survey (USES), used to provide a systematic, campus-wide basis for meaningful feedback on the instructional effectiveness of college teaching at Christopher Newport College (Virginia). This study addressed three issues as to the reliability, validity,…
Descriptors: College Faculty, Correlation, Curriculum Evaluation, Feedback
Baker, Eva L.; And Others – 1980
The materials presented were developed for use in a series of conferences on testing and instruction sponsored by the National Institute of Education, with the United States Office of Education, the UCLA Center for the Study of Evaluation, and a network of research and development agencies. They are intended for use by school practitioners and…
Descriptors: Criterion Referenced Tests, Elementary Secondary Education, Evaluation Criteria, Item Analysis
Robertson, David W.; Montague, William E. – 1976
In order to look for evidence of possible racial bias in six of the Navy tests for the Enlisted Advancement System, statistical data for a group of black personnel were compared with data for a selected matching group of whites. The first step was to compute item difficulty indices (the percent passing each item) for the black group, for the…
Descriptors: Blacks, Comparative Analysis, Difficulty Level, Enlisted Personnel
Roid, Gale; Haladyna, Tom – 1978
The technology of transforming sentences from prose instruction into test questions was examined by comparing two methods of selecting sentences (keyword vs. rare singleton), two types of question words (nouns vs. adjectives), and two foil construction methods (writer's choice vs. algorithmic). Four item writers created items using each…
Descriptors: Algorithms, Cloze Procedure, Comparative Analysis, Criterion Referenced Tests
Holley, Freda M. – 1976
To explain the discrepancy between median scores on the 1976 administration of the California Achievement Tests (CAT) and the Sequential Tests of Educational Progress (STEP) in the Austin Independent School District (AISD), ten technical variables typical of achievement tests were considered as explanations. (1) The STEP may measure different…
Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Curriculum
Canner, Jane M.; Lenke, Joanne M. – 1980
Item response data were obtained from large samples of students in Grades K-community college, taking the following tests: Stanford Early School Achievement Test, Stanford Achievement Test, Stanford Test of Academic Skills, Stanford Diagnostic Reading Test, and Stanford Diagnostic Mathematics Test. Data were classified as fitting or non-fitting…
Descriptors: Achievement Tests, Elementary Secondary Education, Goodness of Fit, Item Analysis
Tenenbaum, Arlene Bonnie; Miller, Christine A. – 1977
In the evaluation of Project Information Packages (PIPs), a content analysis was performed to detect congruence between items in a norm-referenced test and the content in six exemplary compensatory education program curricula. Gains on congruent items were used to assess the effectiveness of the programs. Preliminary results show that the amount…
Descriptors: Academic Achievement, Achievement Gains, Achievement Tests, Compensatory Education
O'Reilly, Robert P.; And Others – 1976
The report proposes to complete the validation and refinement of a new domain referenced testing technology designed to assess literal comprehension ability in students in grades 1-12. The domain referenced measures in this technology, along with other more traditional measures of reading comprehension, literal and non-literal, are subsequently…
Descriptors: Classroom Techniques, Cloze Procedure, Criterion Referenced Tests, Elementary Secondary Education

