Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 53 |
| Since 2017 (last 10 years) | 155 |
| Since 2007 (last 20 years) | 553 |
Descriptor
Source
Author
| Hambleton, Ronald K. | 37 |
| Popham, W. James | 30 |
| Ediger, Marlow | 15 |
| Roid, Gale | 12 |
| Baker, Eva L. | 11 |
| Wilcox, Rand R. | 11 |
| Berk, Ronald A. | 10 |
| Haladyna, Tom | 10 |
| Livingston, Samuel A. | 10 |
| Millman, Jason | 10 |
| Nitko, Anthony J. | 10 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 255 |
| Teachers | 141 |
| Researchers | 99 |
| Administrators | 31 |
| Policymakers | 17 |
| Parents | 13 |
| Community | 9 |
| Students | 6 |
| Counselors | 2 |
| Support Staff | 1 |
Location
| Georgia | 85 |
| Australia | 50 |
| Florida | 36 |
| Missouri | 31 |
| Texas | 28 |
| Canada | 24 |
| Oklahoma | 18 |
| Illinois | 17 |
| United States | 17 |
| South Carolina | 16 |
| California | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 5 |
Peer reviewedWilcox, Rand R. – Educational and Psychological Measurement, 1981
This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)
Descriptors: Criterion Referenced Tests, Scoring, Test Reliability
Stevens, Karmenlita L. – ProQuest LLC, 2009
The purpose of this study is to compare the teacher retention rates in public elementary and middle schools in Georgia that met or did not meet the academic performance component of Adequate Yearly Progress. The teacher retention rates were expected to be higher in schools that met the academic performance component of AYP and lower in the schools…
Descriptors: Report Cards, Middle Schools, Teacher Persistence, Educational Improvement
Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…
Descriptors: Test Items, Classification, Psychometrics, Item Response Theory
Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009
Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…
Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory
Horton, Tracy D. – ProQuest LLC, 2010
The purpose of this study was to examine the effect an afterschool program had on middle school at-risk students' standardized test scores and behavior. The study examined students who participated in the 21st Century Community Learning Center afterschool program at two similar schools in a county in Northwest Georgia. Data were compiled for the…
Descriptors: Reading Achievement, Standardized Tests, At Risk Students, Measures (Individuals)
Petscher, Yaacov; Foorman, Barbara – Society for Research on Educational Effectiveness, 2009
The current study will examine possible contextual effects relative to differences in reading comprehension performance in the state of Florida. While the Reading First (RF) Impact study examined such difference using a regression discontinuity design, the authors are primarily interested in other analytic methods that might answer different…
Descriptors: Reading Comprehension, Criterion Referenced Tests, Comparative Analysis, Reading Programs
Herring, Phillip Allen – ProQuest LLC, 2009
The purpose of the study was to analyze the science outreach program, Science In Motion (SIM), located in Mobile, Alabama. This research investigated what impact the SIM program has on student cognitive functioning and teacher efficacy and also investigated teacher perceptions and attitudes regarding the program. To investigate student…
Descriptors: Teacher Effectiveness, Science Programs, Outreach Programs, Criterion Referenced Tests
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Klecker, Beverly M.; Chapman, Ann – Online Submission, 2008
The purpose of this paper was three-fold: (1) to review mastery learning and criterion-based assessment; (2) to advocate extending these concepts to higher education; and (3) to invite MSERA members to join in research projects examining mastery learning in higher education. The authors used Guskey's (2001) definition of mastery learning from his…
Descriptors: Feedback (Response), Higher Education, Heuristics, Mastery Learning
Alsbury, Thomas L. – Leadership and Policy in Schools, 2008
District-level leadership often has been perceived as irrelevant to educational reform. This study compared district political and apolitical board and superintendent turnover to student performance change on the state criterion-referenced test. Results included student test score decline as board turnover increased, particularly in smaller…
Descriptors: Test Score Decline, Criterion Referenced Tests, Educational Change, Scores
Nehm, Ross H.; Schonfeld, Irvin Sam – Journal of Research in Science Teaching, 2008
Growing recognition of the central importance of fostering an in-depth understanding of natural selection has, surprisingly, failed to stimulate work on the development and rigorous evaluation of instruments that measure knowledge of it. We used three different methodological tools, the Conceptual Inventory of Natural Selection (CINS), a modified…
Descriptors: Evolution, Science Education, Interviews, Measures (Individuals)
Marshall, J. Laird; Haertel, Edward H. – 1975
For classical, norm-referenced test reliability, Cronbach's alpha has been shown to be equal to the mean of all possible split-half Pearson product-moment correlation coefficients, adjusted by the Spearman-Brown prophecy formula. For criterion-referenced test reliability, in an analogous vein, this paper provides the rationale behind, the analysis…
Descriptors: Criterion Referenced Tests, Statistical Analysis, Test Reliability
Swezey, Robert W. – 1976
Though domain-oriented and norm-referenced tests are appropriate for some situations, objective-oriented and criterion-referenced tests must be used to gather additional information. Objectives for such tests must include a statement of the desired performance, the test conditions, and the standards of acceptance. When tests are constructed the…
Descriptors: Criterion Referenced Tests, Speeches, Test Construction, Testing
Haladyna, Thomas M. – 1976
The objectives of this study were to first determine whether or not the empirical item analysis of domain referenced tests (DR) was justified; and second, in the event that it was, which of a set of recommended procedures was most effective for determining item quality. The analysis that followed led to the conclusion that empirical procedures…
Descriptors: Criterion Referenced Tests, Item Analysis, Statistical Analysis
Herbig, Manfred – Programmed Learning and Educational Technology, 1976
The relationship between criterion, test items, and instruction is discussed to show the problems of the pretest and posttest evaluation of criterion referenced items. (JY)
Descriptors: Criterion Referenced Tests, Item Analysis, Pretests Posttests

Direct link
