Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedNewsham, Gwen S. – Canadian Modern Language Review, 1989
Information in published articles on communicative testing is examined and discussed from the point of view of a classroom teacher. The administrability, reliability, and validity of communicative language testing are highlighted. (MSE)
Descriptors: Communicative Competence (Languages), Language Tests, Research Utilization, Scholarly Journals
Peer reviewedBerry, David T. R.; And Others – Psychological Assessment, 1992
Validity of 3 scales of the Minnesota Multiphasic Personality Inventory (MMPI), the F, back F, and variable response inconsistency, for detecting self-reported partially random responding was supported by 3 studies involving 195 college students and 68 community participants but not by a study with 32 police job applicants. (SLD)
Descriptors: Adults, College Students, Comparative Testing, Higher Education
Poteet, James A. – Diagnostique, 1990
A framework is presented for implementing standardized achievement testing. Fundamental concepts and formats of the tests are reviewed, and useful references are listed. Standardized tests are considered and categorized in terms of administration format, functions, validity, and type (cognitive versus noncognitive). Thirteen basic recommendations…
Descriptors: Achievement Tests, Elementary Secondary Education, Learning Problems, Standardized Tests
Peer reviewedGillespie, Maggie – Community College Review, 1993
Analyzes Hughes and Nelson's study of placement testing at Riverside Community College and offers an alternative method of evaluating placement systems using logistic regression. Reanalyzes data from an ASSET placement test validity study using logistic regression. (DMM)
Descriptors: Community Colleges, Evaluation Methods, Evaluation Research, Statistical Analysis
Peer reviewedAnglin, M. Douglas; And Others – Evaluation Review, 1993
Reliability and validity of self-reported behavior within a deviant population are examined using data from 2 interviews with 323 narcotics addicts conducted 10 years apart (1974-75 and 1985-86). Results complement existing reliability and validity studies of alcohol use, and suggest that quality information can be obtained from heroin users. (SLD)
Descriptors: Comparative Testing, Drinking, Drug Addiction, Evaluation Methods
Cizek, Gregory J. – Phi Delta Kappan, 1991
This rejoinder to Grant Wiggins on performance assessment suggests that true educational reform will undoubtedly be evidenced by something more substantial than pocket folders bulging with student work. Labeling performance tests "authentic" does not ensure their validity, reliability, or incorruptibility. Such tests are neither replacements nor…
Descriptors: Elementary Secondary Education, Multiple Choice Tests, Performance Based Assessment, Pilot Projects
Peer reviewedCohen, Robert; And Others – Academic Medicine, 1991
The performance of foreign medical school graduates on multistation standardized patient-based tests was used to determine the validity and generalizability of global ratings of their clinical competence made by expert examiners. Results suggest that these ratings can be used as an effective form of assessment in this context. (Author/MSE)
Descriptors: Foreign Medical Graduates, Higher Education, Holistic Approach, Medical Education
Putnam, Frank W.; And Others – Child Abuse and Neglect: The International Journal, 1993
Evaluation of the Child Dissociative Checklist found it to be a reliable and valid observer report measure of dissociation in children, including sexually abused girls and children with dissociative disorder and with multiple personality disorder. The checklist, which is appended, is intended as a clinical screening instrument and research measure…
Descriptors: Check Lists, Children, Emotional Disturbances, Psychological Evaluation
Peer reviewedGellman, Estelle S. – Action in Teacher Education, 1993
Portfolio assessment can be a valuable tool in assessing professional proficiency in teachers if appropriate attention is given to issues of reliability and validity. The Teaching Assessment Project at Stanford University has explored portfolios as an alternative to traditional methods of teacher evaluation. (IAH)
Descriptors: Elementary Secondary Education, Portfolios (Background Materials), Teacher Competencies, Teacher Competency Testing
Peer reviewedWrobel, Nancy Howells; Lachar, David – Psychology in the Schools, 1998
Examines the comparative validity of a parent-report scale and a self-report scale, both designed to assess behavioral and emotional problems. Results, based on 111 children in regular education classrooms, indicate that parent reports were more sensitive to overt behavioral problems, whereas self-reports were sensitive to mood disturbances and…
Descriptors: Behavior Problems, Children, Comparative Testing, Elementary Education
Peer reviewedEriksson, Marten – Applied Psycholinguistics, 2001
Explores the criterion-related validity of the Swedish version of the Communicative Development Inventories--Words & Sentences (SECDI-W&S). In two follow-up procedures, SECDI-W&S was used to assess vocabulary and grammar skills in 32 children. Overall results confirm that the criterion-related validity of the SECDI is sound. (Author/VWL)
Descriptors: Criterion Referenced Tests, Grammar, Language Tests, Personal Narratives
Peer reviewedNorris, Charles E. – Journal of Research in Music Education, 2000
Explores the validity of reproduction tonal memory tests by examining the relationships among performances on an existing reproduction tonal memory test and several recognition tonal memory tests. Tested 210 fifth through twelfth grade students. Concludes that there is a moderate relationship among performances on the tests. Includes references.…
Descriptors: Higher Education, Intermediate Grades, Music Education, Secondary Education
Peer reviewedJournal of School Improvement, 2000
States that standard scores are the numerical universal language for reporting and comparisons. Discusses what standard scores are, specifically, and why they are used, along with how the conversion assessment of raw scores to standard scores is accomplished. Provides contact information for those who would like to further their knowledge on the…
Descriptors: Educational Practices, Elementary Secondary Education, Higher Education, Standard Setting (Scoring)
Peer reviewedQualls, Audrey L.; Moss, Angela D. – Educational and Psychological Measurement, 1996
The extent to which testing practices complied with professional guidelines regarding reliability and validity evidence was studied in research appearing in American Psychological Association journals. Documentation of reliability and validity was reported for 20% of the 2,157 instruments studied in these papers. About half supported one or the…
Descriptors: Congruence (Mathematics), Documentation, Educational Practices, Research Reports
Peer reviewedGriggs, Richard A. – Teaching of Psychology, 2000
Presents a class activity, in which students take two tests, that requires minimal preparation and encourages discussion on important aspects of testing, such as testing bias. Describes the procedure. Includes the two tests and the answers. (CMK)
Descriptors: Course Content, Educational Strategies, Higher Education, Intelligence


