Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 23 |
| Since 2007 (last 20 years) | 57 |
Descriptor
| Test Content | 136 |
| Test Reliability | 136 |
| Test Validity | 108 |
| Test Construction | 50 |
| Test Format | 40 |
| Testing | 39 |
| Test Interpretation | 35 |
| Test Reviews | 34 |
| Standardized Tests | 32 |
| Elementary Secondary Education | 30 |
| Children | 27 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 10 |
| Teachers | 6 |
| Administrators | 4 |
| Policymakers | 1 |
| Students | 1 |
Location
| California | 2 |
| Canada | 2 |
| China | 2 |
| Colorado (Denver) | 2 |
| Massachusetts | 2 |
| New York (New York) | 2 |
| Nigeria | 2 |
| North Carolina (Charlotte) | 2 |
| Tennessee (Memphis) | 2 |
| Australia | 1 |
| Delaware | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 3 |
| Every Student Succeeds Act… | 1 |
| Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Budescu, David V.; And Others – 1994
Modified Parallel Analysis (MPA) is a heuristic method for assessing "approximate unidimensionality" of item pools. It compares the second eigenvalue of the observed correlation matrix with the corresponding eigenvalue extracted from a "parallel" matrix generated by a unidimensional and locally independent model. Revised…
Descriptors: Equations (Mathematics), Heuristics, Item Analysis, Item Banks
Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007
This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…
Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness
Peer reviewedSaunders, Phillip – Journal of Economic Education, 1991
Discusses the content and cognitive specification of the third edition of the Test of Understanding in College Economics. Presents examples of the construction and sampling criteria employed in the latest and previous versions of the test. Explains that the test emphasizes recognition and understanding of basic terms, concepts, and principles with…
Descriptors: Economics Education, Educational Testing, Higher Education, Student Evaluation
Peer reviewedMardell-Czudnowski, Carol; Goldenberg, Dorothea S. – Learning Disabilities: A Multidisciplinary Journal, 1999
Describes the rationale and development of a new screening test, Developmental Indicators for the Assessment of Learning-3 (DIAL-3), which is based on previous editions. Empirical research is presented to document item additions and deletions as well as the continued maintenance of three performance areas: motor, concepts, and language. (Author/CR)
Descriptors: Child Development, Children, Cognitive Development, Disabilities
Peer reviewedStraus, Murray A.; Hamby, Sherry L.; Finkelhor, David; Moore, David W.; Runyan, Desmond – Child Abuse & Neglect: The International Journal, 1998
A study of 1,000 children examined the effectiveness of the Parent-Child Conflict Tactics Scales (CTSPC) in measuring parental psychological and physical maltreatment of children, as well as nonviolent modes of discipline. The CTSPC was found to be better suited to measuring child maltreatment than the original Conflict Tactics Scales. (Author/CR)
Descriptors: Child Abuse, Child Neglect, Discipline, Evaluation Methods
Searls, Evelyn F. – 1997
This monograph describes the third edition of the Wechsler Intelligence Scale for Children (WISC-III) and its relationship to reading/learning disabilities. It is designed for educators and students in education who want to go beyond the numerical values of the WISC-III intelligence quotients and understand the implications of the scores for the…
Descriptors: Disability Identification, Elementary Secondary Education, Intelligence Tests, Learning Disabilities
Peer reviewedRaphael, Dennis; Brown, Ivan; Renwick, Rebecca – International Journal of Disability, Development and Education, 1999
A study examined the reliability and validity of the Quality of Life Instrument Package using data from 500 persons with developmental disabilities in Ontario. Data indicate that most of the instruments found in the package met acceptable psychometric standards. Appropriate uses for the full and short version are discussed. (Author/CR)
Descriptors: Adults, Evaluation Methods, Foreign Countries, Mental Retardation
Peer reviewedHaney, Walt; Fowler, Clarke; Wheelock, Anne; Bebell, Damian; Malec, Nicole – Education Policy Analysis Archives, 1999
Using data from state and academic reports, an independent committee of researchers has evaluated the Massachusetts Teacher Tests. Scores are found to be highly unreliable, and the tests are found to contain questionable content. Suspending use of the tests is recommended. (SLD)
Descriptors: Beginning Teachers, Elementary Secondary Education, State Programs, Teacher Evaluation
Peer reviewedWainer, Howard – Education Policy Analysis Archives, 1999
The critique of the Massachusetts Teacher Tests by W. Haney and others points out some flaws in the tests but ignores the fact that the tests provide some useful information to guide teacher selection decisions. Calls for additional study of these teacher evaluation instruments. (SLD)
Descriptors: Beginning Teachers, Elementary Secondary Education, State Programs, Teacher Evaluation
Morreale, Sherwyn P.; And Others – 1996
This publication identifies and examines existing assessment instruments in oral communication for K-12 and higher education, and abstracts and describes assessment instruments and systematically reports their availability and interest to scholars, teachers, and administrators. It also provides background research needed to develop oral…
Descriptors: Alternative Assessment, Elementary Secondary Education, Higher Education, Oral Language
Melnick, Steven A.; Henk, William A. – 1997
This paper compares two methods of establishing content validity, forced-choice judgmental review and a latent category judgmental review. It also compares content validity evidence with the results of a scale reliability analysis and makes recommendations of the two content validity procedures. Two different groups of graduate students enrolled…
Descriptors: Classification, Comparative Analysis, Content Validity, Graduate Students
Ross, Steven; Hua, Te-Fang – 1994
A general issue related to language program development involves the empirical rationalization of cut score decisions in criterion-referenced language tests. Cut score dependability focuses on the consistency of the decisions in repeated testing or the assessment of language learner performances. In this case, the issue is to determine the optimal…
Descriptors: Achievement Gains, Criterion Referenced Tests, English (Second Language), Higher Education
Peer reviewedNoijons, Jose – CALICO Journal, 1994
Defines computer assisted language testing (CALT), discusses the various processes involved, outlines the advantages and disadvantages, and examines psychometric aspects of computer testing. A table of factors distinguishes between test content and the mechanics of test taking. These factors constitute a table for developing a CALT checklist. (24…
Descriptors: Check Lists, Computer Assisted Testing, Factor Analysis, Feedback
Peer reviewedFreedman, Elaine S. – Oxford Review of Education, 1998
Examines the similarities and differences in the methodologies and findings of six projects that investigated the statutory testing of 11-year-old children in England and Wales. Reveals a similarity of results despite a diversity of methods. Reviews the findings in relation to other research and provides suggestions for future research methods.…
Descriptors: British National Curriculum, Educational Practices, Educational Research, Elementary Education
Epstein, Michael H. – Diagnostique, 2000
This article provides an overview of strength-based assessment of children and adolescents with emotional and behavioral disorders and discusses the development and psychometric characteristics of the Behavioral and Emotional Rating Scale (BERS). The BERS is a norm-referenced, standardized test that assesses the strengths of children and…
Descriptors: Adolescents, Behavior Disorders, Behavior Rating Scales, Children


