Armstrong, Ronald D.; And Others – Journal of Educational Statistics, 1994 (Peer reviewed)
A network-flow model is formulated for constructing parallel tests based on classical test theory while using test reliability as the criterion. Practitioners can specify a test-difficulty distribution for values of item difficulties as well as test-composition requirements. An empirical study illustrates the reliability of generated tests. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Difficulty Level, Item Banks
Zimmerman, Donald W.; And Others – Applied Psychological Measurement, 1993 (Peer reviewed)
Some of the methods originally used to find relationships between reliability and power associated with a single measurement are extended to difference scores. Results, based on explicit power calculations, show that augmenting the reliability of measurement by reducing error score variance can make significance tests of difference more powerful.…
Descriptors: Equations (Mathematics), Error of Measurement, Individual Differences, Mathematical Models
Humphreys, Lloyd G.; And Others – Applied Psychological Measurement, 1993 (Peer reviewed)
Two articles discuss the controversy about the relationship between reliability and the power of significance tests in response to the discussion of Donald W. Zimmerman, Richard H. Williams, and Bruno D. Zumbo. Lloyd G. Humphreys emphasizes the differences between what statisticians can do and constraints on researchers. Zimmerman, Williams, and…
Descriptors: Error of Measurement, Individual Differences, Power (Statistics), Research Methodology
Roznowski, Mary; Smith, Marna L. – Intelligence, 1993 (Peer reviewed)
Measurement and psychometric quality of the Sternberg task (S. Sternberg, 1966, 1969), a memory search task, was investigated with 78 undergraduates. Individual performance was fairly homogeneous across responses, fairly unstable over time, and fairly stable across stimulus content. Implications for individual differences research are discussed.…
Descriptors: Cognitive Tests, Evaluation Methods, Higher Education, Individual Differences
Matson, Johnny L.; Smiroldo, Brandi B. – Research in Developmental Disabilities, 1997 (Peer reviewed)
A study tested the validity of the Diagnostic Assessment for the Severely Handicapped-II (DASH-II) for determining the presence of mania (bipolar disorder) in 22 individuals with severe mental retardation. Results found the mania subscale to be internally consistent and able to be used to classify manic and control subjects accurately. (Author/CR)
Descriptors: Adults, Clinical Diagnosis, Disability Identification, Evaluation Methods
Dozois, David J. A.; Ahnberg, Jamie L.; Dobson, Keith S. – Psychological Assessment, 1998 (Peer reviewed)
Provides psychometric information on the second edition of the Beck Depression Inventory (BDI-II) (A. Beck, R. Steer, and G. Brown, 1996) for internal consistency, factorial validity, and gender differences. Results indicate that the BDI-II is a stronger instrument than its predecessor in terms of factor structure. (SLD)
Descriptors: Depression (Psychology), Factor Analysis, Factor Structure, Psychometrics
Scarsellone, Jana M. – Journal of Speech, Language, and Hearing Research, 1998 (Peer reviewed)
Hearing in Noise Test (HINT) list equivalency was examined using 24 listeners (ages 60 to 70) with sensorineural hearing impairments. Four speech conditions were tested, including a quiet condition and three noise conditions. Results found that for the three noise conditions, all lists were within 2 dB of the means, indicating list equivalency.…
Descriptors: Auditory Evaluation, Auditory Perception, Communication Research, Generalization
Matson, Johnny L.; Mayville, Erik A.; Bielecki, JoAnne; Barnes, W. Harvin; Bamburg, Jay W.; Baglio, Christopher S. – Research in Developmental Disabilities, 1998 (Peer reviewed)
A study involving 200 adults with mental retardation investigated the interrater reliability and internal consistency of the Matson Evaluation of Drug Side Effects (MEDS), a scale designed to evaluate commonly identified side effects with a psychometrically sound checklist. The MEDS had excellent consistency across raters and good internal…
Descriptors: Adults, Drug Therapy, Drug Use, Evaluation Methods
Moss, Pamela A.; Schutz, Aaron – Phi Delta Kappan, 1999
Considers four key decision points in the National Board for Professional Teaching Standards' assessment-development process: development of content standards; development of tasks guiding candidates in providing evidence about their teaching; development of scoring rubrics and benchmarks; and determination of the performance standard that…
Descriptors: Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment, Scoring Rubrics
Riccio, Cynthia A.; Boan, Candace H.; Staniszewski, Deborah; Hynd, George W. – Diagnostique, 1997
A study involving 120 school-aged children that investigated the concurrent validity of measures of written language found that the Wechsler Individual Achievement Test Written Expression subtest correlates moderately with the Written Expression subtest of the Peabody Individual Achievement Test-Revised and the Spontaneous Writing Quotient of the…
Descriptors: Elementary Secondary Education, Learning Disabilities, Test Reliability, Test Validity
Kapci, Emine G. – Early Child Development and Care, 1999 (Peer reviewed)
This study examined the validity and reliability of the Pre-school Behaviour Checklist (PBCL) for Turkish nursery school children. Data were obtained from 902 children, 24 to 82 months old, attending state or private nursery schools. Findings suggested that the PBCL has psychometric properties comparable to the British sample and could be used…
Descriptors: Behavior Problems, Check Lists, Child Behavior, Foreign Countries
Kaminski, Ruth A.; Good, Roland H., III – School Psychology Review, 1996 (Peer reviewed)
Examines the reliability, validity, and sensitivity of experimental measures developed to assess three areas of early literacy: phonological awareness, vocabulary development, and fluency in letter naming. Results indicate which measures display adequate psychometric properties for kindergartners not yet reading. Experimental measures were less…
Descriptors: Emergent Literacy, Grade 1, Kindergarten Children, Language Fluency
Owen, T. Ross – Journal of Educational Opportunity, 1997
A study investigated the validity and reliability of a new instrument for assessing the wellness lifestyles of Upward Bound students. Subjects were 42 students from five high schools using the program. The study examined 14 variables, including total scores, 10 subscales, and three demographic variables (age, race, gender), and concluded that the…
Descriptors: College Students, High School Students, High Schools, Measurement Techniques
Maes, B.; Fryns, J. P.; Ghesquiere, P.; Borghgraef, M. – Mental Retardation, 2000 (Peer reviewed)
A study investigated the effectiveness of a phenotypic checklist for identifying 110 males with fragile X syndrome and 79 controls, matched for age, level of cognitive development, and social adaptation. Results indicated that those boys who are likely to be diagnosed as having fragile X syndrome can be identified. (Contains references.)…
Descriptors: Adults, Check Lists, Children, Clinical Diagnosis
Reese, Elaine; Read, Stephanie – Journal of Child Language, 2000 (Peer reviewed)
Assessed long-term predictive validity of the MacArthur Communicative Development Inventories: Words and Sentences (CDI:WS) for children's expressive and receptive vocabulary development. Sixty-one New Zealand children were assessed with a New Zealand version of the CDI, and with the Expressive Vocabulary Test and Peabody Picture Vocabulary…
Descriptors: Child Language, Educational Attainment, Foreign Countries, Language Tests