Peer reviewed – Pfeiffer, Steven I.; Reddy, Linda A.; Kletzel, Jeffrey E.; Schmelzer, Elizabeth R.; Boyer, Lynn M. – School Psychology Quarterly, 2000
Surveys 354 nationally certified school psychologists on the perceived usefulness of the Wechsler Intelligence Scale for Children-III (WISC-III) in general and profile analysis in particular. Practitioners rated the WISC-III as very useful for determining diagnosis and educational placement, but less useful for developing instructional strategies…
Descriptors: Children, Clinical Diagnosis, Intelligence Tests, Profiles
Peer reviewed – Mittag, Kathleen C.; Thompson, Bruce – Educational Researcher, 2000
Surveyed AERA members regarding their perceptions of: statistical issues and statistical significance testing; the general linear model; stepwise methods; score reliability; type I and II errors; sample size; statistical probabilities as exclusive measures of effect size; p values as direct measures of result value; and p values evaluating…
Descriptors: Educational Research, Elementary Secondary Education, Research Methodology, Statistical Significance
Peer reviewed – Baume, David; Yorke, Mantz – Studies in Higher Education, 2002
Analyzed the assessments of 53 portfolios used to evaluate participants in a development course for higher education teachers at the United Kingdom's Open University. Findings included a high reliability in assessment at the level of course outcomes, and that cumulation of component assessments is very likely to reduce the reliability of overall…
Descriptors: Foreign Countries, Higher Education, Interrater Reliability, Portfolio Assessment
Peer reviewed – Gaudet, Laura; Pulos, Steve; Crethar, Hugh; Burger, Susan – Education and Training in Mental Retardation and Developmental Disabilities, 2002
In this study, self-reports of 34 individuals with developmental disabilities (DD) were compared with proxy ratings from family and providers. Correlations between the ratings of individuals with DD and the proxy raters were low, as were the correlations between family members and providers. In all scales except "cognition," the individual with DD…
Descriptors: Adults, Developmental Disabilities, Evaluation Methods, Interrater Reliability
Peer reviewed – Rimmer, James H.; Riley, Barth B.; Rubin, Stephen S. – American Journal of Health Promotion, 2001
Assessed the psychometric properties of the Physical Activity and Disability Survey (PADS), which measures physical activity for people with disabilities and chronic health conditions. Cross-sectional and pre-post designs were employed with 103 people who had disabilities and chronic health conditions. Results supported the PADS' reliability and…
Descriptors: Chronic Illness, Disabilities, Evaluation Methods, Physical Activity Level
Peer reviewed – Connolly, Mary Beth; Crits-Christoph, Paul; Shelton, Richard C.; Hollon, Steven; Kurtz, John; Barber, Jacques P.; Butler, Stephen F.; Baker, Sharon; Thase, Michael E. – Journal of Counseling Psychology, 1999
Evaluates the reliability and validity of a new self-report measure of Self-Understanding of Interpersonal Patterns (SUIP). Measure demonstrates good internal consistency, test-retest reliability, and discriminant validity. The SUIP further demonstrates convergent validity with measures of analytical and self-improving personality traits in a…
Descriptors: Interpersonal Competence, Psychotherapy, Self Concept, Self Concept Measures
Peer reviewed – Wiener, Judith; Smith, Catherine M. – Journal of College Reading and Learning, 1999
Analyzes a learning disabilities (LD) questionnaire administered to 150 adults including college and university students, and adults referred to a psychoeducational clinic for assessment or treatment related to LD. Finds that the LD screening test had internal consistency and good test-retest reliability, as well as criterion validity. Finds…
Descriptors: Higher Education, Learning Disabilities, Psychoeducational Methods, Test Construction
Peer reviewed – Stockbrugger, Barry A.; Haennel, Robert G. – Journal of Strength and Conditioning Research, 2001
Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…
Descriptors: Athletes, Evaluation Methods, Muscular Strength, Plyometrics
Peer reviewed – Gati, Itamar; Saka, Noa – Journal of Career Assessment, 2001
The Career Decision-Making Difficulties Questionnaire was completed in a Hebrew paper version (n=417) and Internet version (n=837), showing similar internal consistency and reliability in both versions. Response pattern of 24% of Internet users was questionable. Comparison of results from English paper (n=403) and Internet (n=182) versions found…
Descriptors: Career Choice, Computer Assisted Testing, Decision Making, English
Peer reviewed – Kelly, William E. – College Student Journal, 2004
Undergraduate students (N = 150) participated in a study developing a 10-item scale (the Noctcaelador Inventory; NI) to measure noctcaelador: adoration and attachment to the night sky. The NI demonstrated good internal consistency, test-retest reliability, normality, and preliminary validity. The scale significantly correlated with self-reported…
Descriptors: Undergraduate Students, Test Reliability, Measures (Individuals), Individual Differences
Reliability and Validity of the Beck Depression Inventory--II with Adolescent Psychiatric Inpatients
Osman, Augustine; Kopper, Beverly A.; Barrios, Frank; Gutierrez, Peter M.; Bagge, Courtney L. – Psychological Assessment, 2004
This investigation was conducted to validate the Beck Depression Inventory--II (BDI-II; A. T. Beck, R. A. Steer, & G. K. Brown, 1996) in samples of adolescent psychiatric inpatients. The sample in each substudy was primarily Caucasian. In Study 1, expert raters (N=7) and adolescent psychiatric inpatients (N=13) evaluated the BDI-II items to assess…
Descriptors: Patients, Test Reliability, Test Validity, Depression (Psychology)
Peer reviewed – Cruz, Luiz M.; Moreira, Marcelo J. – Journal of Human Resources, 2005
The authors evaluate Angrist and Krueger (1991) and Bound, Jaeger, and Baker (1995) by constructing reliable confidence regions around the 2SLS and LIML estimators for returns-to-schooling regardless of the quality of the instruments. The results indicate that the returns-to-schooling were between 8 and 25 percent in 1970 and between 4 and 14…
Descriptors: School Attendance Legislation, Compulsory Education, Measurement Techniques, Computation
Goldberg, Mark F. – Education Digest: Essential Readings Condensed for Quick Review, 2004
Tests are a natural part of education, from the quizzes, essays, and classroom tests that teachers have traditionally administered to the high-stakes tests that states use to make decisions about graduation, promotion, and school funding and governance. In this article, the author stresses the need to learn the unintended consequences of…
Descriptors: Testing, High Stakes Tests, Standardized Tests, Federal Legislation
Raju, Nambury S.; Oshima, T.C. – Educational and Psychological Measurement, 2005
Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…
Descriptors: Item Response Theory, Test Reliability, Evaluation Methods, Computation
O'Rourke, Norm – Educational and Psychological Measurement, 2004
The Center for Epidemiologic Studies-Depression (CES-D) Scale is among the most commonly used measures of depressive symptomatology. Despite this, a paucity of research has been undertaken to examine the psychometric properties of responses to this scale. This meta-analytic study examined previously published studies of caregiving to identify…
Descriptors: Measures (Individuals), Psychometrics, Generalization, Depression (Psychology)

