Byrne, Barbara M.; And Others – Journal of Adolescent Research, 1994 (Peer reviewed)
Describes a study that sought to test for the factorial validity of the French version of the Beck Depression Inventory separately for nonclinical adolescent French-speaking males and females; cross-validate findings across a second independent sample for each gender; and test for equivalent factorial structure across gender for that population.…
Descriptors: Adolescents, Depression (Psychology), Emotional Problems, Measures (Individuals)
Robitschek, Christine – Measurement and Evaluation in Counseling and Development, 1998 (Peer reviewed)
The Personal Growth Initiative Scale (PGIS) assesses intentionality in personal growth. Three studies were undertaken to (1) investigate the PGI construct; (2) describe the development and factor structure of the PGIS; and (3) test the internal consistency, temporal stability, and convergent and discriminant validity of scores on the PGIS. (EMK)
Descriptors: College Students, Counseling, Individual Development, Self Efficacy
Kuhne, Michael; Wiener, Judith – Learning Disability Quarterly, 2000 (Peer reviewed)
The stability of peer status of children (ages 9 to 12) with (N=38) and without learning disabilities (LD) was examined through sociometric measures twice in the same school year. Although test-retest reliability was good, children with LD were likely to lose peer status and be seen by peers in less favorable terms at Time 2 than Time 1. (Contains…
Descriptors: Intermediate Grades, Learning Disabilities, Peer Acceptance, Peer Relationship
Clark, Kenneth – Mathematics Teacher, 1999 (Peer reviewed)
Explains and demonstrates a procedure that is commonly used to determine the reliability of a test in such a way that a person who has modest arithmetical skills can carry out the same analysis on a classroom test or examination. (ASK)
Descriptors: Mathematics Education, Secondary Education, Secondary School Mathematics, Test Construction
Laufer, Batia; Nation, Paul – Language Testing, 1999 (Peer reviewed)
Investigated the reliability, validity, and practicality of a controlled production measure of vocabulary, consisting of items from five frequency levels and using a completion-item format. Two equivalent test forms were compared. The test was found to be useful in distinguishing between different proficiency groups. (Author/MSE)
Descriptors: Difficulty Level, Language Tests, Second Languages, Test Construction
Fall, Marijane; McLeod, Elizabeth H. – Professional School Counseling, 2001 (Peer reviewed)
Evaluates the revised editions of the Self-Efficacy Scale for use with children in schools. Addresses the reliability and validity of the two versions of the scale. Proposes counseling interventions to increase student self-efficacy. (Contains 22 references, 1 table, and an appendix.) (GCP)
Descriptors: Children, Counseling Techniques, Elementary Education, School Counseling
van IJzendoorn, Marinus H.; Vereijken, Carolus M.J.L.; Bakermans-Kranenburg, Marian J.; Riksen-Walraven, Marianne J. – Child Development, 2004
The reliability and validity of the Attachment Q Sort (AQS; Waters & Deane, 1985) were tested in a series of meta-analyses on 139 studies with 13,835 children. The observer AQS security score showed convergent validity with Strange Situation procedure (SSP) security (r = .31) and excellent predictive validity with sensitivity measures (r = .39). Its…
Descriptors: Q Methodology, Predictive Validity, Attachment Behavior, Test Validity
Nystrom, Peter – Scandinavian Journal of Educational Research, 2004
Reliability is a problem inherent in all educational assessments, but the amount of attention this particular problem should be given is related to the function and use of the assessment. In this article, classification accuracy is put forward as a conceptualization of reliability that is meaningful for a large number of educational assessments.…
Descriptors: Test Validity, Test Reliability, Mathematics Tests, Foreign Countries
Foust, Michelle Singer; Elicker, Joelle D.; Levy, Paul E. – Journal of Vocational Behavior, 2006
The authors developed and validated a measure of employees' attitudes toward lateness at work. Analyses provided clear evidence of the reliability and validity of the new measure. Specifically, high reliabilities were observed in both student (α = 0.82) and employee (α = 0.84) samples. Using objective lateness data from organizations, the measure…
Descriptors: Measures (Individuals), Employee Attitudes, Work Attitudes, Test Reliability
Williams, Jo; Allison, Carrie; Scott, Fiona; Stott, Carol; Bolton, Patrick; Baron-Cohen, Simon; Brayne, Carol – Autism: The International Journal of Research & Practice, 2006
The Childhood Asperger Syndrome Test (CAST) is a 37-item parental self-completion questionnaire to screen for autism spectrum conditions in research. Good test accuracy was demonstrated in studies with primary school aged children in mainstream schools. The aim of this study was to investigate the test-retest reliability of the CAST. Parents of…
Descriptors: Asperger Syndrome, Parent Attitudes, Questionnaires, Young Children
Muller, Jorg M. – Educational and Psychological Measurement, 2006
A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…
Descriptors: Test Reliability, Probability, Scores, Item Response Theory
Wise, Lauress L. – Educational Measurement: Issues and Practice, 2006
Uses and consequences of educational testing have increased dramatically in recent years. Professional standards to ensure fair treatment of all affected by test results are more important than ever, but standards for developing and using educational tests are only helpful if they are followed. Test developers and users each have a role to play in…
Descriptors: Educational Testing, Standards, Accountability, Cooperation
Shields, Alan L.; Caruso, John C. – Educational and Psychological Measurement, 2004
The CAGE is a commonly used alcohol screening instrument. Although considerable work has been done on the validity of CAGE scores, relatively little information is available on their reliability. Reliability induction and generalization studies were performed for the CAGE. Of the 259 studies available for analysis, only 19 (7.3%) contained…
Descriptors: Logical Thinking, Generalization, Test Reliability, Questionnaires
Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004
Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However, in some testing situations, like test translation and…
Descriptors: True Scores, Simulation, Test Bias, Student Evaluation
Einarsdottir, Johanna; Ingham, Roger J. – American Journal of Speech-Language Pathology, 2005
Purpose: This article critically reviews evidence to determine whether the use of disfluency typologies, such as "syllable repetitions" or "prolongations", has assisted the understanding or treatment of developmental stuttering. Consideration is given to whether there is a need for a fundamental shift in the basis for constructing measures of…
Descriptors: Stuttering, Measures (Individuals), Evidence, Test Reliability

