Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 117 |
| Since 2017 (last 10 years) | 228 |
| Since 2007 (last 20 years) | 561 |
Descriptor
| Evaluation Methods | 1408 |
| Test Reliability | 1408 |
| Test Validity | 954 |
| Student Evaluation | 339 |
| Test Construction | 305 |
| Foreign Countries | 217 |
| Higher Education | 183 |
| Measurement Techniques | 170 |
| Psychometrics | 168 |
| Elementary Secondary Education | 147 |
| Evaluation Criteria | 122 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 74 |
| Practitioners | 72 |
| Teachers | 29 |
| Administrators | 18 |
| Policymakers | 11 |
| Students | 4 |
| Counselors | 3 |
| Support Staff | 3 |
| Community | 1 |
| Parents | 1 |
Location
| Australia | 24 |
| United Kingdom | 22 |
| Canada | 18 |
| Turkey | 16 |
| China | 14 |
| United States | 14 |
| California | 11 |
| Netherlands | 10 |
| Florida | 9 |
| Texas | 8 |
| United Kingdom (England) | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Peer reviewedEvans, Julia L.; Craig, Holly K. – Journal of Speech and Hearing Research, 1992
Analysis of spontaneous language samples of 10 children (ages 8-9) with specific language impairments found that interviews were a reliable, valid, and efficient assessment context, eliciting the same profile of behaviors as a freeplay context without altering diagnostic classifications. (Author/JDD)
Descriptors: Data Collection, Discourse Analysis, Educational Diagnosis, Efficiency
Williams, Gladys A.; Asher, Steven R. – American Journal on Mental Retardation, 1992
Results from a survey of 62 students (ages 8-13) with mild mental retardation and 62 students without retardation indicated that high percentages of both groups understood what loneliness means; a loneliness questionnaire yielded satisfactory internal reliability; and boys but not girls with mental retardation reported more loneliness than did…
Descriptors: Comparative Analysis, Concept Formation, Elementary Education, Emotional Development
Peer reviewedGagne, Francoys; And Others – Gifted Child Quarterly, 1993
Forty prototypical descriptions representing 4 aptitude domains and 4 talent fields were rated by 2,343 intermediate-level pupils and their teachers, and indices of interpeer agreement were computed. A majority of the prototypes maintained acceptable interpeer agreement levels. Interpeer agreement depended primarily on the specific aptitude or…
Descriptors: Ability Identification, Evaluation Methods, Gifted, Intermediate Grades
Peer reviewedRoznowski, Mary – Intelligence, 1993
A measurement and psychometric examination of cognitive tasks was carried out with 195 undergraduates, investigating stability of latencies and number correct scores over a 2-week period and relations with standardized test and ability measure scores. Results are discussed in terms of the need for evaluating cognitive task measurement properties.…
Descriptors: Ability, Cognitive Processes, Cognitive Tests, Evaluation Methods
Peer reviewedBecker, Heather; And Others – Health Values: The Journal of Health Behavior, Education & Promotion, 1993
Describes the development of a measure of self-perceived abilities to implement various health promoting behaviors, reporting on a study of the psychometric properties of the measure in three samples. Results suggest the measure demonstrates adequate reliability and demonstrates predicted relationships with other health measures. (SM)
Descriptors: College Students, Data Collection, Disabilities, Evaluation Methods
Stubbings, Vicki; Martin, Garry L. – American Journal on Mental Retardation, 1998
A study compared the accuracy of experienced staff with a learning test in predicting the ability of 18 persons with mental retardation to learn 12 training tasks. Results found the Assessment of Basic Learning Abilities test was more accurate for predicting client performance than the assessments of experienced staff. (Author/CR)
Descriptors: Attitudes, Cognitive Ability, Competence, Evaluation Methods
Peer reviewedFox, James; Conroy, Maureen; Heckaman, Kelly – Behavioral Disorders, 1998
Twenty-four studies involving functional assessment of students with emotional and behavioral disorders (E/BD) or those at risk for E/BD are reviewed in three main areas: (1) characteristics of participants; (2) types of functional assessment procedures and instruments employed; and (3) the reliability and validity of these instruments and…
Descriptors: Behavior Disorders, Elementary Education, Emotional Disturbances, Evaluation Methods
Peer reviewedCross, Vinette; Hicks, Carolyn; Barwell, Fred – Assessment & Evaluation in Higher Education, 2001
Using videos of physiotherapy students, compared two assessment forms for validity and reliability (the first currently used by an academic program and the second developed from practitioners' perceptions of competence). Also investigated effects of training on assessment decisions. Found wide differences in individual ability to assess students…
Descriptors: Clinical Experience, Comparative Analysis, Competence, Evaluation Methods
Eaves, Ronald C.; Williams, Thomas O., Jr. – Psychology in the Schools, 2006
The reliability and construct validity of ratings for the Autism Behavior Checklist were examined with a sample of 198 children diagnosed with autistic disorder and conditions often confused with autism. Alpha coefficients for the five scales of the ABC as well as the Total Score were reported and the factor structure of the ABC was examined…
Descriptors: Check Lists, Test Reliability, Test Validity, Factor Analysis
Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004
As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…
Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement
Yasuda, Tomoyuki; Lawrenz, Cathy; Whitlock, Rod Van; Lubin, Bernard; Lei, Pui-Wa – Educational and Psychological Measurement, 2004
Intraindividual variability in positive and negative affect was assessed by the positive affect (Contentment, Joy, Vigor, Love, and Excitement) and negative affect (Depression, Hostility, Anxiety, Agitation, and Social Anxiety) subscales of the state version of the Comprehensive Personality and Affect Scales (COPAS) during a 3-week period. Using…
Descriptors: Measures (Individuals), Error of Measurement, Depression (Psychology), Anxiety
Davis, Mark H.; Capobianco, Sal; Kraus, Linda A. – Educational and Psychological Measurement, 2004
For a number of years, the dominant approach to measuring individual differences in how people respond to interpersonal conflict has been the dual-concerns model, which assesses five broad conflict styles said to result from one's standing on two underlying dimensions: concern for self and concern for other. This article describes the development…
Descriptors: Psychometrics, Social Desirability, Conflict, Test Validity
Brunner, Martin; SuB, Heinz-Martin – Educational and Psychological Measurement, 2005
Two aspects of the reliability of multidimensional measures can be distinguished: the amount of scale score variance that is accounted for by all underlying factors (composite reliability) and the degree to which the scale score reflects one particular factor (construct reliability). Confidence intervals for composite and construct reliabilities…
Descriptors: Measures (Individuals), Intervals, Intelligence Tests, Evaluation Methods
Invernizzi, Marcia A.; Landrum, Timothy J.; Howell, Jennifer L.; Warley, Heather P. – Reading Teacher, 2005
The authors describe a potential disconnect between research and practice in literacy assessment and instruction. They organize discussion around professionally recognized standards for the evaluation of educational assessments and assessment practices. These standards address both technical aspects of tests (e.g., validity; reliability;…
Descriptors: Test Validity, Test Reliability, Test Construction, Test Bias
Wiebe, John S.; Penley, Julie A. – Psychological Assessment, 2005
The Beck Depression Inventory-II (BDI-II; A. T. Beck, R. A. Steer, & G. K. Brown, 1996) is a widely used measure of depressive symptomatology originally authored in English and then translated to Spanish. However, there are very limited data available on the Spanish translation. This study compared the psychometric characteristics of the…
Descriptors: Translation, Factor Structure, Factor Analysis, Psychometrics

Direct link
