Publication Date
In 2025 | 205 |
Since 2024 | 705 |
Since 2021 (last 5 years) | 2293 |
Since 2016 (last 10 years) | 4594 |
Since 2006 (last 20 years) | 6899 |
Descriptor
Test Reliability | 14762 |
Test Validity | 9771 |
Test Construction | 4248 |
Foreign Countries | 3657 |
Psychometrics | 2361 |
Factor Analysis | 2251 |
Measures (Individuals) | 1717 |
Evaluation Methods | 1401 |
Higher Education | 1384 |
Correlation | 1234 |
Questionnaires | 1228 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 452 |
Practitioners | 319 |
Teachers | 128 |
Administrators | 73 |
Policymakers | 33 |
Counselors | 31 |
Students | 17 |
Parents | 10 |
Community | 6 |
Support Staff | 5 |
Location
Turkey | 797 |
Australia | 236 |
Canada | 205 |
China | 195 |
Indonesia | 142 |
Spain | 124 |
United States | 121 |
United Kingdom | 117 |
Germany | 106 |
Taiwan | 103 |
Netherlands | 99 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 2 |
Does not meet standards | 1 |

Winsler, Adam; Wallace, Gregory L. – Early Education and Development, 2002
Examined psychometric properties of the Preschool and Kindergarten Behavior Scales with preschoolers. Found that cross- informant correlations were poor for social skills, low for internalizing behaviors, and modest for externalizing behaviors. Parents and teachers rated boys as having more externalizing behaviors than girls. Parents identified…
Descriptors: Behavior Problems, Behavior Rating Scales, Interpersonal Competence, Measures (Individuals)

Kirk, Karen Iler – Volta Review, 1998
This review describes the theory behind two new measures of spoken word recognition for children with sensory aids, the Lexical and the Multisyllabic Lexical Neighborhood Tests. It then summarizes data concerning the tests' word familiarity, interlist equivalency, and test-retest reliability. Results indicate that deaf children with cochlear…
Descriptors: Auditory Tests, Children, Cochlear Implants, Cognitive Processes

DesRosiers, Fabiana; Vrsalovic, Wendy T.; Knauf, Diana E.; Vargas, Maribel; Busch-Rossnagel, Nancy A. – Merrill-Palmer Quarterly, 1999
Examined psychometric properties of Caregiver Inventory of Self-Concept (CISC) used with a largely Latino sample of 6- to 66-month-olds, and Tasks for Observation of Self-Concept (TOSC) used with 15- to 48-month-olds. Coefficient alpha and factor analysis provided evidence for test reliability and validity. Self-concept development followed…
Descriptors: Age Differences, Factor Analysis, Hispanic Americans, Measures (Individuals)
Holden, Gary; Anastas, Jeane; Meenaghan, Thomas – Journal of Social Work Education, 2005
This replication study continued the examination of the psychometric properties of the Foundation Practice Self-Efficacy scale (FPSE) with a sample of MSW students. As in the original study, evidence was found regarding the reliability, validity, and sensitivity to change of this measure. First, internal reliability estimates for the FPSE all…
Descriptors: Graduate Students, Social Work, Masters Programs, Self Efficacy
Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004
As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…
Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement
Yasuda, Tomoyuki; Lawrenz, Cathy; Whitlock, Rod Van; Lubin, Bernard; Lei, Pui-Wa – Educational and Psychological Measurement, 2004
Intraindividual variability in positive and negative affect was assessed by the positive affect (Contentment, Joy, Vigor, Love, and Excitement) and negative affect (Depression, Hostility, Anxiety, Agitation, and Social Anxiety) subscales of the state version of the Comprehensive Personality and Affect Scales (COPAS) during a 3-week period. Using…
Descriptors: Measures (Individuals), Error of Measurement, Depression (Psychology), Anxiety
Pajares, Frank; Cheong, Yuk Fai; Oberman, Paul – Educational and Psychological Measurement, 2004
The purpose of this study was to develop scales to assess instrumental help seeking, executive help seeking, perceived benefits of help seeking, and avoidance of help seeking and to examine their psychometric properties by conducting factor and reliability analyses. As this is the first attempt to examine the latent structures underlying the…
Descriptors: Psychometrics, Academic Achievement, Student Motivation, Computer Science
Davis, Mark H.; Capobianco, Sal; Kraus, Linda A. – Educational and Psychological Measurement, 2004
For a number of years, the dominant approach to measuring individual differences in how people respond to interpersonal conflict has been the dual-concerns model, which assesses five broad conflict styles said to result from one's standing on two underlying dimensions: concern for self and concern for other. This article describes the development…
Descriptors: Psychometrics, Social Desirability, Conflict, Test Validity
Brunner, Martin; SuB, Heinz-Martin – Educational and Psychological Measurement, 2005
Two aspects of the reliability of multidimensional measures can be distinguished: the amount of scale score variance that is accounted for by all underlying factors (composite reliability) and the degree to which the scale score reflects one particular factor (construct reliability). Confidence intervals for composite and construct reliabilities…
Descriptors: Measures (Individuals), Intervals, Intelligence Tests, Evaluation Methods
Invernizzi, Marcia A.; Landrum, Timothy J.; Howell, Jennifer L.; Warley, Heather P. – Reading Teacher, 2005
The authors describe a potential disconnect between research and practice in literacy assessment and instruction. They organize discussion around professionally recognized standards for the evaluation of educational assessments and assessment practices. These standards address both technical aspects of tests (e.g., validity; reliability;…
Descriptors: Test Validity, Test Reliability, Test Construction, Test Bias
DiPietro, Janet A. – Mental Retardation and Developmental Disabilities Research Reviews, 2005
The complexities of neurobehavioral assessment of the fetus, which can be neither directly viewed nor manipulated, cannot be understated. Impetus to develop methods for measuring fetal neurobehavioral development has been provided by the recognition that individual differences in neurobehavioral functioning do not originate with birth and…
Descriptors: Metabolism, Stimulation, Predictive Validity, Pregnancy
Koch, Kourtland R. – Journal of Adult Education, 2004
This study is a replication of an original study conducted by James and Blank (1991) which examined the relationship between educational attainment and adult performance using the Multi-Modal Paired Associates Learning Test-Revised (MMPALT-II) (Cherry, 1981). The MMPALT-II was designed to measure an individual's demonstrated perceptual modality…
Descriptors: Cognitive Style, Educational Attainment, Learning Strategies, Replication (Evaluation)
Paivio, Sandra, C.; Cramer, Kenneth, M. – Child Abuse & Neglect: The International Journal, 2004
Objective: The aims of this study were to examine (1) the psychometric properties of the Childhood Trauma Questionnaire [CTQ; Bernstein, D., Fink, L., Handelsman, L., Foote, J., Lovejoy, M., Wenzel, K., Sapareto, E., & Ruggiero, J. (1994). Initial reliability and validity of a new retrospective measure of child abuse and neglect. American…
Descriptors: Undergraduate Students, Questionnaires, Psychometrics, Test Reliability
Napoli, Anthony R.; Raymond, Lanette A. – Research in Higher Education, 2004
Motivating students to perform well on assessment tests is difficult when students know the results have no academic consequence. The present study evaluates the influence of assessment context (graded vs. non-graded) on the reliability of an assessment measure. Results indicate the graded condition produces higher reliability (r = 0.71) than the…
Descriptors: Test Reliability, College Outcomes Assessment, Nongraded Student Evaluation, Grade Equivalent Scores
Flisher, Alan, J.; Evans, Janet; Muller, Martie; Lombard, Carl – Journal of Adolescence, 2004
There is a paucity of test-retest reliability data for adolescent self-reports of a wide range of risk behaviours. Grade 8 and 11 Students (N=358) completed a questionnaire on two occasions between 10 and 14 days apart. It included items about use of various substances, violent behaviour, suicidality, and sexuality. Cohen's kappa was almost…
Descriptors: Test Reliability, Measurement Techniques, Adolescents, At Risk Persons