Publication Date
| In 2026 | 0 |
| Since 2025 | 433 |
| Since 2022 (last 5 years) | 1911 |
| Since 2017 (last 10 years) | 4483 |
| Since 2007 (last 20 years) | 6968 |
Descriptor
| Test Reliability | 15006 |
| Test Validity | 9977 |
| Test Construction | 4353 |
| Foreign Countries | 3811 |
| Psychometrics | 2416 |
| Factor Analysis | 2296 |
| Measures (Individuals) | 1780 |
| Evaluation Methods | 1408 |
| Higher Education | 1389 |
| Questionnaires | 1259 |
| Factor Structure | 1245 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 830 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 159 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004
As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…
Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement
Yasuda, Tomoyuki; Lawrenz, Cathy; Whitlock, Rod Van; Lubin, Bernard; Lei, Pui-Wa – Educational and Psychological Measurement, 2004
Intraindividual variability in positive and negative affect was assessed by the positive affect (Contentment, Joy, Vigor, Love, and Excitement) and negative affect (Depression, Hostility, Anxiety, Agitation, and Social Anxiety) subscales of the state version of the Comprehensive Personality and Affect Scales (COPAS) during a 3-week period. Using…
Descriptors: Measures (Individuals), Error of Measurement, Depression (Psychology), Anxiety
Pajares, Frank; Cheong, Yuk Fai; Oberman, Paul – Educational and Psychological Measurement, 2004
The purpose of this study was to develop scales to assess instrumental help seeking, executive help seeking, perceived benefits of help seeking, and avoidance of help seeking and to examine their psychometric properties by conducting factor and reliability analyses. As this is the first attempt to examine the latent structures underlying the…
Descriptors: Psychometrics, Academic Achievement, Student Motivation, Computer Science
Davis, Mark H.; Capobianco, Sal; Kraus, Linda A. – Educational and Psychological Measurement, 2004
For a number of years, the dominant approach to measuring individual differences in how people respond to interpersonal conflict has been the dual-concerns model, which assesses five broad conflict styles said to result from one's standing on two underlying dimensions: concern for self and concern for other. This article describes the development…
Descriptors: Psychometrics, Social Desirability, Conflict, Test Validity
Brunner, Martin; SuB, Heinz-Martin – Educational and Psychological Measurement, 2005
Two aspects of the reliability of multidimensional measures can be distinguished: the amount of scale score variance that is accounted for by all underlying factors (composite reliability) and the degree to which the scale score reflects one particular factor (construct reliability). Confidence intervals for composite and construct reliabilities…
Descriptors: Measures (Individuals), Intervals, Intelligence Tests, Evaluation Methods
Invernizzi, Marcia A.; Landrum, Timothy J.; Howell, Jennifer L.; Warley, Heather P. – Reading Teacher, 2005
The authors describe a potential disconnect between research and practice in literacy assessment and instruction. They organize discussion around professionally recognized standards for the evaluation of educational assessments and assessment practices. These standards address both technical aspects of tests (e.g., validity; reliability;…
Descriptors: Test Validity, Test Reliability, Test Construction, Test Bias
DiPietro, Janet A. – Mental Retardation and Developmental Disabilities Research Reviews, 2005
The complexities of neurobehavioral assessment of the fetus, which can be neither directly viewed nor manipulated, cannot be understated. Impetus to develop methods for measuring fetal neurobehavioral development has been provided by the recognition that individual differences in neurobehavioral functioning do not originate with birth and…
Descriptors: Metabolism, Stimulation, Predictive Validity, Pregnancy
Koch, Kourtland R. – Journal of Adult Education, 2004
This study is a replication of an original study conducted by James and Blank (1991) which examined the relationship between educational attainment and adult performance using the Multi-Modal Paired Associates Learning Test-Revised (MMPALT-II) (Cherry, 1981). The MMPALT-II was designed to measure an individual's demonstrated perceptual modality…
Descriptors: Cognitive Style, Educational Attainment, Learning Strategies, Replication (Evaluation)
Paivio, Sandra, C.; Cramer, Kenneth, M. – Child Abuse & Neglect: The International Journal, 2004
Objective: The aims of this study were to examine (1) the psychometric properties of the Childhood Trauma Questionnaire [CTQ; Bernstein, D., Fink, L., Handelsman, L., Foote, J., Lovejoy, M., Wenzel, K., Sapareto, E., & Ruggiero, J. (1994). Initial reliability and validity of a new retrospective measure of child abuse and neglect. American…
Descriptors: Undergraduate Students, Questionnaires, Psychometrics, Test Reliability
Napoli, Anthony R.; Raymond, Lanette A. – Research in Higher Education, 2004
Motivating students to perform well on assessment tests is difficult when students know the results have no academic consequence. The present study evaluates the influence of assessment context (graded vs. non-graded) on the reliability of an assessment measure. Results indicate the graded condition produces higher reliability (r = 0.71) than the…
Descriptors: Test Reliability, College Outcomes Assessment, Nongraded Student Evaluation, Grade Equivalent Scores
Flisher, Alan, J.; Evans, Janet; Muller, Martie; Lombard, Carl – Journal of Adolescence, 2004
There is a paucity of test-retest reliability data for adolescent self-reports of a wide range of risk behaviours. Grade 8 and 11 Students (N=358) completed a questionnaire on two occasions between 10 and 14 days apart. It included items about use of various substances, violent behaviour, suicidality, and sexuality. Cohen's kappa was almost…
Descriptors: Test Reliability, Measurement Techniques, Adolescents, At Risk Persons
Matteson, Alicia V.; Moradi, Bonnie – Psychology of Women Quarterly, 2005
The current study reexamined the factor structure of the Lifetime and Recent scales of the Schedule of Sexist Events (SSE; Klonoff & Landrine, 1995) and conducted the first factor analysis of the SSE-Appraisal scale ( Landrine & Klonoff, 1997). Factor analyses conducted with data from 245 women yielded, for SSE-Lifetime and SSE-Appraisal scales,…
Descriptors: Factor Analysis, Gender Bias, Psychometrics, Females
Dahl, Tove I. – Assessment in Education: Principles, Policy and Practice, 2006
The validity of the Norwegian university grading standard has been called into serious question. The implicit standards used for assessing exams and the reliability of that understanding among examiners and psychology students were investigated in three studies. Studies 1 and 2 investigated the implicit standards that examiners used when assessing…
Descriptors: Psychology, Examiners, Test Validity, Universities
Wiebe, John S.; Penley, Julie A. – Psychological Assessment, 2005
The Beck Depression Inventory-II (BDI-II; A. T. Beck, R. A. Steer, & G. K. Brown, 1996) is a widely used measure of depressive symptomatology originally authored in English and then translated to Spanish. However, there are very limited data available on the Spanish translation. This study compared the psychometric characteristics of the…
Descriptors: Translation, Factor Structure, Factor Analysis, Psychometrics
Chiriboga, David A. – Hispanic Journal of Behavioral Sciences, 2004
This article examines a subset of acculturation items pertaining to language fluency and use and to interpersonal relationships. This study included 3,050 Mexican American elders aged 65 to 99, randomly sampled from five states in the southwestern United States. A standard acculturation inventory was used as the source for two factor-derived…
Descriptors: Mexican Americans, Language Fluency, Acculturation, Test Validity

Peer reviewed
Direct link
