Publication Date
| In 2026 | 1 |
| Since 2025 | 878 |
| Since 2022 (last 5 years) | 4487 |
| Since 2017 (last 10 years) | 10420 |
| Since 2007 (last 20 years) | 21883 |
Descriptor
| Test Validity | 21728 |
| Validity | 13774 |
| Test Reliability | 10826 |
| Foreign Countries | 9848 |
| Test Construction | 6867 |
| Factor Analysis | 5755 |
| Measures (Individuals) | 5617 |
| Predictive Validity | 5018 |
| Psychometrics | 4800 |
| Reliability | 4632 |
| Correlation | 4370 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1387 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Silvestrone, Judy M. – New Directions for Teaching and Learning, 2004
Whether in the science or language laboratory, carrying out health care procedures or demonstrating performance arts, faculty can improve skill evaluation through transparency and authenticity in exam construction, format, and grading.
Descriptors: Language Laboratories, Performance Based Assessment, Validity, Reliability
Schantz, Susan L.; Gardiner, Joseph C.; Gasior, Donna M.; McCaffrey, Robert J.; Sweeney, Anne M.; Humphrey, Harold E. B. – Psychology in the Schools, 2004
D.V. Cicchetti, A.S. Kaufman, and S.S. Sparrow (this issue) use six criteria to evaluate the published findings from seven different studies of PCB exposure and neuropsychological function. They point out a number of weaknesses or flaws in each study and conclude that these weaknesses make the overall conclusion that PCB exposure negatively…
Descriptors: Evaluation Criteria, Prenatal Influences, Infants, Error of Measurement
Essau, Cecilia A.; Sasagawa, Satoko; Frick, Paul J. – Assessment, 2006
This study examined the structure, distribution, and correlates of a new measure of self-reported callous-unemotional (CU) traits in 1,443 adolescents (774 boys, 669 girls) between the ages of 13 to 18 years. The Inventory of Callous-Unemotional Traits was subjected to exploratory factor analysis and confirmatory factor analysis. Exploratory…
Descriptors: Adolescents, Factor Analysis, Factor Structure, Personality Measures
Hong, Eunsook; Greene, Mary T.; Higgins, Kyle – Gifted Child Quarterly, 2006
An instrument to measure teachers' instructional practices, the Instructional Practice Questionnaire, was developed and validated in three phases. The questionnaires focused on three domains of instructional practices: cognitive, interpersonal, and interpersonal. First, an initial questionnaire was developed for a pilot study, and data were…
Descriptors: Teaching Methods, Questionnaires, Resource Room Programs, Regular and Special Education Relationship
Cruce, Ty M.; Wolniak, Gregory C.; Seifert, Tricia A.; Pascarella, Ernest T. – Journal of College Student Development, 2006
This study estimated separately the unique effects of three dimensions of good practice and the global effects of a composite measure of good practices on the cognitive development, orientations to learning, and educational aspirations of students during their first year of college. Analyses of longitudinal data from a representative sample of…
Descriptors: Cognitive Development, Academic Aspiration, Orientation, College Freshmen
Hartley, S. L.; MacLean, W. E., Jr. – Journal of Intellectual Disability Research, 2006
Background: Likert-type scales are increasingly being used among people with intellectual disability (ID). These scales offer an efficient method for capturing a wide range of variance in self-reported attitudes and behaviours. This review is an attempt to evaluate the reliability and validity of Likert-type scales in people with ID. Methods:…
Descriptors: Likert Scales, Test Reliability, Test Validity, Measurement Techniques
Roussos, Louis A.; Ozbek, Ozlem Yesim – Journal of Educational Measurement, 2006
The development of the DETECT procedure marked an important advancement in nonparametric dimensionality analysis. DETECT is the first nonparametric technique to estimate the number of dimensions in a data set, estimate an effect size for multidimensionality, and identify which dimension is predominantly measured by each item. The efficacy of…
Descriptors: Evaluation Methods, Effect Size, Test Bias, Item Response Theory
Feldt, Leonard S.; Kim, Seonghoon – Educational and Psychological Measurement, 2006
Researchers sometimes need a statistical test of the hypothesis that two values of Cronbach's alpha reliability coefficient are equal. The situation may involve scores from two different measures administered to independent random samples or from the same measure administered to random samples from two different populations. Feldt derived a test…
Descriptors: Individual Testing, Test Items, Sample Size, Scores
Marquez, David X.; McAuley, Edward; Motl, Robert W.; Elavsky, Steriani; Konopack, James F.; Jerome, Gerald J.; Kramer, Arthur F. – Educational and Psychological Measurement, 2006
This study examined the validity of Geriatric Depression Scale--5 (GDS-5) scores among older sedentary adults based on its structural properties and relationship with external criteria. Participants from two samples (Ns = 185 and 93; M ages = 66 and 67 years) completed baseline assessments as part of randomized controlled exercise trials.…
Descriptors: Self Efficacy, Geriatrics, Factor Analysis, Depression (Psychology)
Noens, I.; van Berckelaer-Onnes, I.; Verpoorten, R.; van Duijn, G. – Journal of Intellectual Disability Research, 2006
Background: The ComFor (Forerunners in Communication) is an instrument to explore underlying competence for augmentative communication. More specifically, it measures perception and sense-making of non-transient forms of communication at the levels of presentation and representation. The target group consists primarily of individuals with autism…
Descriptors: Foreign Countries, Comparative Analysis, Verbal Communication, Psychometrics
Diamond, Pamela M.; Magaletta, Philip R. – Assessment, 2006
The 12-item short form of the Buss-Perry Aggression Questionnaire (BPAQ-SF) was originally developed by Bryant and Smith (2001) and modified and confirmed using confirmatory factor analysis with mentally ill offenders by Diamond, Wang, and Buffington-Vollum (2005). In the current study, construct validity of the BPAQ-SF was assessed with a sample…
Descriptors: Aggression, Personality Assessment, Measures (Individuals), Factor Analysis
Kwak, Meg M.; Ervin, Ruth A.; Anderson, Mary Z.; Austin, John – Behavior Modification, 2004
As we begin to apply functional assessment procedures in mainstream educational settings, there is a need to explore options for identifying behavior function that are not only effective but efficient and practical for school personnel to employ. Attempts to simplify the functional assessment process are evidenced by the development of informant…
Descriptors: Middle School Teachers, Test Validity, Rating Scales, Functional Behavioral Assessment
Sternberg, Robert J. – Educational Psychologist, 2004
This article describes two projects based on Robert J. Sternberg's theory of successful intelligence and designed to provide theory-based testing for university admissions. The first, Rainbow Project, provided a supplementary test of analytical, practical, and creative skills to augment the SAT in predicting college performance. The Rainbow…
Descriptors: Program Effectiveness, Ethnic Groups, Testing, Predictive Validity
Peer reviewedCurbow, Barbara; McDonnell, Karen; Spratt, Kai; Griffin, Joan; Agnew, Jacqueline – Early Childhood Research Quarterly, 2003
Developed and tested a 20-item measure of work-family interface with child care providers. Confirmed five factors: general overload, conflict of family to work, spillover of family to work, spillover of work to family, and conflict of work to family. Regression lines for low, medium, and high levels of work-family interface indicated that high…
Descriptors: Child Caregivers, Depression (Psychology), Employed Parents, Factor Analysis
Guan, Jianmin; Xiang, Ping; Keating, Xiaofen Deng – Measurement in Physical Education and Exercise Science, 2004
Although replication is important to the validity of a study and is endorsed by more and more scholars, few researchers in kinesiology attend to this issue. Some researchers may believe that statistical significance and effect size are the most important statistical issues in their research and thereby may have ignored the importance of result…
Descriptors: Statistical Significance, Effect Size, Researchers, Evaluation Methods

Direct link
