Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Hilgenkamp, Thessa I. M.; van Wijck, Ruud; Evenhuis, Heleen M. – Journal of Intellectual & Developmental Disability, 2012
Background: Physical fitness is relevant for wellbeing and health, but knowledge on the feasibility and reliability of instruments to measure physical fitness for older adults with intellectual disability is lacking. Methods: Feasibility and test-retest reliability of a physical fitness test battery (Box and Block Test, Response Time Test, walking…
Descriptors: Reaction Time, Physical Activities, Mental Retardation, Physical Fitness
Karami, Hossein – RELC Journal: A Journal of Language Teaching and Research, 2012
This paper reports an attempt to develop and validate a bilingual Persian version of the Vocabulary Size Test (VST). Due to the particular educational system in Iran, there is a dire need for a test that can effectively estimate English learners' vocabulary sizes. Previous research (Nguyen and Nation, 2011) has indicated that bilingual versions of…
Descriptors: Test Validity, Test Reliability, Second Language Learning, Monolingualism
Jia, Cunxian; Zhang, Jie – Death Studies, 2012
The study is aimed to examine the psychometric characteristics of the Duke Social Support Scale (DSSI) in young rural Chinese individuals (379 suicides, 411 controls) aged 15-34 years. Social support was measured by 23-item DSSI, which included Social Interaction Scale, Subjective Social Support, and Instrumental Social Support. DSSI had high…
Descriptors: Construct Validity, Interpersonal Relationship, Measures (Individuals), Interaction
Tsai, Min-hsiu – Action in Teacher Education, 2012
This study investigates the consistency between human raters and an automated essay scoring system in grading high school students' English compositions. A total of 923 essays from 23 classes of 12 senior high schools in Taiwan (Republic of China) were obtained and scored manually and electronically. The results show that the consistency between…
Descriptors: Foreign Countries, High School Students, Writing (Composition), Essays
Wang, Binhong – English Language Teaching, 2010
This paper first analyzed two studies on rater factors and rating criteria to raise the problem of rater agreement. After that the author reveals the causes of discrepencies in rating administration by discussing rater variability and rater bias. The author argues that rater bias can not be eliminated completely, we can only reduce the error to a…
Descriptors: Interrater Reliability, Examiners, Training, Bias
Wheeler, Gregory D. – ProQuest LLC, 2010
Research indicates that many elementary students do not comprehend that the equal sign is an indication that an equality relation exists between two structures. Instead, they perceive the equal sign as an indication that a particular procedure is to be performed. As students mature, and as their exposure to the equal sign and equality relations in…
Descriptors: Expertise, Definitions, Construct Validity, Validity
Dimitrov, Dimiter M. – Mid-Western Educational Researcher, 2010
The focus of this presidential address is on the contemporary treatment of reliability and validity in educational assessment. Highlights on reliability are provided under the classical true-score model using tools from latent trait modeling to clarify important assumptions and procedures for reliability estimation. In addition to reliability,…
Descriptors: Educational Assessment, Validity, Item Response Theory, Reliability
Creamer, Elizabeth G.; Magolda, Marcia Baxter; Yue, Jessica – Journal of College Student Development, 2010
This article presents preliminary evidence of the reliability and validity of a measure of self-authorship derived from 18 items in the Career Decision Making Survey. The research conceptualizes a quantitative measure of self-authorship as a three-part score that reflects level of agreement with statements at each of the first three phases of…
Descriptors: Self Concept Measures, Surveys, Reliability, Validity
Hall, Graham – ELT Journal, 2010
Uysal's article provides a research agenda for IELTS and lists numerous issues concerning the test's reliability and validity. She asks useful questions, but her analysis ignores the uncertainties inherent in all language test development and the wider social and political context of international high-stakes language testing. In this response, I…
Descriptors: Testing, Language Tests, English, High Stakes Tests
Vanhoof, Jan; Van Petegem, Peter – Studies in Educational Evaluation, 2010
This article focuses on school self-evaluations (SSE). It addresses whether SSE meets quality indicators and whether differences can be found between the quality judgments of school principals and inspectors. Data stem from two complementary data collections: population data of school inspections and a survey of a representative sample of school…
Descriptors: Evaluation Methods, Self Evaluation (Groups), Principals, Data Collection
Gordon, Shirley C.; Blum, Cynthia Ann; Parcells, Dax Andrew – Journal of School Nursing, 2010
School nurses may be the first health professionals to assess the onset of facial paralysis/muscle weakness in school-age children. The purpose of this study was to test the psychometric properties of the Gordon Facial Muscle Weakness Assessment Tool (GFMWT) developed by Gordon. Data were collected in two phases. In Phase 1, 4 content experts…
Descriptors: Human Body, Neurological Impairments, School Nurses, Measures (Individuals)
Field, Tiffany; Malphurs, Julie E.; Yando, Regina; Bendell, Debra; Carraway, Kirsten; Cohen, Raquel – Early Child Development and Care, 2010
Based on interviews with 120 children ranging from age 3 to 12, legal interviewers rated the grade school and middle school age children as competent and as understanding the meaning of lying. The interviewers rated the grade school children as more credible "witnesses in court" than either the preschool or the middle school age…
Descriptors: Children, Nonverbal Communication, Psychological Patterns, Court Litigation
Law, Daniel W. – Journal of Education for Business, 2010
The author surveyed 163 business students representing all business majors from a major state university. Participants completed a questionnaire utilizing a modified version of the Maslach Burnout Inventory. The data were factor analyzed to assess its basic underlying structure, and each burnout component was assessed for reliability. Results…
Descriptors: Majors (Students), Burnout, Business Administration Education, Questionnaires
Aron, Sarah B.; McCrowell, Jean; Moon, Alyson; Yamano, Ryoichi; Roark, Duston A.; Simmons, Monica; Tatanashvili, Zurab; Drake, Brett – Social Work Research, 2010
The purpose of this article is to compare four different levels of aggregation to assess their utility as areal units in child maltreatment research. The units examined are county, zip code, tract, and block group levels. Each of the four levels is analyzed to determine which show the strongest effects in modeling the correlation between poverty…
Descriptors: Poverty, Child Abuse, Counties, Correlation
Lee, John Chi-kin; Yin, Hongbiao; Zhang, Zhonghua – International Journal of Testing, 2010
This article reports the adaptation and analysis of Pintrich's Motivated Strategies for Learning Questionnaire (MSLQ) in Hong Kong. First, this study examined the psychometric qualities of the existing Chinese version of MSLQ (MSLQ-CV). Based on this examination, this study developed a revised Chinese version of MSLQ (MSLQ-RCV) for junior…
Descriptors: Foreign Countries, Questionnaires, Psychometrics, Secondary School Students

Peer reviewed
Direct link
