Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Yesil, Rustu – Educational Sciences: Theory and Practice, 2010
The main aim of this study is to develop a scale to assess the extent to which teachers display democratic behaviors they are supposed to display in in-classroom teaching practices and the level of their determination in displaying such behaviors. The study group of this survey is composed of 446 second grade high school students, 243 girls and…
Descriptors: Test Reliability, Test Validity, Measures (Individuals), Teacher Behavior
Yoo, Hyung Chol; Burrola, Kimberly S.; Steger, Michael F. – Journal of Counseling Psychology, 2010
This investigation is a preliminary report on a new measure of internalization of the model minority myth. In 3 studies, there was evidence for the validation of the 15-item Internalization of the Model Minority Myth Measure (IM-4), with 2 subscales. The Model Minority Myth of Achievement Orientation referred to the myth of Asian Americans'…
Descriptors: Asian American Students, College Students, Minority Groups, Ethnic Stereotypes
Tseng, Mei-Hui; Fu, Chung-Pei; Wilson, Brenda N.; Hu, Fu-Chang – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The aim of this study was to adapt and evaluate the Developmental Coordination Disorder Questionnaire (DCDQ) for use in Chinese-speaking countries. A total of 1082 parents completed the DCDQ and 35 parents repeated it after 2 weeks for test-retest reliability. Two items were deleted after examination of test consistency. Cronbach's [alpha] for the…
Descriptors: Test Validity, Measures (Individuals), Psychometrics, Probability
Tweed, Mike; Ingham, Christopher – Advances in Health Sciences Education, 2010
Judgments made by the assessors observing consultations are widely used in the assessment of medical students. The aim of this research was to study judgment accuracy and confidence and the relationship between these. Assessors watched recordings of consultations, scoring the students on: a checklist of items; attributes of consultation; a…
Descriptors: Medical Students, Student Evaluation, Consultation Programs, Observation
Touchie, Claire; Humphrey-Murto, Susan; Ainslie, Martha; Myers, Kathryn; Wood, Timothy J. – Advances in Health Sciences Education, 2010
Oral examinations have become more standardized over recent years. Traditionally a small number of raters were used for this type of examination. Past studies suggested that more raters should improve reliability. We compared the results of a multi-station structured oral examination using two different rater models, those based in a station,…
Descriptors: Interrater Reliability, Internal Medicine, Evaluation Methods, Tests
Prades, Anna; Espinar, Sebastian Rodriguez – Assessment & Evaluation in Higher Education, 2010
The requirement that universities prepare students in practical competences, and assess the extent to which pre-established objectives are achieved, is generally accepted to be of growing importance. As a result, on one hand, there is an increasing volume of research related to task performance in assessment. On the other hand, there are…
Descriptors: Chemistry, College Science, Science Education, Performance Based Assessment
Ide, Bette; Dingmann, Colleen; Cuevas, Elizabeth; Meehan, Maurita – Journal of Family Social Work, 2010
This study tests the validity and reliability of the Family Adaptability and Cohesion Scale III (FACES III) in two samples of rural adolescents. The underlying theory is the linear 3-D circumplex model. The FACES III was administered to 1,632 adolescents in Grades 7 through 12 in two counties in a rural western state. The FACES III Scale and the…
Descriptors: Family Relationship, Adolescents, Measures (Individuals), Counties
Yoshimura, Yuki; MacWhinney, Brian – Applied Psycholinguistics, 2010
This study examined adult English native speakers' processing of sentences in which pronominal case marking conflicts with word order. Previous research has shown that English speakers rely heavily on word order for assigning case roles during sentence interpretation. However, in terms of cue reliability measures, we should expect English…
Descriptors: Sentences, Stimuli, Form Classes (Languages), Word Order
Adelson, Jill L.; McCoach, D. Betsy – Educational and Psychological Measurement, 2010
The purpose of this study was to compare how students in Grades 3 to 6 respond to a mathematics attitudes instrument with a 4-point Likert-type scale compared with one with an additional neutral point (a 5-point Likert-type scale). The 606 participating students from six elementary and middle schools randomly received either the 4-point or 5-point…
Descriptors: Elementary School Students, Student Attitudes, Likert Scales, Measures (Individuals)
Fetro, Joyce V.; Rhodes, Darson L.; Hey, David W. – Health Educator, 2010
During the last 20 years, youth programming has shifted from risk reduction to youth development. While numerous instruments exist to measure selected individual characteristics/competencies among youth, a comprehensive instrument to measure four constructs of personal and social skills could not be identified. The purpose of this study was to…
Descriptors: Youth Programs, Individual Characteristics, Evaluators, Health Education
Hou, Su-I – Michigan Journal of Community Service Learning, 2010
The purpose of this study was to develop a Web-based Faculty Service-Learning Beliefs Inventory (wFSLBI) assessing faculty members' views of the benefits and barriers involved with service-learning (SL) pedagogy. Analyses of the responses of 362 faculty members showed that Inventory items loaded consistently on four sub-scales: Perceived benefits…
Descriptors: College Faculty, Measures (Individuals), Service Learning, Educational Benefits
Marley, Scott C. – Journal for Specialists in Group Work, 2010
Recent articles in "The Journal for Specialists in Group Work" have discussed credibility indicators for quantitative and qualitative studies (Asner-Self, 2009; Rubel & Villalba, 2009). This article extends upon these contributions by discussing measurement issues that are relevant to producers and consumers of quantitative group research. This…
Descriptors: Credibility, Psychological Evaluation, Validity, Data Collection
Goffreda, Catherine T.; DiPerna, James Clyde – School Psychology Review, 2010
The Dynamic Indicators of Basic Early Literacy Skills (DIBELS) are brief measures of early literacy skills for students in Grades K-6 (University of Oregon, 2009; see Kaminski & Good, 1996). School psychologists and other educational professionals use DIBELS to identify students who are in need of early intervention. The purpose of this review was…
Descriptors: Early Intervention, Reading Fluency, School Psychologists, Validity
Rhodes, Ryan E.; Matheson, Deborah Hunt; Mark, Rachel – Measurement in Physical Education and Exercise Science, 2010
The purpose of this study was to compare the reliability, variability, and predictive validity of two common scaling response formats (semantic differential, Likert-type) and two numbers of response options (5-point, 7-point) in the physical activity domain. Constructs of the theory of planned behavior were chosen in this analysis based on its…
Descriptors: Likert Scales, Semantic Differential, Comparative Analysis, Reliability
Spencer, Thomas J.; Adler, Lenard A.; Qiao, Meihua; Saylor, Keith E.; Brown, Thomas E.; Holdnack, James A.; Schuh, Kory J.; Trzepacz, Paula T.; Kelsey, Douglas K. – Journal of Attention Disorders, 2010
Objective: Validation of the Adult ADHD Investigator Symptom Rating Scale (AISRS) that measures aspects of ADHD in adults. Method: Psychometric properties of the AISRS total and AISRS subscales are analyzed and compared to the Conners' Adult Attention-Deficit/Hyperactivity Disorder Rating Scale-Investigator Rated: Screening Version (CAARS-Inv:SV)…
Descriptors: Attention Deficit Hyperactivity Disorder, Adults, Patients, Symptoms (Individual Disorders)

Peer reviewed
Direct link
