Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 375 |
| Since 2007 (last 20 years) | 1130 |
Descriptor
| Comparative Analysis | 1943 |
| Reliability | 880 |
| Test Reliability | 792 |
| Foreign Countries | 554 |
| Test Validity | 443 |
| Correlation | 350 |
| Validity | 332 |
| Interrater Reliability | 327 |
| Statistical Analysis | 321 |
| Scores | 280 |
| Measures (Individuals) | 236 |
| More ▼ | |
Source
Author
| Reckase, Mark D. | 6 |
| Attali, Yigal | 5 |
| Coniam, David | 5 |
| Brennan, Robert L. | 4 |
| Crehan, Kevin D. | 4 |
| Feldt, Leonard S. | 4 |
| Hakstian, A. Ralph | 4 |
| Jones, Ian | 4 |
| Kolen, Michael J. | 4 |
| Lunz, Mary E. | 4 |
| August, Diane | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 35 |
| Practitioners | 29 |
| Teachers | 15 |
| Administrators | 9 |
| Policymakers | 6 |
| Counselors | 2 |
| Media Staff | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| Turkey | 59 |
| United States | 47 |
| Australia | 36 |
| China | 33 |
| Canada | 32 |
| United Kingdom (England) | 32 |
| United Kingdom | 28 |
| Germany | 25 |
| Netherlands | 24 |
| Taiwan | 22 |
| Hong Kong | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Rolfhus, Eric; Decker, Lauren E.; Brite, Jessica L.; Gregory, Lois – Regional Educational Laboratory Southwest (NJ1), 2010
This study of four national English language arts college readiness standards sets compares content alignment and level of alignment of the standards statements in three comparison sets to a benchmark set, the American Diploma Project (ADP), and analyzes the cognitive complexity of all four sets. Specifically, this report addresses two primary…
Descriptors: School Readiness, Language Arts, Interrater Reliability, Measures (Individuals)
Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010
Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…
Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)
Wuang, Yee-Pay; Wang, Li-Chen; Su, Chwen-Yng – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The aim of this study was to examine the validation of the Hooper Visual Organization Test (HVOT) for use in children by testing for item fit, unidimensionality, item hierarchy, reliability, and screening capacity. A modified scoring system was devised for the HVOT so that children received some credit for being able to describe the function of…
Descriptors: Test Bias, Down Syndrome, Scoring, Item Response Theory
Bothe, Anne K. – Journal of Speech, Language, and Hearing Research, 2008
Purpose: The purposes of this study were (a) to determine whether highly experienced clinicians and researchers agreed with each other in judging the presence or absence of stuttering in the speech of children who stutter and (b) to determine how those binary stuttered/nonstuttered judgments related to categorizations of the same speech based on…
Descriptors: Stuttering, Identification, Young Children, Speech
Mann, Zennetta; McLaughlin, T. F.; Williams, Randy Lee; Derby, K. Mark; Everson, Mary – Journal of Special Education Apprenticeship, 2012
The purpose of the present study was to evaluate the effects of Direct Instruction (DI) flashcard procedure, combined with strategies and rewards on multiplication fact accuracy of two elementary school-age students. A single subject replication design across three and four sets of multiplication facts was used to evaluate outcomes. The results…
Descriptors: Direct Instruction, Instructional Materials, Mathematics Instruction, Rewards
Tsagari, Dina, Ed.; Csepes, Ildiko, Ed. – Peter Lang Frankfurt, 2012
The Guidelines for Good Practice of the European Association for Language Testing and Assessment (EALTA) stress the importance of collaboration between all parties involved in the process of developing instruments, activities and programmes for testing and assessment. Collaboration is considered to be as important as validity and reliability,…
Descriptors: Sign Language, Testing, Language Tests, Test Validity
Burrows, Lance – ProQuest LLC, 2012
This study is a quasi-experimental, longitudinal investigation into the role that extensive reading and reading strategies play in the cultivation of reading self-efficacy. Conducted over the course of one academic year, how changes in reading self-efficacy translate into changes in reading comprehension was examined. In addition, the…
Descriptors: Foreign Countries, Reading Instruction, Reading, Reading Strategies
Johnson, Erik A. – Contributions to Music Education, 2011
The purpose of this study was to determine the effect of peer-based instruction on rhythm reading achievement of instrumental and choral music students attending a large urbanfringe high school in a major metropolitan area. Participants (N = 131) included band (n = 71) and choir (n = 60) students whose backgrounds reflected extensive economic (78%…
Descriptors: Music, Music Education, Music Reading, High School Students
Glesser, Andrea L. – ProQuest LLC, 2010
This study provided a preliminary analysis of concurrent and discriminative validity for the "Early Literacy Progress Monitoring Assessment Tool" (ELP-MAT; Kaderavek, 2009). Sixty preschool students between the ages of 3 years, 6 months and 5 years of age, from early childhood programs in Northwest Ohio, participated in the study. The…
Descriptors: Early Reading, Early Childhood Education, Language Impairments, Phonological Awareness
Lee, Chang-Hun – Journal of Interpersonal Violence, 2010
This study simultaneously investigates personal and interpersonal traits that were found to be important factors of bullying behavior using data collected from 1,238 randomly selected Korean middle school students. Using a modified and expanded definition of bullying based on a more culturally sensitive approach to bullying, this study categorizes…
Descriptors: Middle School Students, Bullying, Teacher Effectiveness, Interpersonal Relationship
McBride, James R.; Ysseldyke, Jim; Milone, Michael; Stickney, Eric – Canadian Journal of School Psychology, 2010
Technical adequacy and information/cost return were examined for four early reading measures: the Dynamic Indicators of Basic Early Literacy Skills (DIBELS), STAR Early Literacy (SEL), Group Reading Assessment and Diagnostic Evaluation (GRADE), and the Texas Primary Reading Inventory (TPRI). All four assessments were administered to the same…
Descriptors: Early Reading, Reading Achievement, Adaptive Testing, Phonemic Awareness
Brochado, Ana – Quality Assurance in Education: An International Perspective, 2009
Purpose: The purpose of this paper is to examine the performance of five alternative measures of service quality in the high education sector--service quality (SERVQUAL), importance-weighted SERVQUAL, service performance (SERVPERF), importance-weighted SERVPERF, and higher education performance (HEdPERF). Design/methodology/approach: Data were…
Descriptors: Higher Education, Focus Groups, Measurement Techniques, Educational Quality
Cervellione, Kelly L.; Lee, Young-Sun; Bonanno, George A. – Educational and Psychological Measurement, 2009
Self-deception has become a construct of great interest in individual differences research because it has been associated with levels of resilience and mental health. The Balanced Inventory of Desirable Responding (BIDR) is a self-report measure used for quantifying self-deception. In this study we used Rasch modeling to examine the properties of…
Descriptors: Personality Measures, Personality Traits, Deception, Item Response Theory
Porter, Stephen R.; Rumann, Corey; Pontius, Jason – New Directions for Institutional Research, 2011
Survey data are widely used in higher education for purposes such as assessment and strategic planning. One of the most common ways of using surveys has been to assess student learning outcomes by means of proxy questions on a survey, assuming that students who engage in specific behaviors (called engagement) have learned more during college than…
Descriptors: Institutional Research, Student Surveys, Outcomes of Education, Academic Achievement
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Journal of Educational Measurement, 2008
This study addressed the sampling error and linking bias that occur with small samples in a nonequivalent groups anchor test design. We proposed a linking method called the synthetic function, which is a weighted average of the identity function and a traditional equating function (in this case, the chained linear equating function). Specifically,…
Descriptors: Equated Scores, Sample Size, Test Reliability, Comparative Analysis

Peer reviewed
Direct link
