Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Hall, Leigh A. – Reading & Writing Quarterly, 2016
The purpose of this year-long project was to examine an instructional framework intended to help middle school teachers create instruction that responds to students' reading identities while also helping students learn the skills they need to be successful readers. The project used a formative design approach in order to achieve 3 pedagogical…
Descriptors: Self Concept, Middle School Teachers, Reading Instruction, Middle School Students
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Dahlen, Sarah P. C.; Hanson, Kathlene – College & Research Libraries, 2017
Discovery layers provide a simplified interface for searching library resources. Libraries with limited finances make decisions about retaining indexing and abstracting databases when similar information is available in discovery layers. These decisions should be informed by student success at finding quality information as well as satisfaction…
Descriptors: Preferences, Bibliographies, Indexes, Database Management Systems
Hites, Lacey S.; Lundervold, Duane A. – International Journal of Behavioral Consultation and Therapy, 2013
Forty-four individuals, 18-47 (MN 21.8, SD 5.63) years of age, took part in a study examining the magnitude and direction of the relationship between self-report and direct observation measures of relaxation and mindfulness. The Behavioral Relaxation Scale (BRS), a valid direct observation measure of relaxation, was used to assess relaxed behavior…
Descriptors: Correlation, Observation, Metacognition, Relaxation Training
Crane, Rebecca S.; Eames, Catrin; Kuyken, Willem; Hastings, Richard P.; Williams, J. Mark G.; Bartley, Trish; Evans, Alison; Silverton, Sara; Soulsby, Judith G.; Surawy, Christina – Assessment, 2013
Background: The assessment of intervention integrity is essential in psychotherapeutic intervention outcome research and psychotherapist training. There has been little attention given to it in mindfulness-based interventions research, training programs, and practice. Aims: To address this, the Mindfulness-Based Interventions: Teaching Assessment…
Descriptors: Intervention, Evaluation Criteria, Reliability, Validity
Wright, Heather Harris; Capilouto, Gilson J.; Koutsoftas, Anthony – International Journal of Language & Communication Disorders, 2013
Background: Discourse coherence is a reflection of the listener's ability to interpret the overall meaning conveyed by the speaker. Measuring global coherence (maintenance of thematic unity of the discourse) is useful for quantifying communication impairments at the discourse level in clinical populations and for measuring response to…
Descriptors: Measures (Individuals), Feasibility Studies, Test Reliability, Construct Validity
Sadler, D. Royce – Assessment in Education: Principles, Policy & Practice, 2013
The course (module) grades entered on higher education academic records (transcripts) purportedly represent substantive levels of student achievement. They are often taken at face value and accepted as comparable across courses. Research undertaken over several decades has shown that the underlying standards against which student works are…
Descriptors: Academic Achievement, Academic Standards, Grades (Scholastic), Interrater Reliability
Wavering, Michael; Mangione, Katherine; McBride, Craig – Online Submission, 2013
A dissertation study looking at preservice teachers' alternative conceptions in earth science was completed by one of the authors. The data used for this study from the dissertation were a series of eleven interviews. (Purpose) The authors of this manuscript wanted to provide more in-depth analysis of these interviews, specifically to provide a…
Descriptors: Logical Thinking, Reliability, Coding, Data Analysis
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability
Gelman, Susan A.; Mannheim, Bruce; Escalante, Carmen; Tapia, Ingrid Sanchez – First Language, 2015
Southern Peruvian Quechua is an indigenous language spoken primarily in rural communities in the Peruvian Andes. The language includes a syntactic construction, "-paq", that expresses purpose or function, thus providing an opportunity to trace how parents and children with little formal education express teleological concepts. The…
Descriptors: American Indian Languages, Parent Child Relationship, Language Acquisition, Foreign Countries
Tertoolen, Anja; Geldens, Jeannette; van Oers, Bert; Popeijus, Herman – International Journal of Educational Psychology, 2015
Listening to young children's voices is an issue with increasing relevance for many researchers in the field of early childhood research. At the same time, teachers and researchers are faced with challenges to provide children with possibilities to express their notions, and to find ways of comprehending children's voices. In our research we aim…
Descriptors: Foreign Countries, Program Evaluation, Coding, Educational Research
Dunst, Carl J.; Bruder, Mary Beth; Hamby, Deborah W. – Educational Research and Reviews, 2015
Findings from a metasynthesis of 15 research reviews of in-service professional development to improve or change teacher content knowledge and practice and student/child knowledge and behavior are described. The research reviews included 550 studies of more than 50,000 early intervention, preschool, elementary, and secondary education teachers,…
Descriptors: Faculty Development, Pedagogical Content Knowledge, Knowledge Level, Correlation
Aitken, Madison; Martinussen, Rhonda; Wolfe, Richard G.; Tannock, Rosemary – Assessment for Effective Intervention, 2015
The Strengths and Difficulties Questionnaire (SDQ) is a 25-item screening measure for emotional and behavioral problems in children and adolescents aged 4 to 16. Structural equation modeling was used to test the five-factor structure of teacher and parent ratings on the British version of the SDQ in a community sample of 501 Canadian children aged…
Descriptors: Foreign Countries, Factor Structure, Questionnaires, Elementary School Students
Prieto, Gerardo; Nieto, Eloísa – Psicologica: International Journal of Methodology and Experimental Psychology, 2014
This paper describes how a Many Faceted Rasch Measurement (MFRM) approach can be applied to performance assessment focusing on rater analysis. The article provides an introduction to MFRM, a description of MFRM analysis procedures, and an example to illustrate how to examine the effects of various sources of variability on test takers' performance…
Descriptors: Item Response Theory, Interrater Reliability, Rating Scales, Error of Measurement
May, Michael E.; Sheng, Yanyan; Chitiyo, Morgan; Brandt, Rachel C.; Howe, Abigail P. – Education and Treatment of Children, 2014
There has been considerable emphasis on indirect functional behavior assessments in school settings. However, little research has evaluated the reliability of these methods to identify behavioral function. One indirect measure, the Questions About Behavioral Function (QABF) scale, has yet to be extensively studied in school settings, though…
Descriptors: Functional Behavioral Assessment, Disabilities, Interrater Reliability, Rating Scales

