NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)2
Since 2007 (last 20 years)18
Audience
Researchers1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 21 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung; Su, Chi-Ming; Qiu, Xue-Lan – Journal of Educational Measurement, 2014
Ratings given to the same item response may have a stronger correlation than those given to different item responses, especially when raters interact with one another before giving ratings. The rater bundle model was developed to account for such local dependence by forming multiple ratings given to an item response as a bundle and assigning…
Descriptors: Item Response Theory, Interrater Reliability, Models, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
Nehring, Andreas; Päßler, Andreas; Tiemann, Rüdiger – International Journal of Science and Mathematics Education, 2017
With regard to the moderate performance of German students in international large-scale assessments, one branch of German science education research is concerned with the construction and evaluation of competence models. Based on the theory-driven definition of competence levels, these models imply a correlation between the complexity of a…
Descriptors: Foreign Countries, Science Education, Chemistry, Science Teachers
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M. – ETS Research Report Series, 2015
Automated scoring models were trained and evaluated for the essay task in the "Praxis I"® writing test. Prompt-specific and generic "e-rater"® scoring models were built, and evaluation statistics, such as quadratic weighted kappa, Pearson correlation, and standardized differences in mean scores, were examined to evaluate the…
Descriptors: Writing Tests, Licensing Examinations (Professions), Teacher Competency Testing, Scoring
Martinková, Patrícia; Goldhaber, Dan; Erosheva, Elena – Grantee Submission, 2018
Ratings are present in many areas of assessment including peer review of research proposals and journal articles, teacher observations, university admissions and selection of new hires. One feature present in any rating process with multiple raters is that different raters often assign different scores to the same assessee, with the potential for…
Descriptors: Interrater Reliability, Public School Teachers, Job Applicants, Teacher Selection
Peer reviewed Peer reviewed
Direct linkDirect link
Nail, Paul R.; Simon, Joan B.; Bihm, Elson M.; Beasley, William Howard – Journal of School Violence, 2016
According to the compensation model of aggression (Staub, 1989), some people bully to defend against their own feelings of weakness and vulnerability. Classmates and teachers rated a sample of American sixth graders in terms of trait: defensiveness (i.e., defensive egotism), self-esteem, bullying, and related behaviors. Consistent with the model,…
Descriptors: Bullying, Gender Differences, Aggression, Grade 6
Peer reviewed Peer reviewed
Direct linkDirect link
Diener, Marissa L.; Wright, Cheryl A.; Smith, Katherine N.; Wright, Scott D. – Creativity Research Journal, 2014
The goal of this study was to develop a measure of creativity that builds on the strengths of youth with autism spectrum disorders (ASD). The assessment of creativity focused on the visual-spatial abilities of these youth using 3D modeling software. One of the objectives of the research was to develop a measure of creativity in an authentic…
Descriptors: Autism, Pervasive Developmental Disorders, Creativity, Creativity Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Swank, Jacqueline M.; Lambie, Glenn W.; Witta, E. Lea – Counselor Education and Supervision, 2012
The authors examined the psychometric properties of the Counseling Competencies Scale (CCS; University of Central Florida Counselor Education Faculty, 2009), an instrument designed to assess trainee competencies as measured in their counseling skills, dispositions, and behaviors. There was strong internal consistency for the 4-factor model for…
Descriptors: Test Validity, Interrater Reliability, Counselor Training, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Granfeldt, Jonas; Ågren, Malin – Language Testing, 2014
One core area of research in Second Language Acquisition is the identification and definition of developmental stages in different L2s. For L2 French, Bartning and Schlyter (2004) presented a model of six morphosyntactic stages of development in the shape of grammatical profiles. The model formed the basis for the computer program Direkt Profil…
Descriptors: Second Language Learning, Language Tests, French, Language Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Cropley, David H.; Kaufman, James C. – Journal of Creative Behavior, 2012
The Creative Solution Diagnosis Scale (CSDS) is a 30-item scale based on a core of four criteria: Relevance & Effectiveness, Novelty, Elegance, and Genesis. The CSDS offers potential for the consensual assessment of functional product creativity. This article describes an empirical study in which non-expert judges rated a series of mousetrap…
Descriptors: Expertise, Creativity, Identification, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Pinget, Anne-France; Bosker, Hans Rutger; Quené, Hugo; de Jong, Nivja H. – Language Testing, 2014
Oral fluency and foreign accent distinguish L2 from L1 speech production. In language testing practices, both fluency and accent are usually assessed by raters. This study investigates what exactly native raters of fluency and accent take into account when judging L2. Our aim is to explore the relationship between objectively measured temporal,…
Descriptors: Native Speakers, Language Fluency, Suprasegmentals, Second Language Learning
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Automated scoring models for the "e-rater"® scoring engine were built and evaluated for the "GRE"® argument and issue-writing tasks. Prompt-specific, generic, and generic with prompt-specific intercept scoring models were built and evaluation statistics such as weighted kappas, Pearson correlations, standardized difference in…
Descriptors: Scoring, Test Scoring Machines, Automation, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Kay, Robin H.; Knaack, Liesel – Australasian Journal of Educational Technology, 2008
While discussion of the criteria needed to assess learning objects has been extensive, a formal, systematic model for evaluation has yet to be thoroughly tested. The purpose of the following study was to develop and assess a multi-component model for evaluating learning objects. The Learning Object Evaluation Metric (LOEM) was developed from a…
Descriptors: Foreign Countries, Models, Measurement Techniques, Evaluation Criteria
Peer reviewed Peer reviewed
Direct linkDirect link
Zhou, Zheng; Xin, Tao – Psychology in the Schools, 2007
The traditional kappa statistic in assessing interrater agreement is not adequate when multiraters and multiattributes are involved. In this article, latent trait models are proposed to assess the multirater multiattribute (MRMA) agreement. Data from the Third International Mathematics and Science Studies (TIMSS) are used to illustrate the…
Descriptors: Intervention, School Psychology, Interrater Reliability, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Aleong, Chandra – Journal of College Teaching & Learning, 2007
This paper discusses whether there are differences in performance based on differences in strategy. First, an attempt was made to determine whether the institution had a strategy, and if so, did it follow a particular model. Major models of strategy are the industry analysis approach, the resource based view or the RBV model and the more recent,…
Descriptors: Strategic Planning, Higher Education, Institutional Evaluation, Case Studies
Previous Page | Next Page »
Pages: 1  |  2