Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018
The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…
Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators
Ziegler, Wolfram; Staiger, Anja; Schölderle, Theresa; Vogel, Mathias – Journal of Speech, Language, and Hearing Research, 2017
Purpose: Standardized clinical assessment of dysarthria is essential for management and research. We present a new, fully standardized dysarthria assessment, the Bogenhausen Dysarthria Scales (BoDyS). The measurement model of the BoDyS is based on auditory evaluations of connected speech using 9 scales (traits) assessed by 4 elicitation methods.…
Descriptors: Auditory Evaluation, Test Reliability, Test Validity, Rating Scales
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Glad, Johan; Kottorp, Anders; Jergeby, Ulla; Gustafsson, Carina; Sonnander, Karin – Research on Social Work Practice, 2014
Objectives: The aim of this pilot study was to explore psychometric properties of two versions of the Home Observation for Measurement of the Environment Inventory in a Swedish social service sample. Method: Social workers employed at 22 Swedish child protections agencies participated in the data collection. Both classic test theory approaches and…
Descriptors: Psychometrics, Item Response Theory, Foreign Countries, Social Services
Swank, Jacqueline M.; Lambie, Glenn W.; Witta, E. Lea – Counselor Education and Supervision, 2012
The authors examined the psychometric properties of the Counseling Competencies Scale (CCS; University of Central Florida Counselor Education Faculty, 2009), an instrument designed to assess trainee competencies as measured in their counseling skills, dispositions, and behaviors. There was strong internal consistency for the 4-factor model for…
Descriptors: Test Validity, Interrater Reliability, Counselor Training, Measures (Individuals)
Muyskens, Paul; Marston, Doug; Reschly, Amy L. – California School Psychologist, 2007
Behavioral difficulties of school-aged students are typically dealt with in a reactive, rather than preventative manner. This article examines a proactive approach, consistent with the Response-to-Intervention model, using a screening measure designed to identify students at risk for behavior difficulties and targeting these students for early…
Descriptors: Early Intervention, At Risk Students, Teacher Attitudes, Academic Achievement
Cason, Carolyn L.; And Others – 1986
Cason and Cason's model of performance rating was used to determine the extent to which variation in reviewer standards affected the reliability and validity of the program review process used to select papers for inclusion in the annual program. Data analyzed were the overall recommendation for acceptance and ratings on seven quality criteria…
Descriptors: Conference Papers, Data Analysis, Educational Research, Evaluation Criteria
Edwards, Alison L. – Modern Language Journal, 1996 (peer reviewed)
Examined the validity of the pragmatic approach to test difficulty put forward by Child (1987). This study investigated whether the Child discourse-type hierarchy predicts text difficulty for second-language readers. Results suggested that this hierarchy may provide a sound basis for developing foreign-language tests when it is applied by trained…
Descriptors: Adult Students, Analysis of Variance, French, Interrater Reliability
Abedi, Jamal; Baker, Eva L. – Educational and Psychological Measurement, 1995 (peer reviewed)
Results from a performance assessment in which 68 high school students wrote essays support the use of latent variable modeling for estimating reliability, concurrent validity, and generalizability of a scoring rubric. The latent variable modeling approach overcomes the limitations of certain conventional statistical techniques in handling…
Descriptors: Criteria, Essays, Estimation (Mathematics), Generalizability Theory

