Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Helman, Amanda; Dennis, Minyi Shih; Kern, Lee – Learning Disability Quarterly, 2022
English learners (ELs) with reading disabilities (RDs) have been among the lowest performers on academic achievement tests that assess vocabulary. To meet academic demands and prepare for college or careers, ELs with RDs clearly need support in terms of vocabulary acquisition; however, relevant research is scarce. This study investigated the…
Descriptors: Vocabulary Development, English Language Learners, Reading Difficulties, Biology
Wang, Qiao – Education and Information Technologies, 2022
This study searched for open-source semantic similarity tools and evaluated their effectiveness in automated content scoring of fact-based essays written by English-as-a-Foreign-Language (EFL) learners. Fifty writing samples under a fact-based writing task from an academic English course in a Japanese university were collected and a gold standard…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring
Albudoor, Nahar; Peña, Elizabeth D. – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The differential diagnosis of developmental language disorder (DLD) in bilingual children represents a unique challenge due to their distributed language exposure and knowledge. The current evidence indicates that dual-language testing yields the most accurate classification of DLD among bilinguals, but there are limited personnel and…
Descriptors: Language Impairments, Bilingualism, Clinical Diagnosis, Language Tests
Kinik, Betul; Genc, Bilal – Reading Matrix: An International Online Journal, 2022
The current study presents the findings of a pre-test/post-test design to explore the efficacy of a genre-based approach to teaching argumentative essay writing during synchronous classes. The study is conducted with the participation of a group of freshman and junior year student teachers of English Language Teaching enrolled at the course of…
Descriptors: Literary Genres, English (Second Language), Second Language Learning, Second Language Instruction
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Language, Speech, and Hearing Services in Schools, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Correia, Edgar A.; Sartóris, Vítor; Fernandes, Tiago; Cooper, Mick; Berdondini, Lucia; Sousa, Daniel; Pires, Branca Sá; da Fonseca, João – British Journal of Guidance & Counselling, 2018
Within the major therapeutic paradigms, observational instruments have been developed to assess orientation-specific interventions or processes. However, to date, no such instrument exists to assess existential practices. Recent research indicates the key practices of existential therapists, and forms an empirical basis on which to develop an…
Descriptors: Foreign Countries, Psychotherapy, Allied Health Personnel, Observation
Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018
Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…
Descriptors: Competence, Simulation, Allied Health Personnel, Certification
Pitts, Christine; Anderson, Ross; Haney, Michele – Learning Environments Research, 2018
The purpose of the current study was to estimate reliability, internal consistency and construct validity of the Measure of Instruction for Creative Engagement (MICE) instrument. The MICE uses an iterative process of evidence collection and scoring through teacher observations to determine instructional domain ratings and overall scores. The…
Descriptors: Psychometrics, Achievement Rating, Outcome Measures, Student Evaluation
Paula, Cristiane S.; Cunha, Graccielle Rodrigues; Bordini, Daniela; Brunoni, Decio; Moya, Ana Claudia; Bosa, Cleonice Alves; Mari, Jair J.; Cogo-Moreira, Hugo – Journal of Autism and Developmental Disorders, 2018
Simple and low-cost observational-tools to detect symptoms of Autism Spectrum Disorder (ASD) are still necessary. The OERA is a new assessment tool to screen children eliciting observable behaviors with no substantial knowledge on ASD required. The sample was 99 children aged 3-10: 76 with ASD and 23 without ASD (11/23 had intellectual…
Descriptors: Autism, Pervasive Developmental Disorders, Symptoms (Individual Disorders), Disability Identification
Roohr, Katrina Crotts; Burkander, Kri; Mao, Liyang – ETS Research Report Series, 2018
Oral communication has been identified as an important skill by higher education institutions and by the workforce community. Despite its importance, minimal research has been conducted around the development of tasks to measure oral communication skills and behaviors. The purpose of this preliminary study is to evaluate the different factors…
Descriptors: Speech Communication, Video Technology, Test Construction, Scoring
Liebenberg, Petria; van der Linde, Jeannie; Schimper, Isabella; de Wet, Febe; Graham, Marien; Bornman, Juan – Language, Speech, and Hearing Services in Schools, 2023
Purpose: Language sample analysis is widely regarded as the gold standard of language assessment. However, the uncertainty regarding the optimal length of sample and the limited availability of developmental language data for nonmainstream languages such as Afrikaans complicate reliable use of the method. The study aimed to provide guidelines on…
Descriptors: Indo European Languages, Language Usage, Audiovisual Aids, Measurement Techniques
A Human-Centric Automated Essay Scoring and Feedback System for the Development of Ethical Reasoning
Lee, Alwyn Vwen Yen; Luco, Andrés Carlos; Tan, Seng Chee – Educational Technology & Society, 2023
Although artificial Intelligence (AI) is prevalent and impacts facets of daily life, there is limited research on responsible and humanistic design, implementation, and evaluation of AI, especially in the field of education. Afterall, learning is inherently a social endeavor involving human interactions, rendering the need for AI designs to be…
Descriptors: Essays, Scoring, Writing Evaluation, Computer Software
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
He, Tung-hsien – SAGE Open, 2019
This study employed a mixed-design approach and the Many-Facet Rasch Measurement (MFRM) framework to investigate whether rater bias occurred between the onscreen scoring (OSS) mode and the paper-based scoring (PBS) mode. Nine human raters analytically marked scanned scripts and paper scripts using a six-category (i.e., six-criterion) rating…
Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Essays
Jeong, Heejeong – Language Testing in Asia, 2019
In writing assessment, finding a valid, reliable, and efficient scale is critical. Appropriate scales, increase rater reliability, and can also save time and money. This exploratory study compared the effects of a binary scale and an analytic scale across teacher raters and expert raters. The purpose of the study is to find out how different scale…
Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction

Peer reviewed
Direct link
