Showing 1 to 15 of 23 results
Peer reviewed
Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024
In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…
Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods
Peer reviewed
Regional Educational Laboratory Southeast, 2020
Teachers need to assess their students' current level of mathematical understanding to provide appropriate interventions for students who are struggling. Several school districts in Georgia currently use two assessments for this purpose--the Global Strategy Stage (GloSS) and the Individual Knowledge Assessment of Number (IKAN). The IKAN is…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity
Peer reviewed
Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018
Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…
Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation
Peer reviewed
Gargani, John; Strong, Michael – Journal of Teacher Education, 2015
In Gargani and Strong (2014), we describe The Rapid Assessment of Teacher Effectiveness (RATE), a new teacher evaluation instrument. Our account of the validation research associated with RATE inspired a review by Good and Lavigne (2015). Here, we reply to the main points of their review. We elaborate on the validity, reliability, theoretical…
Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Evaluation Methods
Dietel, Ron – Phi Delta Kappan, 2011
Two tests intended to measure student achievement of the Common Core State Standards will face intense scrutiny, but the test makers say they will include performance assessments and other items that are not multiple-choice questions. Incorporating performance items on these tests will raise issues of scoring, cost, and validity.
Descriptors: Student Evaluation, State Standards, Test Construction, Intellectual Property
Peer reviewed
Brooks, Val – Research Papers in Education, 2012
An aspect of assessment which has received little attention compared with perennial concerns, such as standards or reliability, is the role of judgment in marking. This paper explores marking as an act of judgment, paying particular attention to the nature of judgment and the processes involved. It brings together studies which have explored…
Descriptors: Educational Assessment, Test Reliability, Test Validity, Value Judgment
Peer reviewed
Nunan, Anna – Language Learning in Higher Education, 2014
The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…
Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater® to score the TOEFL iBT® Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Peer reviewed
Morris, Christopher – Developmental Medicine & Child Neurology, 2008
To address the need for a standardized system to classify the gross motor function of children with cerebral palsy, the authors developed a five-level classification system analogous to the staging and grading systems used in medicine. Nominal group process and Delphi survey consensus methods were used to examine content validity and revise the…
Descriptors: Psychomotor Skills, Children, Test Construction, Content Validity
OECD Publishing (NJ1), 2009
The Organisation for Economic Cooperation and Development's (OECD's) Programme for International Student Assessment (PISA) surveys, which take place every three years, have been designed to collect information about 15-year-old students in participating countries. PISA examines how well students are prepared to meet the challenges of the future,…
Descriptors: Policy Formation, Scaling, Academic Achievement, Interrater Reliability
Peer reviewed
Gillberg, Christopher; Gillberg, Carina; Rastam, Maria; Wentz, Elisabeth – Autism: The International Journal of Research and Practice, 2001
The development of the Asperger Syndrome (and high-functioning autism) Diagnostic Interview (ASDI) is described. Preliminary data from a clinical study of 20 individuals (ages 6-55) suggest that interrater reliability and test-retest stability may be excellent, with kappas exceeding 0.90 in both instances. The validity appears to be relatively…
Descriptors: Adults, Asperger Syndrome, Autism, Children
Peer reviewed
Linehan, Marsha M.; Comtois, Katherine Anne; Brown, Milton Z.; Heard, Heidi L.; Wagner, Amy – Psychological Assessment, 2006
The authors describe the development of the Suicide Attempt Self-Injury Interview (SASII), an instrument designed to assess the factors involved in nonfatal suicide attempts and intentional self-injury. Using 4 cohorts of participants, authors generated SASII items and evaluated them with factor and content analyses and internal consistency…
Descriptors: Interrater Reliability, Suicide, Evaluation Methods, Self Destructive Behavior
Peer reviewed
Schley, Sara; Albertini, John – Journal of Deaf Studies and Deaf Education, 2005
The NTID Writing Test was developed to assess the writing ability of postsecondary deaf students entering the National Technical Institute for the Deaf and to determine their appropriate placement into developmental writing courses. While previous research (Albertini et al., 1986; Albertini et al., 1996; Bochner, Albertini, Samar, & Metz, 1992)…
Descriptors: Deafness, Writing Ability, Writing Tests, College Students
Humes, Ann – 1983
This paper, as an illustration of the procedures involved in a cooperative effort, describes a project in which the Southwest Regional Laboratory (SWRL) designed and developed a minimum standards test in collaboration with a large urban school district in California. The activity described focuses on the writing sample included in the test. The…
Descriptors: High Schools, Institutional Cooperation, Interrater Reliability, Minimum Competency Testing
Stansfield, Charles W.; Kenyon, Dorry Mann – 1988
The development and validation of a Portuguese oral language test are described. The test consisted of five item types: personal conversation, giving directions, description of picture sequences, topical discourse, and oral task completion based on printed instructions. Three preliminary forms of the test were administered to a group of language…
Descriptors: Interrater Reliability, Interviews, Language Tests, Oral Language