NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)4
Since 2007 (last 20 years)14
What Works Clearinghouse Rating
Showing 1 to 15 of 119 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Eastridge, June A.; Benson, Wendi L. – Teaching of Psychology, 2020
Research on collaborative testing has shown that it decreases test anxiety, increases learning and critical thinking skills, and allows students to practice collaboration and teamwork. However, it has most often been used as a second test following traditional individual testing. This quasi-experimental study compared two models of collaborative…
Descriptors: Group Testing, Models, Statistics, Test Anxiety
Peer reviewed Peer reviewed
Direct linkDirect link
Ketterlin-Geller, Leanne R.; Perry, Lindsey; Adams, Elizabeth – Applied Measurement in Education, 2019
Despite the call for an argument-based approach to validity over 25 years ago, few examples exist in the published literature. One possible explanation for this outcome is that the complexity of the argument-based approach makes implementation difficult. To counter this claim, we propose that the Assessment Triangle can serve as the overarching…
Descriptors: Validity, Educational Assessment, Models, Screening Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Oliveri, María Elena; Rutkowski, David; Rutkowski, Lesli – ETS Research Report Series, 2018
Fifty years after the first international large-scale assessment (ILSA), participation in these studies continues to grow, with more than 50% of the world's countries participating. Concomitant with growth in ILSAs is an expansion in the diversity of participant countries with respect to languages, cultures, and educational perspectives and goals.…
Descriptors: International Assessment, Test Validity, Test Use, Alignment (Education)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Stone, Elizabeth; Wylie, E. Caroline – ETS Research Report Series, 2019
We describe the summative assessment component within a K-12 assessment program and our development of a validity argument to support its claims with respect to intended uses and interpretations. First, we describe the "Winsight"® assessment program theory of action, a logic model elucidating mechanisms for how use of the assessment…
Descriptors: Summative Evaluation, Educational Assessment, Test Validity, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2016
How we choose to use a term depends on what we want to do with it. If "validity" is to be used to support a score interpretation, validation would require an analysis of the plausibility of that interpretation. If validity is to be used to support score uses, validation would require an analysis of the appropriateness of the proposed…
Descriptors: Test Validity, Test Interpretation, Test Use, Scores
Barret, Xian Franzinger; Cody, Anthony; Martinez, Jessica S.; Burris, Carol; Koonlaba, Amanda; McKelvy, Tiffany; Nolan, Lee-Ann; Meeks, John Louis, Jr. – Network for Public Education, 2016
Teachers choose the teaching profession because of their love of children and their desire to help them grow and blossom as learners. Across the nation, however, far too many educators are leaving the classroom. Headlines report teacher shortages in nearly every state. One factor reported in almost every story is the discouragement teachers feel…
Descriptors: Teacher Evaluation, Teacher Attitudes, Administrator Attitudes, Public School Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Casey, Leo M. – Teachers College Record, 2013
Background/Context: There is a deep and yawning chasm between the world of tests and testing practices as they ought to be and the actual tests and testing practices now imposed on American students, educators, and schools. That chasm of theory and practice is a function of the dominant paradigm of educational reform, with its theory of action…
Descriptors: Educational Change, Commercialization, Models, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Sideridis, Georgios D.; Tsaousis, Ioannis; Al-harbi, Khaleel A. – Journal of Psychoeducational Assessment, 2015
The purpose of the present study was to extend the model of measurement invariance by simultaneously estimating invariance across multiple populations in the dichotomous instrument case using multi-group confirmatory factor analytic and multiple indicator multiple causes (MIMIC) methodologies. Using the Arabic version of the General Aptitude Test…
Descriptors: Semitic Languages, Aptitude Tests, Error of Measurement, Factor Analysis
Slater, Liz – CfBT Education Trust, 2013
Monitoring, evaluation, and quality assurance in their various forms are seen as being one of the foundation stones of high-quality education systems. De Grauwe, writing about "school supervision" in four African countries in 2001, linked the decline in the quality of basic education to the cut in resources for supervision and support.…
Descriptors: Educational Improvement, Accountability, Quality Assurance, Educational Quality
Peer reviewed Peer reviewed
Direct linkDirect link
Koch, Martha J.; DeLuca, Christopher – Assessment in Education: Principles, Policy & Practice, 2012
In this article we rethink validation within the complex contexts of high-stakes assessment. We begin by considering the utility of existing models for validation and argue that these models tend to overlook some of the complexities inherent to assessment use, including the multiple interpretations of assessment purposes and the potential…
Descriptors: Foreign Countries, Test Use, Case Studies, Educational Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Carlson, Janet F.; Benson, Nicholas; Oakland, Thomas – School Psychology International, 2010
Implications of the International Classification of Functioning, Disability and Health (ICF) on the development and use of tests in school settings are enumerated. We predict increased demand for behavioural assessments that consider a person's activities, participation and person-environment interactions, including measures that: (a) address…
Descriptors: Classification, Models, Test Construction, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Jia, Yujie – ProQuest LLC, 2013
This study employed Bachman and Palmer's (2010) Assessment Use Argument framework to investigate to what extent the use of a second language oral test as an exit test in a Hong Kong university can be justified. It also aimed to help test developers of this oral test identify the most critical areas in the current test design that might need…
Descriptors: Test Use, Language Tests, Oral Language, Second Language Learning
Peer reviewed Peer reviewed
Bradley, Richard W. – Measurement and Evaluation in Counseling and Development, 1994
Contends that tests were not used in counseling in the mid-20th century partially as a function of the prevailing view of what constituted science. Questions the 19th-century view of science regarding testing and then moves on to assert that the 21st-century application of tests with clients requires a substantial paradigm shift. (Author/NB)
Descriptors: Counseling, Individual Differences, Models, Test Use
Peer reviewed Peer reviewed
Glutting, Joseph J. – Journal of School Psychology, 1989
Introduces Stanford-Binet Intelligence Scale-Fourth Edition (SB4) as an attempt to revitalize Stanford-Binet by maintaining links with previous editions while simultaneously incorporating more recent developments found in other popular tests of intelligence. Discusses the SB4's theoretical foundation, materials and administration, scaling,…
Descriptors: Intelligence Tests, Models, Test Reliability, Test Use
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8