Jordan, Sally – Computers & Education, 2012
Students were observed directly, in a usability laboratory, and indirectly, by means of an extensive evaluation of responses, as they attempted interactive computer-marked assessment questions that required free-text responses of up to 20 words and as they amended their responses after receiving feedback. This provided more general insight into…
Descriptors: Learner Engagement, Feedback (Response), Evaluation, Test Interpretation
January, Stacy-Ann A.; Ardoin, Scott P.; Christ, Theodore J.; Eckert, Tanya L.; White, Mary Jane – School Psychology Review, 2016
Universal screening in elementary schools often includes administering curriculum-based measurement in reading (CBM-R); but in first grade, nonsense word fluency (NWF) and, to a lesser extent, word identification fluency (WIF) are used because of concerns that CBM-R is too difficult for emerging readers. This study used Kane's argument-based…
Descriptors: Curriculum Based Assessment, Reading Tests, Test Interpretation, Test Use
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Jin, Tan; Mak, Barley; Zhou, Pei – Language Testing, 2012
The fuzziness of assessing second language speaking performance raises two difficulties in scoring speaking performance: "indistinction between adjacent levels" and "overlap between scales". To address these two problems, this article proposes a new approach, "confidence scoring", to deal with such fuzziness, leading to "confidence" scores between…
Descriptors: Speech Communication, Scoring, Test Interpretation, Second Language Learning
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
Validation research for educational achievement tests is often limited to an examination of intended test score interpretations. This article calls for an expansion of validation research in three dimensions. First, validation must attend to actual test use and its consequences, not just score meaning. Second, validation must attend to unintended…
Descriptors: Educational Testing, Educational Improvement, Test Validity, Achievement Tests
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Cabrera, Nolan L.; Cabrera, George A. – Educational Horizons, 2011
Just like all the high-stakes tests that determine students' futures nowadays, The Chorizo Test is a standardized test rooted in the culture of the test makers. It was originally created to be used with students in teacher training programs to sensitize them to the pitfalls inherent in standardized pencil-and-paper tests, such as linguistic bias…
Descriptors: Test Use, Standardized Tests, Social Sciences, High Stakes Tests
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Vanmali, Binaben H. – ProQuest LLC, 2012
Assessment has garnered increased interest in recent years. It is seen as critical to enhancing student learning and understanding. Formative assessment tools such as concept inventories could be valuable in moving toward such goals. Concept inventories, a recent addition to biology education, hold much promise for helping faculty to understand…
Descriptors: College Faculty, Science Teachers, Experienced Teachers, College Science
Stenlund, Tova – Assessment & Evaluation in Higher Education, 2010
The process of giving official acknowledgment to formal, informal and non-formal prior learning is commonly labelled as assessment, accreditation or recognition of prior learning (APL), representing a practice that is expanding in higher education in many countries. This paper focuses specifically on the assessment part of APL, which undoubtedly…
Descriptors: Higher Education, Validity, Prior Learning, Program Effectiveness
Gur, Bekir S.; Celik, Zafer; Ozoglu, Murat – Journal of Education Policy, 2012
In this article we provide a critique of the interpretation and utilization of Programme for International Student Assessment (PISA) results by the National Education Authorities in Turkey. First, we define and explain what OECD's PISA is. Second, we make an overview of the media coverage in Turkey of the PISA 2003 and 2006 results. Third, we…
Descriptors: Foreign Countries, Curriculum Development, Educational Quality, News Reporting
Klesch, Heather S. – ProQuest LLC, 2010
The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
Descriptors: Feedback (Response), Test Results, Focus Groups, Educational Testing
Kane, Michael T. – Educational Researcher, 2008
Lissitz and Samuelsen (2007) have proposed an operational definition of "validity" that shifts many of the questions traditionally considered under validity to a separate category associated with the utility of test use. Operational definitions support inferences about how well people perform some kind of task or how they respond to some kind of…
Descriptors: Test Use, Definitions, Validity, Classification
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
