Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 7 |
Descriptor
| Classification | 11 |
| Test Reliability | 11 |
| Test Validity | 7 |
| Evaluation Methods | 5 |
| Test Construction | 4 |
| Measurement Techniques | 3 |
| Simulation | 3 |
| Accuracy | 2 |
| Educational Quality | 2 |
| Ethics | 2 |
| Foreign Countries | 2 |
| More ▼ | |
Source
Author
Publication Type
| Reports - Descriptive | 11 |
| Journal Articles | 9 |
| Guides - General | 1 |
| Guides - Non-Classroom | 1 |
| Numerical/Quantitative Data | 1 |
| Translations | 1 |
Education Level
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Elementary Secondary Education | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Audience
| Researchers | 1 |
Location
| Austria | 1 |
| United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| ACT Assessment | 1 |
What Works Clearinghouse Rating
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Ketterlin-Geller, Leanne R.; Shivraj, Pooja; Basaraba, Deni; Schielack, Jane – Investigations in Mathematics Learning, 2019
Within a multi-tier system of support (MTSS), results from universal screeners help teachers make instructional decisions early in the learning process to prevent and remediate skill gaps. Universal screeners help teachers determine students' need for and intensity of additional instructional support to reach their learning goals by efficiently…
Descriptors: Middle School Students, Readiness, Algebra, Screening Tests
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Gargani, John; Strong, Michael – Journal of Teacher Education, 2015
In Gargani and Strong (2014), we describe The Rapid Assessment of Teacher Effectiveness (RATE), a new teacher evaluation instrument. Our account of the validation research associated with RATE inspired a review by Good and Lavigne (2015). Here, we reply to the main points of their review. We elaborate on the validity, reliability, theoretical…
Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Evaluation Methods
Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011
At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are…
Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance
Shriberg, Lawrence D.; Fourakis, Marios; Hall, Sheryl D.; Karlsson, Heather B.; Lohmeier, Heather L.; McSweeny, Jane L.; Potter, Nancy L.; Scheer-Cohen, Alison R.; Strand, Edythe A.; Tilkens, Christie M.; Wilson, David L. – Clinical Linguistics & Phonetics, 2010
This report describes three extensions to a classification system for paediatric speech sound disorders termed the Speech Disorders Classification System (SDCS). Part I describes a classification extension to the SDCS to differentiate motor speech disorders from speech delay and to differentiate among three sub-types of motor speech disorders.…
Descriptors: Autism, Classification, Acoustics, Phonetics
McCowan, Richard J. – Online Submission, 1999
Item writing is a major responsibility of trainers. Too often, qualified staff who prepare lessons carefully and teach conscientiously use inadequate tests that do not validly reflect the true level of trainee achievement. This monograph describes techniques for constructing multiple-choice items that measure student performance accurately. It…
Descriptors: Multiple Choice Tests, Item Analysis, Test Construction, Test Items
Atkins, David C.; Bedics, Jamie D.; Mcglinchey, Joseph B.; Beauchaine, Theodore P. – Journal of Consulting and Clinical Psychology, 2005
Measures of clinical significance are frequently used to evaluate client change during therapy. Several alternatives to the original method devised by N. S. Jacobson, W. C. Follette, & D. Revenstorf (1984) have been proposed, each purporting to increase accuracy. However, researchers have had little systematic guidance in choosing among…
Descriptors: Psychotherapy, Statistical Significance, Outcomes of Treatment, Behavior Change
Gottfredson, Stephen D.; Moriarty, Laura J. – Crime & Delinquency, 2006
Statistically based risk assessment devices are widely used in criminal justice settings. Their promise remains largely unfulfilled, however, because assumptions and premises requisite to their development and application are routinely ignored and/or violated. This article provides a brief review of the most salient of these assumptions and…
Descriptors: Risk, Justice, Criminals, Crime
Patry, Jean-Luc; Gastager, Angela – Quality of Higher Education, 2004
A taxonomy of potential values conflicts in evaluations is presented. Basis is the distinction of types of conflict (aims vs. means conflicts; qualitative vs. quantitative conflicts). Six areas of values are discussed: the ethical, methodical, social and interactive, legal, economic, and personal values. The taxonomy of conflicts between these…
Descriptors: Student Evaluation of Teacher Performance, Classification, Values, Conflict

Peer reviewed
Direct link
