Showing 1 to 15 of 57 results
Peer reviewed
Coggeshall, Whitney Smiley – Educational Measurement: Issues and Practice, 2021
The continuous testing framework, where both successful and unsuccessful examinees have to demonstrate continued proficiency at frequent prespecified intervals, is a framework that is used in noncognitive assessment and is gaining in popularity in cognitive assessment. Despite the rigorous advantages of this framework, this paper demonstrates that…
Descriptors: Classification, Accuracy, Testing, Failure
Peer reviewed
Sinharay, Sandip – Educational and Psychological Measurement, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probabilities of passing of the examinees with incomplete data on mastery tests.…
Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness
Phelps, Richard P. – Online Submission, 2020
This review critiques the highly-praised and influential 2001 study, "Getting Tough? The Impact of High School Graduation Exams," which concluded that "minimum competency," or high school "graduation exams," had no effect on student achievement. The review compares the test classifications of "Getting…
Descriptors: High School Students, Exit Examinations, Academic Achievement, Minimum Competencies
Peer reviewed
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Peer reviewed
Uysal, Huseyin – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2022
Reclassification is a critical threshold when English Learners (ELs) exit specialized language services and access all-English mainstream classrooms. Despite the mandates of the Every Student Succeeds Act, reclassification rates and time remain a pressing problem. A product of this malfunctioning system has been long-term ELs (LTELs). This article…
Descriptors: Standardized Tests, Language Tests, Second Language Learning, English Language Learners
Peer reviewed
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Peer reviewed
Macqueen, Susy; Pill, John; Knoch, Ute – Language Testing, 2016
Objects that sit between intersecting social worlds, such as Language for Specific Purposes (LSP) tests, are "boundary objects"--dynamic, historically derived mechanisms which maintain coherence between worlds (Star & Griesemer, 1989). They emerge initially from sociopolitical mandates, such as the need to ensure a safe and efficient…
Descriptors: Language Tests, Health Personnel, Languages for Special Purposes, English (Second Language)
Sukin, Tia M. – ProQuest LLC, 2010
The presence of outlying anchor items is an issue faced by many testing agencies. The decision to retain or remove an item is a difficult one, especially when the content representation of the anchor set becomes questionable by item removal decisions. Additionally, the reason for the aberrancy is not always clear, and if the performance of the…
Descriptors: Simulation, Science Achievement, Sampling, Data Analysis
Peer reviewed
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Peer reviewed
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Peer reviewed
Wu, Shu-hua; Alrabah, Sulaiman – English Language Teaching, 2014
The purpose of this classroom-based study was to discover the kinds of skill integration tasks that were employed by English teachers in Kuwait and to measure their attitudes toward implementing the skill integration technique in their classrooms. Data collection involved recording 25 hours of classroom-based observations, conducting interviews…
Descriptors: Second Language Learning, Second Language Instruction, Communicative Competence (Languages), Interviews
Peer reviewed
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Maier, Milton H.; Fuchs, Edmund F. – 1973
Army personnel managers have a continuing need to select, classify and assign to training and jobs large numbers of men who enter the service. The present publication addresses the value of selection and classification testing program in relation to job training success and the suitability of the tests for subgroups of the manpower available to…
Descriptors: Achievement Tests, Aptitude Tests, Classification, Job Placement