Showing all 9 results
Peer reviewed
Badham, Louise; Furlong, Antony – International Journal of Testing, 2023
Multilingual summative assessments face significant challenges due to tensions that exist between multiple language provision and comparability. Yet, conventional approaches for investigating comparability in multilingual assessments fail to accommodate assessments that comprise extended responses that target complex constructs. This article…
Descriptors: Summative Evaluation, Multilingualism, Comparative Analysis, Literature
Peer reviewed
Evers, Arne – International Journal of Testing, 2012
In this article, the characteristics of five test review models are described. The five models are the US review system at the Buros Center for Testing, the German Test Review System of the Committee on Tests, the Brazilian System for the Evaluation of Psychological Tests, the European EFPA Review Model, and the Dutch COTAN Evaluation System for…
Descriptors: Program Evaluation, Test Reviews, Trend Analysis, International Education
Peer reviewed
Carlson, Janet F.; Geisinger, Kurt F. – International Journal of Testing, 2012
The test review process used by the Buros Center for Testing is described as a series of 11 steps: (1) identifying tests to be reviewed, (2) obtaining tests and preparing test descriptions, (3) determining whether tests meet review criteria, (4) identifying appropriate reviewers, (5) selecting reviewers, (6) sending instructions and materials to…
Descriptors: Testing, Test Reviews, Evaluation Methods, Evaluation Criteria
Peer reviewed
Rutkowski, Leslie; Rutkowski, David; Zhou, Yan – International Journal of Testing, 2016
Using an empirically-based simulation study, we show that typically used methods of choosing an item calibration sample have significant impacts on achievement bias and system rankings. We examine whether recent PISA accommodations, especially for lower performing participants, can mitigate some of this bias. Our findings indicate that standard…
Descriptors: Simulation, International Programs, Adolescents, Student Evaluation
Peer reviewed
Geisinger, Kurt F. – International Journal of Testing, 2012
This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…
Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach
Peer reviewed
Bartram, Dave – International Journal of Testing, 2012
Internationalization is possible, but the objectives need careful consideration. It is noted that the majority of countries do not have any form of test quality procedure and that only a small number have reviews, registration, certification, or some combination of these approaches. Internationalization could provide benefits at the least by…
Descriptors: Test Reviews, Educational Research, Evaluation Criteria, Standardized Tests
Peer reviewed
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
Peer reviewed
Leighton, Jacqueline P.; Gokiert, Rebecca J.; Cui, Ying – International Journal of Testing, 2007
Studies of test dimensionality indicate that many large-scale science assessments measure multiple dimensions. These findings have reinforced the perspective that science achievement is an inherently dynamic process and that there is benefit in reporting subscores in science. A limitation with some of these studies is that they fail to indicate…
Descriptors: Program Evaluation, Psychology, Science Achievement, Science Tests
Peer reviewed
Behrens, John T.; Mislevy, Robert J.; Bauer, Malcolm; Williamson, David M.; Levy, Roy – International Journal of Testing, 2004
This article introduces the assessment and deployment contexts of the Networking Performance Skill System (NetPASS) project and the articles in this section that report on findings from this endeavor. First, the educational context of the Cisco Networking Academy Program is described. Second, the basic outline of Evidence Centered Design is…
Descriptors: Evaluation Methods, Global Approach, Computer Networks, Program Descriptions