ERIC - Search Results

Showing all 9 results

Using Biodata and Situational Judgment Inventories across Cultural Groups

Peer reviewed

Prasad, Joshua J.; Showler, Morgan B.; Schmitt, Neal; Ryan, Ann Marie; Nye, Christopher D. – International Journal of Testing, 2017

The present research compares the operation of situational judgement and biodata measures between Chinese and U.S. respondents. We describe the development and past research on both measures, followed by hypothesized differences across the two groups of respondents. We base hypotheses on the nature of the Chinese and U.S. educational systems and…

Descriptors: Measures (Individuals), Hypothesis Testing, Cross Cultural Studies, Comparative Analysis

Item Calibration Samples and the Stability of Achievement Estimates and System Rankings: Another Look at the PISA Model

Peer reviewed

Rutkowski, Leslie; Rutkowski, David; Zhou, Yan – International Journal of Testing, 2016

Using an empirically-based simulation study, we show that typically used methods of choosing an item calibration sample have significant impacts on achievement bias and system rankings. We examine whether recent PISA accommodations, especially for lower performing participants, can mitigate some of this bias. Our findings indicate that standard…

Descriptors: Simulation, International Programs, Adolescents, Student Evaluation

Test Reviewing in Germany

Peer reviewed

Hagemeister, Carmen; Kersting, Martin; Stemmler, Gerhard – International Journal of Testing, 2012

In 2006, a (new) German standard for test reviewing was passed (Testkuratorium, 2006). There was already a European standard in place (European Federation of Psychologists' Associations, 2008). This article presents the German standard for test reviewing and explains how the German test review system was derived from demands in the German standard…

Descriptors: Test Reviews, Foreign Countries, National Standards, Performance Factors

Toward Increasing Fairness in Score Scale Calibrations Employed in International Large-Scale Assessments

Peer reviewed

Oliveri, Maria Elena; von Davier, Matthias – International Journal of Testing, 2014

In this article, we investigate the creation of comparable score scales across countries in international assessments. We examine potential improvements to current score scale calibration procedures used in international large-scale assessments. Our approach seeks to improve fairness in scoring international large-scale assessments, which often…

Descriptors: Test Bias, Scores, International Programs, Educational Assessment

Teachers' Perceptions of Large-Scale Assessment Programs within Low-Stakes Accountability Frameworks

Peer reviewed

Klinger, Don A.; Rogers, W. Todd – International Journal of Testing, 2011

The intent of this study was to examine the views of teachers regarding the appropriateness of the purposes and uses of the provincial assessments in Alberta and Ontario and the seriousness of the concerns raised about these assessments. These provinces represent educational jurisdictions that use large-scale assessments within a low-stakes…

Descriptors: Testing Programs, Educational Improvement, Measures (Individuals), Foreign Countries

Is Test Taker Perception of Assessment Related to Construct Validity?

Peer reviewed

Xie, Qin – International Journal of Testing, 2011

This study examined test takers' perception of assessment demand and its impact on the measurement of intended constructs. More than 800 test takers took a pre- and a posttest of College English Test Band 4 and filled in a perception questionnaire to report the skills they perceive as necessary for answering the test. The study found test takers…

Descriptors: College English, Reading Tests, Essay Tests, Academic Achievement

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Generalizability of GMAT® Validity to Programs outside the U.S.

Peer reviewed

Talento-Miller, Eileen – International Journal of Testing, 2008

This study explores the predictive validity of GMAT® scores for predicting performance in graduate management programs outside the United States. Results suggest that the validity estimates based on the combination of GMAT® scores were about a third of a standard deviation higher for non-U.S. programs compared with existing data on U.S.…

Descriptors: Predictive Validity, Program Effectiveness, Educational Background, Academic Achievement

Automated Simultaneous Assembly for Multistage Testing

Peer reviewed

Breithaupt, Krista; Ariel, Adelaide; Veldkamp, Bernard P. – International Journal of Testing, 2005

This article offers some solutions used in the assembly of the computerized Uniform Certified Public Accountancy (CPA) licensing examination as practical alternatives for operational programs producing large numbers of forms. The Uniform CPA examination was offered as an adaptive multistage test (MST) beginning in April of 2004. Examples of…

Descriptors: Foreign Countries, Testing Programs, Programming, Mathematical Applications
