Showing 1 to 15 of 19 results
Peer reviewed
Muhammed Parviz; Masoud Azizi – Discover Education, 2025
This article offers a critical review of the Ministry of Science, Research, and Technology English Proficiency Test (MSRT), a high-stakes exam required for postgraduate graduation, scholarships, and certain employment positions in Iran. Despite its widespread use, the design and implementation of the MSRT raise concerns about its validity and…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning
Areekkuzhiyil, Santhosh – Online Submission, 2021
Assessment is an integral part of any teaching-learning process, and it performs a large number of functions, whether formative or summative. This paper analyses the issues involved and the areas of concern in classroom assessment practice and discusses the recent reforms that have taken place. [This paper was published in Edutracks v20 n8…
Descriptors: Student Evaluation, Formative Evaluation, Summative Evaluation, Test Validity
Peer reviewed
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Peer reviewed
Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022
Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade, as technology has meaningfully restructured methods of educational assessment. Given this controversy, various testing guidelines published on…
Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring
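To illustrate the score-interchangeability question raised in the Toroujeni (2022) entry above, the Python sketch below compares paired paper-based and computer-based scores with a paired t-test and a Pearson correlation. The score vectors are hypothetical and are not taken from the study.

# Minimal sketch (hypothetical data): checking whether paper-based and
# computer-based scores from the same examinees look interchangeable.
from scipy import stats

paper_scores    = [24, 30, 18, 27, 22, 29, 25, 21, 26, 23]   # hypothetical PPBT scores
computer_scores = [25, 29, 19, 28, 21, 30, 24, 22, 27, 22]   # hypothetical CFLT scores

t_stat, p_value = stats.ttest_rel(paper_scores, computer_scores)  # mean-difference test
r, _ = stats.pearsonr(paper_scores, computer_scores)              # consistency of rank order

print(f"paired t = {t_stat:.2f}, p = {p_value:.3f}, r = {r:.2f}")

A small, nonsignificant mean difference together with a high correlation would be consistent with interchangeable scores; large differences on either index would argue against treating the two modes as equivalent.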
Peer reviewed
Sheybani, Elias; Zeraatpishe, Mitra – International Journal of Language Testing, 2018
Test method is deemed to affect test scores along with examinee ability (Bachman, 1996). In this research, the role of the method facet in reading comprehension tests is studied. Bachman divided the method facet into five categories, one of which is the nature of the input and of the expected response. This study examined the role of the method effect in…
Descriptors: Reading Comprehension, Reading Tests, Test Items, Test Format
Peer reviewed
Dwyer, Andrew C. – Journal of Educational Measurement, 2016
This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…
Descriptors: Cutting Scores, Equivalency Tests, Test Format, Academic Standards
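The common-item equating approach named in the Dwyer (2016) abstract above can be illustrated with a simplified mean-sigma transformation that carries a cut score from a new form onto the old form's scale. This is only a sketch on hypothetical anchor-item statistics, not the article's procedure.

# Minimal sketch (hypothetical data): mean-sigma linear equating via common
# (anchor) items, applied here to transfer a cut score between test forms.
import statistics

anchor_old = [12.1, 14.3, 11.8, 13.0, 12.6, 13.7]  # anchor scores, old-form group
anchor_new = [11.2, 13.5, 10.9, 12.4, 11.8, 12.9]  # anchor scores, new-form group

slope = statistics.stdev(anchor_old) / statistics.stdev(anchor_new)
intercept = statistics.mean(anchor_old) - slope * statistics.mean(anchor_new)

new_form_cut = 25.0                              # hypothetical cut score on the new form
equated_cut = slope * new_form_cut + intercept   # cut score on the old form's scale
print(f"equated cut score: {equated_cut:.1f}")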
Peer reviewed
Zhang, Xijuan; Savalei, Victoria – Educational and Psychological Measurement, 2016
Many psychological scales written in the Likert format include reverse worded (RW) items in order to control acquiescence bias. However, studies have shown that RW items often contaminate the factor structure of the scale by creating one or more method factors. The present study examines an alternative scale format, called the Expanded format,…
Descriptors: Factor Structure, Psychological Testing, Alternative Assessment, Test Items
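To make the reverse-worded (RW) item issue in the Zhang and Savalei (2016) entry concrete, the sketch below reverse-codes RW Likert items before a total score (or factor analysis) is computed. The item positions and scale points are hypothetical.

# Minimal sketch (hypothetical scale): reverse-coding RW Likert items so all
# items point in the same direction before scoring.
MAX_POINT = 5                      # 5-point Likert scale
rw_items = {2, 4}                  # zero-based positions of the RW items (hypothetical)

def score_response(responses):
    """Return the scale total with RW items reverse-coded (1<->5, 2<->4)."""
    total = 0
    for idx, value in enumerate(responses):
        total += (MAX_POINT + 1 - value) if idx in rw_items else value
    return total

print(score_response([4, 2, 5, 1, 4]))   # items at positions 2 and 4 are reverse-coded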
Peer reviewed
Alweis, Richard L.; Fitzpatrick, Caroline; Donato, Anthony A. – Journal of Education and Training Studies, 2015
Introduction: The Multiple Mini-Interview (MMI) format appears to mitigate individual rater biases. However, the format itself may introduce structural systematic bias, favoring extroverted personality types. This study aimed to gain a better understanding of these biases from the perspective of the interviewer. Methods: A sample of MMI…
Descriptors: Interviews, Interrater Reliability, Qualitative Research, Semi Structured Interviews
Wolf, Raffaela; Zahner, Doris; Kostoris, Fiorella; Benjamin, Roger – Council for Aid to Education, 2014
The measurement of higher-order competencies within a tertiary education system across countries presents methodological challenges due to differences in educational systems, socio-economic factors, and perceptions as to which constructs should be assessed (Blömeke, Zlatkin-Troitschanskaia, Kuhn, & Fege, 2013). According to Hart Research…
Descriptors: Case Studies, International Assessment, Performance Based Assessment, Critical Thinking
Peer reviewed
Hassan, Nurul Huda; Shih, Chih-Min – Language Assessment Quarterly, 2013
This article describes and reviews the Singapore-Cambridge General Certificate of Education Advanced Level General Paper (GP) examination. As a written test that is administered to preuniversity students, the GP examination is internationally recognised and accepted by universities and employers as proof of English competence. In this article, the…
Descriptors: Foreign Countries, College Entrance Examinations, English (Second Language), Writing Tests
Peer reviewed
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Peer reviewed
Kalaycioglu, Dilara Bakan; Berberoglu, Giray – Journal of Psychoeducational Assessment, 2011
This study aimed to detect differential item functioning (DIF) items across gender groups, analyze item content for possible sources of DIF, and ultimately investigate the effect of DIF items on the criterion-related validity of the test scores in the quantitative section of the university entrance examination (UEE) in Turkey. The reason…
Descriptors: Test Bias, College Entrance Examinations, Item Analysis, Test Items
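The DIF detection mentioned in the Kalaycioglu and Berberoglu (2011) abstract above can be illustrated with a Mantel-Haenszel common odds ratio for a single item across matched ability strata. The counts below are hypothetical, and the study itself may have used other DIF procedures.

# Minimal sketch (hypothetical counts): Mantel-Haenszel DIF statistic for one
# item, comparing a reference and a focal group within ability strata.
import math

# Per ability stratum: (ref_correct, ref_wrong, focal_correct, focal_wrong)
strata = [
    (40, 20, 35, 25),
    (55, 10, 48, 17),
    (30, 30, 22, 38),
]

num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
alpha_mh = num / den                    # common odds ratio (1.0 = no DIF)
delta_mh = -2.35 * math.log(alpha_mh)   # ETS delta metric; larger |delta| = more DIF

print(f"alpha_MH = {alpha_mh:.2f}, MH D-DIF = {delta_mh:.2f}")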
Peer reviewed
Chun, Christian W. – Language Assessment Quarterly, 2006
This article presents an analysis of Ordinate Corporation's PhonePass Spoken English Test-10. The company promotes this product as being a useful assessment tool for screening job candidates' ability in spoken English. In the real-life domain of the work environment, one of the primary target language use tasks involves extended production…
Descriptors: Language Tests, English (Second Language), Speech Tests, Screening Tests
Peer reviewed
Kinicki, Angelo J.; And Others – Educational and Psychological Measurement, 1985
Using both the Behaviorally Anchored Rating Scales (BARS) and the Purdue University Scales, 727 undergraduates rated 32 instructors. The BARS had less halo effect, more leniency error, and lower interrater reliability. Both formats were valid. The two tests did not differ in rate discrimination or susceptibility to rating bias. (Author/GDC)
Descriptors: Behavior Rating Scales, College Faculty, Comparative Testing, Higher Education
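The halo and leniency comparisons in the Kinicki et al. (1985) entry above are often operationalized with simple summary indices. The sketch below computes a grand-mean leniency index and a within-ratee spread as a rough halo proxy on hypothetical ratings; it is not the authors' analysis.

# Minimal sketch (hypothetical ratings): leniency as the grand mean, and a halo
# proxy as the average spread across rating dimensions within each instructor
# (less spread suggests more halo).
import statistics

# Rows = instructors, columns = rating dimensions on a 1-7 scale (hypothetical).
ratings = [
    [6, 6, 5, 6],
    [4, 5, 4, 4],
    [7, 6, 7, 7],
]

leniency = statistics.mean(v for row in ratings for v in row)
halo_proxy = statistics.mean(statistics.stdev(row) for row in ratings)

print(f"leniency (grand mean) = {leniency:.2f}, mean within-ratee SD = {halo_proxy:.2f}")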
Sireci, Stephen G.; Foster, David F.; Robin, Frederic; Olsen, James – 1997
Evaluating the comparability of a test administered in different languages is a difficult, if not impossible, task. Comparisons are problematic because observed differences in test performance between groups who take different language versions of a test could be due to a difference in difficulty between the tests, to cultural differences in test…
Descriptors: Adaptive Testing, Adults, Certification, Comparative Analysis