ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	14
Since 2007 (last 20 years)	26

Descriptor

Comparative Analysis	57
Test Format	57
Test Reliability	57
Test Validity	25
Test Items	20
Computer Assisted Testing	18
Multiple Choice Tests	15
Foreign Countries	13
Higher Education	13
Test Construction	13
Language Tests	12
Scores	12
Difficulty Level	10
Statistical Analysis	10
Item Response Theory	9
Correlation	8
College Students	7
Item Analysis	7
Second Language Learning	7
English (Second Language)	6
Language Proficiency	6
Test Length	6
Testing	6
Achievement Tests	5
Cloze Procedure	5
More ▼

Publication Type

Reports - Research	46
Journal Articles	37
Speeches/Meeting Papers	13
Reports - Evaluative	4
Reports - Descriptive	3
Information Analyses	2
Books	1
Guides - Non-Classroom	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
More ▼

Education Level

Higher Education	10
Postsecondary Education	7
High Schools	3
Secondary Education	2
Elementary Secondary Education	1
Grade 8	1
Middle Schools	1

Audience

Practitioners	1
Researchers	1
Teachers	1

Location

Japan	2
Finland	1
France	1
Hong Kong	1
Iran	1
Israel	1
Maryland	1
Missouri	1
Taiwan	1
Turkey	1
Turkey (Ankara)	1
United Kingdom	1
United Kingdom (Belfast)	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

ACTFL Oral Proficiency…	1
California Critical Thinking…	1
Defining Issues Test	1
Embedded Figures Test	1
English Proficiency Test	1
Praxis Series	1
SAT (College Admission Test)	1
Strong Campbell Interest…	1
Torrance Tests of Creative…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 57 results Save | Export

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

Are the Verbal TTCT Forms Actually Interchangeable?

Peer reviewed

Direct link

Grajzel, Katalin; Dumas, Denis; Acar, Selcuk – Journal of Creative Behavior, 2022

One of the best-known and most frequently used measures of creative idea generation is the Torrance Test of Creative Thinking (TTCT). The TTCT Verbal, assessing verbal ideation, contains two forms created to be used interchangeably by researchers and practitioners. However, the parallel forms reliability of the two versions of the TTCT Verbal has…

Descriptors: Test Reliability, Creative Thinking, Creativity Tests, Verbal Ability

Item Response Theory, Computer Adaptive Testing and the Risk of Self-Deception

Download full text

Benton, Tom – Research Matters, 2021

Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…

Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level

Reliability and Validity of Methods to Assess Undergraduate Healthcare Student Performance in Pharmacology: Comparison of Open Book versus Time-Limited Closed Book Examinations

Peer reviewed
PDF on ERIC

Download full text

David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023

We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…

Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

A Review and Analysis of Selected School Climate Measures

Peer reviewed

Direct link

Olsen, Jacob; Preston, Angela I.; Algozzine, Bob; Algozzine, Kate; Cusumano, Dale – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2018

Although it is widely agreed that there is no universally accepted definition for school climate, most professionals ground it in shared beliefs, values, and attitudes reflecting the quality and character of life in schools. In this article, we review and analyze measures accessible to school personnel charged with documenting and monitoring…

Descriptors: Educational Environment, Measures (Individuals), School Personnel, Test Format

Effects of Situational Judgment Test Format on Reliability and Validity

Peer reviewed

Direct link

Martin-Raugh, Michelle P.; Anguiano-Carrsaco, Cristina; Jackson, Teresa; Brenneman, Meghan W.; Carney, Lauren; Barnwell, Patrick; Kochert, Jonathan – International Journal of Testing, 2018

Single-response situational judgment tests (SRSJTs) differ from multiple-response SJTs (MRSJTS) in that they present test takers with edited critical incidents and simply ask test takers to read over the action described and evaluate it according to its effectiveness. Research comparing the reliability and validity of SRSJTs and MRSJTs is thus far…

Descriptors: Test Format, Test Reliability, Test Validity, Predictive Validity

Same Test, Better Scores: Boosting the Reliability of Short Online Intelligence Recruitment Tests with Nested Logit Item Response Theory Models

Peer reviewed
PDF on ERIC

Download full text

Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019

Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…

Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

A Comparison of Two Content Area Curriculum-Based Measurement Tools

Peer reviewed

Direct link

Ford, Jeremy W.; Conoyer, Sarah J.; Lembke, Erica S.; Smith, R. Alex; Hosp, John L. – Assessment for Effective Intervention, 2018

In the present study, two types of curriculum-based measurement (CBM) tools in science, Vocabulary Matching (VM) and Statement Verification for Science (SV-S), a modified Sentence Verification Technique, were compared. Specifically, this study aimed to determine whether the format of information presented (i.e., SV-S vs. VM) produces differences…

Descriptors: Curriculum Based Assessment, Evaluation Methods, Measurement Techniques, Comparative Analysis

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Reducing the Need for Guesswork in Multiple-Choice Tests

Peer reviewed

Direct link

Bush, Martin – Assessment & Evaluation in Higher Education, 2015

The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time consuming to take, and for some subject areas, it can be…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability

Multiple Mini-Interviews in the Age of the Internet: Does Preparation Help Applicants to Medical School?

Peer reviewed

Direct link

Moshinsky, Avital; Ziegler, David; Gafni, Naomi – International Journal of Testing, 2017

Many medical schools have adopted multiple mini-interviews (MMI) as an advanced selection tool. MMIs are expensive and used to test only a few dozen candidates per day, making it infeasible to develop a different test version for each test administration. Therefore, some items are reused both within and across years. This study investigated the…

Descriptors: Interviews, Medical Schools, Test Validity, Test Reliability

Comparing the OPI and the OPIc: The Effect of Test Method on Oral Proficiency Scores and Student Preference

Peer reviewed

Direct link

Thompson, Gregory L.; Cox, Troy L.; Knapp, Nieves – Foreign Language Annals, 2016

While studies have been done to rate the validity and reliability of the Oral Proficiency Interview (OPI) and Oral Proficiency Interview-Computer (OPIc) independently, a limited amount of research has analyzed the interexam reliability of these tests, and studies have yet to be conducted comparing the results of Spanish language learners who take…

Descriptors: Comparative Analysis, Oral Language, Language Proficiency, Spanish

A Comparison of Three Test Formats to Assess Word Difficulty

Peer reviewed

Direct link

Culligan, Brent – Language Testing, 2015

This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…

Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

ETS Research Report Series	3
Journal of Educational…	3
Language Testing	3
Educational and Psychological…	2
International Journal of…	2
Advances in Health Sciences…	1
Alberta Journal of…	1
Assessment & Evaluation in…	1
Assessment for Effective…	1
Clearing House: A Journal of…	1
College Board	1
Education and Information…	1
Educational Research and…	1
Foreign Language Annals	1
Higher Education	1
Hispanic Journal of…	1
IRAL	1
Journal of Applied Testing…	1
Journal of Creative Behavior	1
Journal of Intelligence	1
Journal of Interactive Online…	1
Journal of Language and…	1
Journal of Learning in Higher…	1
Journal on English Language…	1
Language Assessment Quarterly	1
More ▼

Federico, Pat-Anthony	2
Acar, Selcuk	1
Alemi, Minoo	1
Algozzine, Bob	1
Algozzine, Kate	1
Allison, Donald E.	1
Alpayar, Cagla	1
Anguiano-Carrsaco, Cristina	1
Barnwell, Patrick	1
Baron, Simon	1
Barrio, Concepcion	1
Bauer, Daniel	1
Benton, Tom	1
Bernard, David	1
Brenneman, Meghan	1
Brenneman, Meghan W.	1
Brown, James Dean	1
Bush, Martin	1
Carney, Lauren	1
Castellano, Karen	1
Chase, Clinton I.	1
Coats, Pamela K.	1
Coniam, David	1
Conoyer, Sarah J.	1
More ▼