ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	19

Descriptor

Language Tests	30
Multiple Choice Tests	30
English (Second Language)	18
Second Language Learning	13
Test Items	11
Foreign Countries	10
Reading Comprehension	10
Test Validity	10
Reading Tests	9
Second Language Instruction	7
Test Format	7
Language Proficiency	6
Scores	6
Test Reliability	6
Test Construction	5
Standardized Tests	4
Test Use	4
Testing Problems	4
Cloze Procedure	3
Construct Validity	3
Correlation	3
Elementary Secondary Education	3
French	3
German	3
Higher Education	3

Source

Language Testing	6
Language Assessment Quarterly	2
Canadian Modern Language…	1
Educational Measurement:…	1
Educational Technology	1
Educational Testing Service	1
Educational and Psychological…	1
Francais dans le Monde	1
Frontiers of Education in…	1
HOW	1
International Journal of…	1
Journal of Educational…	1
Language Learning in Higher…	1
Online Submission	1
TESL-EJ	1
TESOL Quarterly: A Journal…	1

Publication Type

Reports - Evaluative	30
Journal Articles	20
Speeches/Meeting Papers	6
Guides - Classroom - Teacher	1
Numerical/Quantitative Data	1
Opinion Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	4
Elementary Education	3
Postsecondary Education	3
Secondary Education	2
Early Childhood Education	1
Grade 12	1
Grade 4	1
Intermediate Grades	1
Primary Education	1

Audience

Practitioners	3
Teachers	3

Location

Netherlands	4
Canada	1
Colombia	1
Finland	1
Ohio	1
Singapore	1
South Korea	1
United Arab Emirates	1

Assessments and Surveys

Test of English as a Foreign…	4
Graduate Record Examinations	1
International English…	1

Showing 1 to 15 of 30 results

Use of Adjustment by Minimum Discriminant Information in Linking Constructed-Response Test Scores in the Absence of Common Items

Peer reviewed

Lee, Yi-Hsuan; Haberman, Shelby J.; Dorans, Neil J. – Journal of Educational Measurement, 2019

In many educational tests, both multiple-choice (MC) and constructed-response (CR) sections are used to measure different constructs. In many common cases, security concerns lead to the use of form-specific CR items that cannot be used for equating test scores, along with MC sections that can be linked to previous test forms via common items. In…

Descriptors: Scores, Multiple Choice Tests, Test Items, Responses

A Response to Holster and Lake Regarding Guessing and the Rasch Model

Peer reviewed

Stewart, Jeffrey; McLean, Stuart; Kramer, Brandon – Language Assessment Quarterly, 2017

Stewart questioned vocabulary size estimation methods proposed by Beglar and Nation for the Vocabulary Size Test, further arguing Rasch mean square (MSQ) fit statistics cannot determine the proportion of random guesses contained in the average learner's raw score, because the average value will be near 1 by design. He illustrated this by…

Descriptors: Guessing (Tests), Item Response Theory, Language Tests, Vocabulary

Topic Familiarity Matters: A Critical Analysis of TOEFL iBT Reading Section

Peer reviewed

Toker, Deniz – TESL-EJ, 2019

The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

Designing Language Assessments in Context: Theoretical, Technical, and Institutional Considerations

Peer reviewed

Giraldo, Frank – HOW, 2019

The purpose of this article of reflection is to raise awareness of how poor design of language assessments may have detrimental effects, if crucial qualities and technicalities of test design are not met. The article first discusses these central qualities for useful language assessments. Then, guidelines for creating listening assessments, as an…

Descriptors: Test Construction, Consciousness Raising, Language Tests, Second Language Learning

The Alignment between easyCBM® Literacy Assessments and Ohio's Learning Standards

Alonzo, Julie – Online Submission, 2019

This manuscript presents the results of an analysis between the easyCBM® literacy assessments, Grades K-3, and Ohio's Learning Standards related to literacy at those grade levels. Results indicate some alignment at each grade level, with the greatest degree of alignment in Grade 3.

Descriptors: Curriculum Based Assessment, State Standards, Academic Standards, Alignment (Education)

Can a Two-Question Test Be Reliable and Valid for Predicting Academic Outcomes?

Peer reviewed

Bridgeman, Brent – Educational Measurement: Issues and Practice, 2016

Scores on essay-based assessments that are part of standardized admissions tests are typically given relatively little weight in admissions decisions compared to the weight given to scores from multiple-choice assessments. Evidence is presented to suggest that more weight should be given to these assessments. The reliability of the writing scores…

Descriptors: Multiple Choice Tests, Scores, Standardized Tests, Comparative Analysis

Pursuing the Qualities of a "Good" Test

Peer reviewed

Coniam, David – Frontiers of Education in China, 2014

This article examines the issue of the quality of teacher-produced tests, limiting itself in the current context to objective, multiple-choice tests. The article investigates a short, two-part 20-item English language test. After a brief overview of the key test qualities of reliability and validity, the article examines the two subtests in terms…

Descriptors: Teacher Made Tests, English (Second Language), Language Tests, Multiple Choice Tests

Do Multiple-Choice Options Inflate Estimates of Vocabulary Size on the VST?

Peer reviewed

Stewart, Jeffrey – Language Assessment Quarterly, 2014

Validated under a Rasch framework (Beglar, 2010), the Vocabulary Size Test (VST) (Nation & Beglar, 2007) is an increasingly popular measure of decontextualized written receptive vocabulary size in the field of second language acquisition. However, although the validation indicates that the test has high internal reliability, still unaddressed…

Descriptors: Multiple Choice Tests, Vocabulary, Language Tests, Receptive Language

Common Educational Proficiency Assessment (CEPA) in English

Peer reviewed

Coombe, Christine; Davidson, Peter – Language Testing, 2014

The Common Educational Proficiency Assessment (CEPA) is a large-scale, high-stakes, English language proficiency/placement test administered in the United Arab Emirates to Emirati nationals in their final year of secondary education or Grade 12. The purpose of the CEPA is to place students into English classes at the appropriate government…

Descriptors: Language Tests, High Stakes Tests, English (Second Language), Second Language Learning

Inquiry Pedagogy and Smartphones: Enabling a Change in School Culture

Norris, Cathleen A.; Soloway, Elliot; Tan, Chun Ming; Looi, Chee-Kit – Educational Technology, 2013

In this project report, the authors describe the five transformations that a primary school in Singapore underwent as it adopted an inquiry pedagogy and curriculum along with a 1:1 deployment of smartphones. Based on the students' end-of-year test scores, not only did the students learn the prescribed content, but also they developed key 21st…

Descriptors: School Culture, Foreign Countries, Inquiry, Teaching Methods

An Automated Method to Generate e-Learning Quizzes from Online Language Learner Writing

Peer reviewed

Flanagan, Brendan; Yin, Chengjiu; Hirokawa, Sachio; Hashimoto, Kiyota; Tabata, Yoshiyuki – International Journal of Distance Education Technologies, 2013

In this paper, the entries of Lang-8, which is a Social Networking Site (SNS) site for learning and practicing foreign languages, were analyzed and found to contain similar rates of errors for most error categories reported in previous research. These similarly rated errors were then processed using an algorithm to determine corrections suggested…

Descriptors: Social Networks, Computer Assisted Instruction, Educational Technology, Second Language Instruction

Special Features of Assessment in Reading Comprehension in a Finnish University Language Centre

Peer reviewed

Lehto, Marja-Liisa; Maijala, Minna – Language Learning in Higher Education, 2013

Since Finnish is not an Indo-European language, studying foreign languages in general and then also studying special fields through the medium of foreign languages may provide an extra difficulty for Finnish students. Most university language centres in Finland have organized reading comprehension courses in several foreign languages for the…

Descriptors: Foreign Countries, Reading Comprehension, Second Language Instruction, Second Language Learning

Does Linking Mixed-Format Tests Using a Multiple-Choice Anchor Produce Comparable Results for Male and Female Subgroups? Research Report. ETS RR-11-44

Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011

This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…

Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences

A Two-Stage Scoring Method to Enhance Accuracy of Performance Level Classification

Peer reviewed

Finkelman, Matthew; Darby, Mark; Nering, Michael – Educational and Psychological Measurement, 2009

Many tests classify each examinee into one of multiple performance levels on the basis of a combination of multiple-choice (MC) and constructed-response (CR) items. This study introduces a two-stage scoring method that identifies examinees whose MC scores place them near a cut point, advising scorers on which examinees will be most affected by…

Descriptors: Classification, Scoring, Multiple Choice Tests, Responses

A Meta-Analysis of Test Format Effects on Reading and Listening Test Performance: Focus on Multiple-Choice and Open-Ended Formats

Peer reviewed

In'nami, Yo; Koizumi, Rie – Language Testing, 2009

A meta-analysis was conducted on the effects of multiple-choice and open-ended formats on L1 reading, L2 reading, and L2 listening test performance. Fifty-six data sources located in an extensive search of the literature were the basis for the estimates of the mean effect sizes of test format effects. The results using the mixed effects model of…

Descriptors: Test Format, Listening Comprehension Tests, Multiple Choice Tests, Program Effectiveness

Author

Stewart, Jeffrey	2
Alonzo, Julie	1
Barton, Paul E.	1
Bensoussan, Marsha	1
Bridgeman, Brent	1
Bruton, Anthony	1
Choi, Hyeran	1
Choi, Inn-Chull	1
Cohen, Andrew D.	1
Coley, Richard J.	1
Coniam, David	1
Coombe, Christine	1
Darby, Mark	1
Davidson, Peter	1
Dorans, Neil J.	1
Ferne, Tracy	1
Finkelman, Matthew	1
Flanagan, Brendan	1
Giraldo, Frank	1
Haberman, Shelby J.	1
Hashimoto, Kiyota	1
Hirokawa, Sachio	1
In'nami, Yo	1
Johnson, Patricia	1
Kim, Sooyeon	1