Showing 1 to 15 of 24 results
Papageorgiou, Spiros; Davis, Larry; Norris, John M.; Garcia Gomez, Pablo; Manna, Venessa F.; Monfils, Lora – Educational Testing Service, 2021
The "TOEFL® Essentials"™ test is a new English language proficiency test in the "TOEFL"® family of assessments. It measures foundational language skills and communication abilities in academic and general (daily life) contexts. The test covers the four language skills of reading, listening, writing, and speaking and is intended…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Language Proficiency
Bailey, Alison L.; Wolf, Mikyung Kim; Ballard, Laura – Educational Testing Service, 2022
This research note focuses on the alignment aspect of English language proficiency (ELP) assessments, one of the required types of validity evidence for the federal peer review process of states' assessment systems. A basic tenet of current U.S. education policy is the alignment between what a test assesses and what content has been determined as…
Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Alignment (Education)
Papageorgiou, Spiros; Xu, Xiaoqiu; Timpe-Laughlin, Veronika; Dugdale, Deborah M. – Educational Testing Service, 2020
The purpose of this study is to examine the appropriateness of using the "TOEFL Primary®" tests to evaluate the language abilities of students learning English as a foreign language (EFL) through an online-delivered curriculum, the VIPKid Major Course (MC). Data include student test scores on the TOEFL Primary Listening and Reading tests…
Descriptors: Alignment (Education), Language Tests, English (Second Language), Second Language Learning
Schmidgall, Jonathan – Educational Testing Service, 2021
The redesigned "TOEIC Bridge"® tests are designed to measure the reading, listening, speaking, and writing proficiency of beginning to low-intermediate English learners in the context of everyday adult life. This report describes the comprehensive and multifaceted process used to enhance the meaningfulness of TOEIC Bridge test score…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Language Proficiency
Haberman, Shelby J.; Yan, Duanli – Educational Testing Service, 2011
Continuous exponential families are applied to linking test forms via an internal anchor. This application combines work on continuous exponential families for single-group designs and work on continuous exponential families for equivalent-group designs. Results are compared to those for kernel and equipercentile equating in the case of chained…
Descriptors: Equated Scores, Statistical Analysis, Language Tests, Mathematics Tests
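The kernel and equipercentile methods this report compares share one underlying idea: a score on one form is mapped to the score on the other form that has the same percentile rank. A minimal illustrative sketch of plain equipercentile equating (not the continuous exponential family method the report develops), using invented score samples from two equivalent groups:

```python
from bisect import bisect_right

def percentile_rank(x, scores):
    """Fraction of observed scores at or below x."""
    ordered = sorted(scores)
    return bisect_right(ordered, x) / len(ordered)

def equipercentile(x, scores_x, scores_y):
    """Map a form-X score to the form-Y score at the same empirical quantile."""
    p = percentile_rank(x, scores_x)
    ys = sorted(scores_y)
    return ys[max(int(p * len(ys)) - 1, 0)]

# Invented samples, one per form, as in an equivalent-groups design
form_x = [1, 2, 3, 4]
form_y = [10, 20, 30, 40]
equated = equipercentile(2, form_x, form_y)  # a form-X 2 maps to a form-Y 20
```

Continuous exponential families, like kernel equating, replace the discrete empirical quantiles above with smooth estimated score distributions before matching percentiles.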
Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011
This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…
Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences
Sinharay, Sandip; Haberman, Shelby J.; Jia, Helena – Educational Testing Service, 2011
Standard 3.9 of the "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council on Measurement in Education, 1999) demands evidence of model fit when an item response theory (IRT) model is used to make inferences from a data set. We applied two recently…
Descriptors: Item Response Theory, Goodness of Fit, Statistical Analysis, Language Tests
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater® to score the TOEFL iBT® Writing test. These approaches involve alternate criteria. In the first approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the second approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Attali, Yigal – Educational Testing Service, 2011
The e-rater® automated essay scoring system is used operationally in the scoring of TOEFL iBT® independent essays. Previous research has found support for a 3-factor structure of the e-rater features. This 3-factor structure has an attractive hierarchical linguistic interpretation with a word choice factor, a grammatical convention within a…
Descriptors: Essay Tests, Language Tests, Test Scoring Machines, Automation
Haberman, Shelby J.; Sinharay, Sandip – Educational Testing Service, 2011
Subscores are reported for several operational assessments. Haberman (2008) suggested a method based on classical test theory to determine if the true subscore is predicted better by the corresponding subscore or the total score. Researchers are often interested in learning how different subgroups perform on subtests. Stricker (1993) and…
Descriptors: True Scores, Test Theory, Prediction, Group Membership
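The Haberman (2008) criterion referenced in this abstract can be stated compactly: report a subscore only if the observed subscore predicts the true subscore better, in proportional reduction of mean squared error (PRMSE), than the total score does. Under classical test theory, the PRMSE of the observed subscore is its reliability, and the PRMSE of the total score is its squared correlation with the true subscore. A hedged sketch, with all summary statistics invented for illustration:

```python
def prmse_from_subscore(rel_s):
    """PRMSE of the observed subscore as a predictor of the true subscore:
    under classical test theory this is simply the subscore reliability."""
    return rel_s

def prmse_from_total(var_s, rel_s, var_x, cov_sx):
    """PRMSE of the total score x as a predictor of the true subscore."""
    var_tau_s = rel_s * var_s            # true-subscore variance
    var_e_s = (1.0 - rel_s) * var_s      # subscore error variance
    cov_x_tau = cov_sx - var_e_s         # x contains the subscore's error term
    return cov_x_tau ** 2 / (var_x * var_tau_s)

# Invented statistics: subscore variance 25, reliability .80,
# total-score variance 100, subscore/total covariance 40
adds_value = prmse_from_subscore(0.8) > prmse_from_total(25, 0.8, 100, 40)
```

With these numbers the subscore PRMSE (.80) exceeds the total-score PRMSE (about .61), so reporting the subscore would be justified under the criterion.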
Attali, Yigal – Educational Testing Service, 2011
This paper proposes an alternative content measure for essay scoring, based on the "difference" in the relative frequency of a word in high-scored versus low-scored essays. The "differential word use" (DWU) measure is the average of these differences across all words in the essay. A positive value indicates the essay is using…
Descriptors: Scoring, Essay Tests, Word Frequency, Content Analysis
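The DWU computation this abstract describes is simple enough to sketch directly: estimate each word's relative frequency in high-scored and in low-scored training essays, then average the per-word differences across the words of the essay being scored. A minimal sketch with invented toy essays (the corpora and the scored essay are placeholders):

```python
from collections import Counter

def relative_frequencies(essays):
    """Relative frequency of each word across a list of essays."""
    counts = Counter(word for essay in essays for word in essay.split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def dwu(essay, high_freq, low_freq):
    """Average, over the essay's words, of (high-corpus - low-corpus)
    relative frequency; positive values suggest high-scored vocabulary."""
    diffs = [high_freq.get(w, 0.0) - low_freq.get(w, 0.0)
             for w in essay.split()]
    return sum(diffs) / len(diffs) if diffs else 0.0

# Invented training corpora split by score band
high = ["the argument is cogent and well supported",
        "a cogent well reasoned argument"]
low = ["it is good", "good good it is"]
score = dwu("a cogent argument",
            relative_frequencies(high), relative_frequencies(low))
```

The sign of the result is what matters: a positive value indicates the essay leans on vocabulary characteristic of high-scored essays, as the abstract notes.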
DeCarlo, Lawrence T. – Educational Testing Service, 2010
A basic consideration in large-scale assessments that use constructed response (CR) items, such as essays, is how to allocate the essays to the raters that score them. Designs that are used in practice are incomplete, in that each essay is scored by only a subset of the raters, and also unbalanced, in that the number of essays scored by each rater…
Descriptors: Test Items, Responses, Essay Tests, Scoring
Li, Yanmei; Li, Shuhong; Wang, Lin – Educational Testing Service, 2010
Many standardized educational tests include groups of items based on a common stimulus, known as "testlets". Standard unidimensional item response theory (IRT) models are commonly used to model examinees' responses to testlet items. However, it is known that local dependence among testlet items can lead to biased item parameter estimates…
Descriptors: English, Language Tests, Reading Tests, Item Response Theory
Powers, Donald E.; Kim, Hae-Jin; Yu, Feng; Weng, Vincent Z.; VanWinkle, Waverely – Educational Testing Service, 2009
To facilitate the interpretation of test scores from the new TOEIC® (Test of English for International Communication™) speaking and writing tests as measures of English-language proficiency, we administered a self-assessment inventory to TOEIC examinees in Japan and Korea, to gather their perceptions of their ability to perform a variety of…
Descriptors: English for Special Purposes, Language Tests, Writing Tests, Speech Tests
Liu, Ou Lydia; Schedl, Mary; Malloy, Jeanne; Kong, Nan – Educational Testing Service, 2009
The TOEFL iBT™ has increased the length of the reading passages in the reading section compared to the passages on the TOEFL® computer-based test (CBT) to better approximate academic reading in North American universities, resulting in a reduced number of passages in the reading test. A concern arising from this change is whether the decrease…
Descriptors: English (Second Language), Language Tests, Internet, Computer Assisted Testing