Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 14 |
| Since 2007 (last 20 years) | 22 |
Descriptor
| Multiple Choice Tests | 52 |
| Reading Tests | 52 |
| Test Validity | 52 |
| Reading Comprehension | 33 |
| Foreign Countries | 22 |
| Test Reliability | 22 |
| Language Tests | 20 |
| Test Construction | 20 |
| English (Second Language) | 17 |
| Test Items | 17 |
| Cloze Procedure | 15 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 3 |
| O'Reilly, Robert P. | 3 |
| Anderson, Daniel | 2 |
| Bensoussan, Marsha | 2 |
| Carver, Ronald P. | 2 |
| Pyrczak, Fred | 2 |
| Zeraatpishe, Mitra | 2 |
| Akyol, Hayati | 1 |
| Ali Zahabi | 1 |
| Appenzellar, Anne B. | 1 |
| Arnold, Sharon | 1 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 10 |
| Postsecondary Education | 8 |
| Secondary Education | 8 |
| Elementary Education | 6 |
| Grade 4 | 5 |
| Junior High Schools | 5 |
| Middle Schools | 5 |
| Grade 6 | 4 |
| Grade 7 | 4 |
| Intermediate Grades | 4 |
| Grade 2 | 3 |
| More ▼ | |
Audience
| Teachers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Budi Waluyo; Ali Zahabi; Luksika Ruangsung – rEFLections, 2024
The increasing popularity of the Common European Framework of Reference (CEFR) in non-native English-speaking countries has generated a demand for concrete examples in the creation of CEFR-based tests that assess the four main English skills. In response, this research endeavors to provide insight into the development and validation of a…
Descriptors: Language Tests, Language Proficiency, Undergraduate Students, Language Skills
New York State Education Department, 2024
The New York State Education Department (NYSED) has a partnership with NWEA for the development of the 2024 Grades 3-8 English Language Arts Tests. Teachers from across the State work with NYSED in a variety of activities to ensure the validity and reliability of the New York State Testing Program (NYSTP). The 2024 Grades 6 and 7 English Language…
Descriptors: Language Tests, Test Format, Language Arts, English Instruction
Özdemir, Ezgi Çetinkaya; Akyol, Hayati – Universal Journal of Educational Research, 2019
Reading comprehension has an important place in lifelong learning. It is an interactive process between the reader and the text. Students need reading comprehension skills at all educational levels and for all school subjects. Determining the level of students' reading comprehension skills is the subject of testing and evaluation. Tests used to…
Descriptors: Reading Comprehension, Reading Tests, Test Construction, Grade 4
Martínez-Huertas, José Á.; Jastrzebska, Olga; Olmos, Ricardo; León, José A. – Assessment & Evaluation in Higher Education, 2019
Automated summary evaluation is proposed as an alternative to rubrics and multiple-choice tests in knowledge assessment. Inbuilt rubric is a recent Latent Semantic Analysis (LSA) method that implements rubrics in an artificially-generated semantic space. It was compared with classical LSA's cosine-based methods assessing knowledge in a…
Descriptors: Automation, Scoring Rubrics, Alternative Assessment, Test Reliability
Roy-Charland, Annie; Colangelo, Gabrielle; Foglia, Victoria; Reguigui, Leïla – Reading and Writing: An Interdisciplinary Journal, 2017
In tests used to measure reading comprehension, validity is important in obtaining accurate results. Unfortunately, studies have shown that people can correctly answer some questions of these tests without reading the related passage. These findings bring forth the need to address whether this phenomenon is observed in multiple-choice only tests…
Descriptors: Standardized Tests, Reading Tests, Reading Comprehension, Test Validity
Yazdinejad, Anoushe; Zeraatpishe, Mitra – International Journal of Language Testing, 2019
In this study the validity of partial dictation as a measure of overall language proficiency was examined. Two partial dictation tests along with a C-Test, a cloze test, and a reading comprehension test, as criterion measures, were administered to a group of Iranian EFL learners. The coefficients of correlation between partial dictation and…
Descriptors: Test Validity, Verbal Communication, Language Proficiency, Language Tests
Toker, Deniz – TESL-EJ, 2019
The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Lim, Hyojung – Language Testing in Asia, 2019
Background: This study aims to empirically answer the question of whether the role of sub-reading skills changes depending on the test format (e.g., multiple-choice vs. open-ended reading questions). The test format effect also addresses the issue of test validity--whether the reading test properly elicits construct-relevant reading skills or…
Descriptors: Foreign Countries, Test Format, Language Tests, English (Second Language)
Tonekaboni, Fateme Roohani; Ravand, Hamdollah; Rezvani, Reza – International Journal of Language Testing, 2021
Investigating the processes underlying test performance is a major source of data supporting the explanation inference in the validity argument (Chappelle, 2021). One way of modeling the cognitive processes underlying test performance is by constructing a Q-matrix, which is essentially about summarizing the attributes explaining test-takers'…
Descriptors: Reading Comprehension, Reading Tests, High Stakes Tests, Inferences
Zare, Samaneh; Boori, Ali Akbar – International Journal of Language Testing, 2018
In this study, the cloze-elide test was developed and administered under time constraints. This research is aimed to examine the validity and reliability of the speeded cloze-elide test and investigate its relationship with reading comprehension, C-Test, and multiple-choice cloze test. Processing speed is a vital indicator to distinguish high to…
Descriptors: Cloze Procedure, Timed Tests, Language Tests, English (Second Language)
Sheybani, Elias; Zeraatpishe, Mitra – International Journal of Language Testing, 2018
Test method is deemed to affect test scores along with examinee ability (Bachman, 1996). In this research the role of method facet in reading comprehension tests is studied. Bachman divided method facet into five categories, one category is the nature of input and the nature of expected response. This study examined the role of method effect in…
Descriptors: Reading Comprehension, Reading Tests, Test Items, Test Format
Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018
In response to a request for additional analyses, in particular reporting confidence intervals around the results, we re-analyzed the data from prior studies. This supplementary report presents the results of the additional analyses addressing classification accuracy, reliability, and criterion-related validity evidence. For ease of reference, we…
Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Classification
Arnold, Sharon; Reed, Phil – British Journal of Special Education, 2016
Schools have an obligation to assess the literacy skills of their students, and the provision of reading instruction to students includes the ability to measure progress in this area. However, the design of reading tests includes the ability not only to read words, but also the ability to verbalise them. This presents a particular challenge for…
Descriptors: Reading Tests, Summative Evaluation, Special Education, Pervasive Developmental Disorders
Nese, Joseph F. T.; Anderson, Daniel; Irvin, P. Shawn; Alonzo, Julie – Behavioral Research and Teaching, 2018
This in-brief technical report documents the results from two different analytic approaches for examining the reliability of the slope for easyCBM® reading measures in Grades K-8. Results varied by grade, assessment measure, and the analytic approach. Results patterns are discussed.
Descriptors: Curriculum Based Assessment, Response to Intervention, Kindergarten, Grade 1

Peer reviewed
Direct link
