NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)5
Since 2007 (last 20 years)19
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 30 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Yi-Hsuan; Haberman, Shelby J.; Dorans, Neil J. – Journal of Educational Measurement, 2019
In many educational tests, both multiple-choice (MC) and constructed-response (CR) sections are used to measure different constructs. In many common cases, security concerns lead to the use of form-specific CR items that cannot be used for equating test scores, along with MC sections that can be linked to previous test forms via common items. In…
Descriptors: Scores, Multiple Choice Tests, Test Items, Responses
Peer reviewed Peer reviewed
Direct linkDirect link
Stewart, Jeffrey; McLean, Stuart; Kramer, Brandon – Language Assessment Quarterly, 2017
Stewart questioned vocabulary size estimation methods proposed by Beglar and Nation for the Vocabulary Size Test, further arguing Rasch mean square (MSQ) fit statistics cannot determine the proportion of random guesses contained in the average learner's raw score, because the average value will be near 1 by design. He illustrated this by…
Descriptors: Guessing (Tests), Item Response Theory, Language Tests, Vocabulary
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Toker, Deniz – TESL-EJ, 2019
The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Giraldo, Frank – HOW, 2019
The purpose of this article of reflection is to raise awareness of how poor design of language assessments may have detrimental effects, if crucial qualities and technicalities of test design are not met. The article first discusses these central qualities for useful language assessments. Then, guidelines for creating listening assessments, as an…
Descriptors: Test Construction, Consciousness Raising, Language Tests, Second Language Learning
Alonzo, Julie – Online Submission, 2019
This manuscript presents the results of an analysis between the easyCBM® literacy assessments, Grades K-3, and Ohio's Learning Standards related to literacy at those grade levels. Results indicate some alignment at each grade level, with the greatest degree of alignment in Grade 3.
Descriptors: Curriculum Based Assessment, State Standards, Academic Standards, Alignment (Education)
Peer reviewed Peer reviewed
Direct linkDirect link
Bridgeman, Brent – Educational Measurement: Issues and Practice, 2016
Scores on essay-based assessments that are part of standardized admissions tests are typically given relatively little weight in admissions decisions compared to the weight given to scores from multiple-choice assessments. Evidence is presented to suggest that more weight should be given to these assessments. The reliability of the writing scores…
Descriptors: Multiple Choice Tests, Scores, Standardized Tests, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Coniam, David – Frontiers of Education in China, 2014
This article examines the issue of the quality of teacher-produced tests, limiting itself in the current context to objective, multiple-choice tests. The article investigates a short, two-part 20-item English language test. After a brief overview of the key test qualities of reliability and validity, the article examines the two subtests in terms…
Descriptors: Teacher Made Tests, English (Second Language), Language Tests, Multiple Choice Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Stewart, Jeffrey – Language Assessment Quarterly, 2014
Validated under a Rasch framework (Beglar, 2010), the Vocabulary Size Test (VST) (Nation & Beglar, 2007) is an increasingly popular measure of decontextualized written receptive vocabulary size in the field of second language acquisition. However, although the validation indicates that the test has high internal reliability, still unaddressed…
Descriptors: Multiple Choice Tests, Vocabulary, Language Tests, Receptive Language
Peer reviewed Peer reviewed
Direct linkDirect link
Coombe, Christine; Davidson, Peter – Language Testing, 2014
The Common Educational Proficiency Assessment (CEPA) is a large-scale, high-stakes, English language proficiency/placement test administered in the United Arab Emirates to Emirati nationals in their final year of secondary education or Grade 12. The purpose of the CEPA is to place students into English classes at the appropriate government…
Descriptors: Language Tests, High Stakes Tests, English (Second Language), Second Language Learning
Norris, Cathleen A.; Soloway, Elliot; Tan, Chun Ming; Looi, Chee-Kit – Educational Technology, 2013
In this project report, the authors describe the five transformations that a primary school in Singapore underwent as it adopted an inquiry pedagogy and curriculum along with a 1:1 deployment of smartphones. Based on the students' end-of-year test scores, not only did the students learn the prescribed content, but also they developed key 21st…
Descriptors: School Culture, Foreign Countries, Inquiry, Teaching Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Flanagan, Brendan; Yin, Chengjiu; Hirokawa, Sachio; Hashimoto, Kiyota; Tabata, Yoshiyuki – International Journal of Distance Education Technologies, 2013
In this paper, the entries of Lang-8, which is a Social Networking Site (SNS) site for learning and practicing foreign languages, were analyzed and found to contain similar rates of errors for most error categories reported in previous research. These similarly rated errors were then processed using an algorithm to determine corrections suggested…
Descriptors: Social Networks, Computer Assisted Instruction, Educational Technology, Second Language Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Lehto, Marja-Liisa; Maijala, Minna – Language Learning in Higher Education, 2013
Since Finnish is not an Indo-European language, studying foreign languages in general and then also studying special fields through the medium of foreign languages may provide an extra difficulty for Finnish students. Most university language centres in Finland have organized reading comprehension courses in several foreign languages for the…
Descriptors: Foreign Countries, Reading Comprehension, Second Language Instruction, Second Language Learning
Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011
This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…
Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Finkelman, Matthew; Darby, Mark; Nering, Michael – Educational and Psychological Measurement, 2009
Many tests classify each examinee into one of multiple performance levels on the basis of a combination of multiple-choice (MC) and constructed-response (CR) items. This study introduces a two-stage scoring method that identifies examinees whose MC scores place them near a cut point, advising scorers on which examinees will be most affected by…
Descriptors: Classification, Scoring, Multiple Choice Tests, Responses
Peer reviewed Peer reviewed
Direct linkDirect link
In'nami, Yo; Koizumi, Rie – Language Testing, 2009
A meta-analysis was conducted on the effects of multiple-choice and open-ended formats on L1 reading, L2 reading, and L2 listening test performance. Fifty-six data sources located in an extensive search of the literature were the basis for the estimates of the mean effect sizes of test format effects. The results using the mixed effects model of…
Descriptors: Test Format, Listening Comprehension Tests, Multiple Choice Tests, Program Effectiveness
Previous Page | Next Page »
Pages: 1  |  2