Publication Date
| In 2026 | 0 |
| Since 2025 | 26 |
| Since 2022 (last 5 years) | 156 |
| Since 2017 (last 10 years) | 359 |
| Since 2007 (last 20 years) | 509 |
Descriptor
| Foreign Countries | 731 |
| Test Format | 731 |
| Test Items | 227 |
| Language Tests | 188 |
| English (Second Language) | 186 |
| Second Language Learning | 164 |
| Computer Assisted Testing | 162 |
| Multiple Choice Tests | 145 |
| Test Construction | 130 |
| Comparative Analysis | 127 |
| Scores | 117 |
Author
| Allalouf, Avi | 6 |
| Ellington, Henry | 4 |
| Goldhammer, Frank | 4 |
| Sireci, Stephen G. | 4 |
| Baghaei, Purya | 3 |
| Bulut, Okan | 3 |
| Cheng, Liying | 3 |
| DiBattista, David | 3 |
| Hambleton, Ronald K. | 3 |
| Höhne, Jan Karem | 3 |
| McLean, Stuart | 3 |
Audience
| Practitioners | 29 |
| Teachers | 26 |
| Researchers | 11 |
| Administrators | 7 |
| Students | 7 |
| Policymakers | 2 |
Location
| Canada | 62 |
| Turkey | 58 |
| Germany | 41 |
| United Kingdom | 36 |
| Australia | 34 |
| Japan | 34 |
| China | 33 |
| Iran | 25 |
| United Kingdom (England) | 25 |
| Netherlands | 24 |
| United States | 24 |
Laws, Policies, & Programs
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Stark, Tobias H.; Silber, Henning; Krosnick, Jon A.; Blom, Annelies G.; Aoyagi, Midori; Belchior, Ana; Bosnjak, Michael; Clement, Sanne Lund; John, Melvin; Jónsdóttir, Guðbjörg Andrea; Lawson, Karen; Lynn, Peter; Martinsson, Johan; Shamshiri-Petersen, Ditte; Tvinnereim, Endre; Yu, Ruoh-rong – Sociological Methods & Research, 2020
Questionnaire design is routinely guided by classic experiments on question form, wording, and context conducted decades ago. This article explores whether two question order effects (one due to the norm of evenhandedness and the other due to subtraction or perceptual contrast) appear in surveys of probability samples in the United States and 11…
Descriptors: Questionnaires, Test Format, Generalization, Foreign Countries
Chu, Man-Wai; Fung, Karen – Research in Science Education, 2020
Canadian students experience many different assessments throughout their schooling (O'Connor 2011). There are many benefits to using a variety of assessment types, item formats, and science-based performance tasks in the classroom to measure the many dimensions of science education. Although using a variety of assessments is beneficial, it is…
Descriptors: Student Evaluation, Science Achievement, Foreign Countries, Test Format
Remizova, Alisa; Rudnev, Maksim – International Journal of Social Research Methodology, 2020
The justifiability scale (JS) is widely used to measure individual and country differences in moral attitudes. However, the validity of the instrument has been barely assessed. The current study addressed the concurrent and content validity of four popular JS items (justifiability of homosexuality, suicide, prostitution, and euthanasia). A sample…
Descriptors: Moral Values, Content Validity, Attitude Measures, Foreign Countries
El Rassi, Mary Ann B. – International Association for Development of the Information Society, 2020
Despite the increased research interest on the implementation of Open Book Open Web exams in developed countries, there has been very little systematic studies that investigated the difference in gender experience and the cognitive process that could affect attitude towards OBOW exams compared to the traditional ones in developing countries. This…
Descriptors: Gender Differences, Student Attitudes, Test Format, Computer Assisted Testing
Arias, Angel; Blais, Jean-Guy – Canadian Modern Language Review, 2023
This article draws on argument-based validation to gather and evaluate construct-related evidence (i.e., the explanation inference) of a high-stakes test. The data stemmed from the listening component of a French test used for immigration to Canada through the province of Quebec. An expert panel with varied backgrounds in applied linguistics…
Descriptors: French, Listening Comprehension Tests, Second Language Learning, High Stakes Tests
Ali Akbar Ariamanesh; Hossein Barati; Manijeh Youhanaee – International Journal of Language Testing, 2023
The present study investigates the efficacy of preparation time in four speaking tasks of TOEFL iBT. As the current pre-task planning time offered by ETS is very short, 15 to 30 seconds, we intended to explore how the test-takers' speaking quality would change if the preparation time was added to the response time, giving the respondents a…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Test Preparation
Tim Stoeckel; Tomoko Ishii – Vocabulary Learning and Instruction, 2024
In an upcoming coverage-comprehension study, we plan to assess learners' meaning-recall knowledge of words as they occur in the study's reading passage. As several meaning-recall test formats exist, the purpose of this small-scale study (N = 10) was to determine which of three formats was most similar to a criterion interview regarding mean score…
Descriptors: Vocabulary Development, Language Tests, Second Language Learning, Classification
Wicaksono, Azizul Ghofar Candra; Korom, Erzsébet – Participatory Educational Research, 2022
The accuracy of learning results relies on the evaluation and assessment. The learning goals, including problem solving ability must be aligned with the valid standardized measurement tools. The study on exploring the nature of problem-solving, framework, and assessment in the Indonesian context will make contributions to problem solving…
Descriptors: Problem Solving, Educational Research, Test Construction, Test Validity
Opstad, Leiv – Athens Journal of Education, 2021
The discussion of whether multiple-choice questions can replace the traditional exam with essays and constructed questions in introductory courses has just started in Norway. There is not an easy answer. The findings depend on the pattern of the questions. Therefore, one must be careful in drawing conclusions. In this research, one will explore a…
Descriptors: Multiple Choice Tests, Essay Tests, Introductory Courses, Foreign Countries
Sutadji, Eddy; Susilo, Herawati; Wibawa, Aji Prasetya; Jabari, Nidal A. M.; Rohmad, Syaiful Nur – Education Sciences, 2021
Assessment methods are important to create qualified graduates who are ready to face the real world. Authentic assessment is considered to be the most effective method to achieve this. The application of authentic assessment is often universal. However, there is a difference between natural sciences and social sciences. If it is used for different…
Descriptors: Performance Based Assessment, Natural Sciences, Social Sciences, College Faculty
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Kurnaz-Adibatmaz, Fatma Betül; Yildiz, Hüseyin – Journal of Theoretical Educational Science, 2020
In this study logistic regression and Lord's Chi Square methods were used to research the items that have DIF. The study utilized Peabody Picture Vocabulary Test (PPVT). The original form of the PPVT includes four options. Three different forms (A, B and C) were formed by removing one of the distractors respectively. The original form of PPVT was…
Descriptors: Item Analysis, Test Items, Vocabulary, Verbal Ability
Jeffrey Martin – Vocabulary Learning and Instruction, 2022
The functioning of a vocabulary testing instrument rests in part on the test-taking actions made possible for examinees by item format, an aspect of test development that warrants consideration in second-language vocabulary research. For example, although iterations of the written receptive vocabulary levels test (VLT) have integrated improvements…
Descriptors: Test Wiseness, Vocabulary, Vocabulary Development, Second Language Learning
Papenberg, Martin; Diedenhofen, Birk; Musch, Jochen – Journal of Experimental Education, 2021
Testwiseness may introduce construct-irrelevant variance to multiple-choice test scores. Presenting response options sequentially has been proposed as a potential solution to this problem. In an experimental validation, we determined the psychometric properties of a test based on the sequential presentation of response options. We created a strong…
Descriptors: Test Wiseness, Test Validity, Test Reliability, Multiple Choice Tests
Falani, Ilham; Akbar, Maruf; Naga, Dali S. – International Journal of Instruction, 2020
This study compared the precision of ability estimation on different types of item response theory models for mixed-format data. Participants in this study were 1625 Junior High School Students in Depok, Indonesia. The mixed-format test was used to measure the students' ability in mathematics. The test used consists of multiple-choice and…
Descriptors: Foreign Countries, Junior High School Students, Ability, Item Response Theory

