Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)
Dambha, Tasneem; Swanepoel, De Wet; Mahomed-Asmail, Faheema; De Sousa, Karina C.; Graham, Marien A.; Smits, Cas – Journal of Speech, Language, and Hearing Research, 2022
Purpose: This study compared the test characteristics, test-retest reliability, and test efficiency of three novel digits-in-noise (DIN) test procedures to a conventional antiphasic 23-trial adaptive DIN (D23). Method: One hundred twenty participants with an average age of 42 years (SD = 19) were included. Participants were tested and retested…
Descriptors: Auditory Tests, Screening Tests, Efficiency, Test Format
Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management
Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023
Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers with a roadmap for gathering evidence to improve the quality of open-ended assessments. Based on statistical…
Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education
Celeste Combrinck – SAGE Open, 2024
We have less time and focus than ever before, while the demand for attention is increasing. Therefore, it is no surprise that when answering questionnaires, we often choose to strongly agree or be neutral, producing problematic and unusable data. The current study investigated forced-choice (ipsative) format compared to the same questions on a…
Descriptors: Likert Scales, Test Format, Surveys, Design
Duru, Erdinc; Ozgungor, Sevgi; Yildirim, Ozen; Duatepe-Paksu, Asuman; Duru, Sibel – International Journal of Assessment Tools in Education, 2022
The aim of this study is to develop a valid and reliable measurement tool that measures the critical thinking skills of university students. The Pamukkale Critical Thinking Skills Scale was developed as two separate forms: multiple-choice and open-ended. The validity and reliability studies of the multiple-choice form were constructed on two different…
Descriptors: Critical Thinking, Cognitive Measurement, Test Validity, Test Reliability
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
McLeod, Melissa; Cheng, Liying – Language Assessment Quarterly, 2023
The Canadian English Language Proficiency Index Program (CELPIP) Test was designed for immigration and citizenship in Canada. CELPIP is a computer-based English-language proficiency test which covers all four skills. This test review provides a description of the test and its construct, tasks, and delivery. Then, it appraises CELPIP for…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed-book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30 x free-response short answer to a question, SAQ) and end-of-year paper (4 x SAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Egbe, Cajetan Ikechukwu; Agbo, Philomina Akudo; Okwo, Frederick Amunabo; Agbo, George Chibuike – TechTrends: Linking Research and Practice to Improve Learning, 2023
In the Use of English programme of the general studies unit of Nigerian universities, computer-based testing (CBT) has been introduced as a novel approach for writing examinations, as against the paper-based test (PBT). For this reason, there is a need to ascertain students' perceptions of computer-based tests in the Use of English programme,…
Descriptors: Student Attitudes, Computer Assisted Testing, Undergraduate Students, Foreign Countries
Simic, Nataša; Marušic Jablanovic, Milica; Grbic, Sanja – Journal of Education for Teaching: International Research and Pedagogy, 2022
The aim of this study was to validate the structure of the "FIT-Choice scale" on a Serbian sample of pre-service teachers, as well as to determine the motivations and beliefs about the teaching profession, and test if motivation differs across different groups of pre-service teachers. After prospective class and subject teachers…
Descriptors: Foreign Countries, Likert Scales, Factor Structure, Factor Analysis
Fitria Lafifa; Dadan Rosana – Turkish Online Journal of Distance Education, 2024
This research aims to develop a multiple-choice closed-ended test to assess and evaluate students' digital literacy skills. The sample in this study consisted of students at MTsN 1 Blitar City who were selected using a purposive sampling technique. The test was also validated by experts, namely 2 Doctors of Physics and Science from Yogyakarta State…
Descriptors: Educational Innovation, Student Evaluation, Digital Literacy, Multiple Choice Tests
Wicaksono, Azizul Ghofar Candra; Korom, Erzsébet – Participatory Educational Research, 2022
The accuracy of learning results relies on evaluation and assessment. The learning goals, including problem-solving ability, must be aligned with valid standardized measurement tools. The study on exploring the nature of problem-solving, framework, and assessment in the Indonesian context will make contributions to problem solving…
Descriptors: Problem Solving, Educational Research, Test Construction, Test Validity
Papenberg, Martin; Diedenhofen, Birk; Musch, Jochen – Journal of Experimental Education, 2021
Testwiseness may introduce construct-irrelevant variance to multiple-choice test scores. Presenting response options sequentially has been proposed as a potential solution to this problem. In an experimental validation, we determined the psychometric properties of a test based on the sequential presentation of response options. We created a strong…
Descriptors: Test Wiseness, Test Validity, Test Reliability, Multiple Choice Tests
Senel, Selma; Senel, Hüseyin Can – Journal of Educational Technology and Online Learning, 2021
COVID-19 has changed the way we teach. Today, we have become far more experienced in the delivery of distance education and use of online tools. However, the quality of distance education and learning outcomes have become a matter of ongoing debate. Just as higher education aims to develop high-level skills in its students, researchers are seeking…
Descriptors: Test Format, Computer Assisted Testing, Formative Evaluation, Performance Based Assessment