Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 24 |
Descriptor
Test Format | 54 |
Test Items | 54 |
Test Construction | 24 |
Foreign Countries | 18 |
Multiple Choice Tests | 11 |
Difficulty Level | 10 |
Higher Education | 9 |
Student Evaluation | 8 |
Test Reliability | 8 |
Science Tests | 7 |
Secondary Education | 7 |
More ▼ |
Source
Author
Aksakalli, Ayhan | 1 |
Alexander, John J., Ed. | 1 |
Ault, Marilyn | 1 |
Baker, Eva | 1 |
Bennett, Randy Elliot | 1 |
Berger, Aliza E. | 1 |
Bokyoung Park | 1 |
Borowski, Andreas | 1 |
Boser, Judith A. | 1 |
Brownell, Sara E. | 1 |
Bulgreen, Janis A. | 1 |
More ▼ |
Publication Type
Education Level
Location
Canada | 4 |
Turkey | 4 |
Arizona | 2 |
Australia | 2 |
Germany | 2 |
Louisiana | 2 |
Canada (Ottawa) | 1 |
Florida | 1 |
Georgia | 1 |
Indiana | 1 |
Iran | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Perkins Loan Program | 1 |
Assessments and Surveys
SAT (College Admission Test) | 3 |
National Assessment of… | 2 |
Cornell Critical Thinking Test | 1 |
General Educational… | 1 |
New Jersey High School… | 1 |
Program for International… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
McGuire, Michael J. – International Journal for the Scholarship of Teaching and Learning, 2023
College students in a lower-division psychology course made metacognitive judgments by predicting and postdicting performance for true-false, multiple-choice, and fill-in-the-blank question sets on each of three exams. This study investigated which question format would result in the most accurate metacognitive judgments. Extending Koriat's (1997)…
Descriptors: Metacognition, Multiple Choice Tests, Accuracy, Test Format
Park, Yena; Lee, Senyung; Shin, Sun-Young – Language Testing, 2022
Despite consistent calls for authentic stimuli in listening tests for better construct representation, unscripted texts have been rarely adopted in high-stakes listening tests due to perceived inefficiency. This study details how a local academic listening test was developed using authentic unscripted audio-visual texts from the local target…
Descriptors: Listening Comprehension Tests, English for Academic Purposes, Test Construction, Foreign Students
Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022
The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…
Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise
Remizova, Alisa; Rudnev, Maksim – International Journal of Social Research Methodology, 2020
The justifiability scale (JS) is widely used to measure individual and country differences in moral attitudes. However, the validity of the instrument has been barely assessed. The current study addressed the concurrent and content validity of four popular JS items (justifiability of homosexuality, suicide, prostitution, and euthanasia). A sample…
Descriptors: Moral Values, Content Validity, Attitude Measures, Foreign Countries
Höhne, Jan Karem; Krebs, Dagmar – International Journal of Social Research Methodology, 2018
The effect of the response scale direction on response behavior is a well-known phenomenon in survey research. While there are several approaches to explaining how such response order effects occur, the literature reports mixed evidence. Furthermore, different question formats seem to vary in their susceptibility to these effects. We therefore…
Descriptors: Test Items, Response Style (Tests), Questioning Techniques, Questionnaires
Kaharu, Sarintan N.; Mansyur, Jusman – Pegem Journal of Education and Instruction, 2021
This study aims to develop a test that can be used to explore mental models and representation patterns of objects in liquid fluid. The test developed by adapting the Reeves's Development Model was carried out in several stages, namely: determining the orientation and test segments; initial survey; preparation of the initial draft; try out;…
Descriptors: Test Construction, Schemata (Cognition), Scientific Concepts, Water
Yanis, Hilal; Yürük, Nejla – Journal of Research on Technology in Education, 2021
The integration of technology into science teaching by pre-service science teachers and their self-efficacy in using technology in their teaching practices are important issues for science education. The purpose of this study is to develop an Educational Robotics Technological Pedagogical Content Knowledge (ER-TPACK) self-efficacy scale based on a…
Descriptors: Educational Technology, Technology Uses in Education, Technology Integration, Science Instruction
Lina Anaya; Nagore Iriberri; Pedro Rey-Biel; Gema Zamarro – Annenberg Institute for School Reform at Brown University, 2021
Standardized assessments are widely used to determine access to educational resources with important consequences for later economic outcomes in life. However, many design features of the tests themselves may lead to psychological reactions influencing performance. In particular, the level of difficulty of the earlier questions in a test may…
Descriptors: Test Construction, Test Wiseness, Test Items, Difficulty Level
Masrai, Ahmed – SAGE Open, 2022
Vocabulary size measures serve important functions, not only with respect to placing learners at appropriate levels on language courses but also with a view to examining the progress of learners. One of the widely reported formats suitable for these purposes is the Yes/No vocabulary test. The primary aim of this study was to introduce and provide…
Descriptors: Vocabulary Development, Language Tests, English (Second Language), Second Language Learning
Fukuzawa, Sherry; deBraga, Michael – Journal of Curriculum and Teaching, 2019
Graded Response Method (GRM) is an alternative to multiple-choice testing where students rank options according to their relevance to the question. GRM requires discrimination and inference between statements and is a cost-effective critical thinking assessment in large courses where open-ended answers are not feasible. This study examined…
Descriptors: Alternative Assessment, Multiple Choice Tests, Test Items, Test Format
Wright, Christian D.; Huang, Austin L.; Cooper, Katelyn M.; Brownell, Sara E. – International Journal for the Scholarship of Teaching and Learning, 2018
College instructors in the United States usually make their own decisions about how to design course exams. Even though summative course exams are well known to be important to student success, we know little about the decision making of instructors when designing course exams. To probe how instructors design exams for introductory biology, we…
Descriptors: College Faculty, Science Teachers, Science Tests, Teacher Made Tests
Kiliçkaya, Ferit – Online Submission, 2019
The current study aims to determine the effect of multiple-choice, matching, gap-fill and word formation items used in assessing L2 vocabulary on learners' performance and to obtain the learners' views regarding the use of these types of items in vocabulary assessment. The convenience sampling method was selected, and the participants of the study…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Multiple Choice Tests
Aksakalli, Ayhan; Turgut, Umit; Salar, Riza – Journal of Education and Practice, 2016
The purpose of this study is to investigate whether students are more successful on abstract or illustrated test questions. To this end, the questions on an abstract test were changed into a visual format, and these tests were administered every three days to a total of 240 students at six middle schools located in the Erzurum city center and…
Descriptors: Comparative Analysis, Scores, Middle School Students, Grade 8
Carnegie, Jacqueline A. – Canadian Journal for the Scholarship of Teaching and Learning, 2017
Summative evaluation for large classes of first- and second-year undergraduate courses often involves the use of multiple choice question (MCQ) exams in order to provide timely feedback. Several versions of those exams are often prepared via computer-based question scrambling in an effort to deter cheating. An important parameter to consider when…
Descriptors: Undergraduate Students, Student Evaluation, Multiple Choice Tests, Test Format
Bokyoung Park – English Teaching, 2017
This study investigated Korean college students' performance as measured by two different vocabulary assessment tools (the Productive Vocabulary Levels Test (PVLT) and the Productive Vocabulary Use Task (PVUT)) and the relationship these assessments have with students' writing proficiency. A total of 72 students participated in the study. The…
Descriptors: Foreign Countries, Vocabulary Development, Language Tests, Second Language Learning