Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Moon, Jung Aa; Sinharay, Sandip; Keehner, Madeleine; Katz, Irvin R. – International Journal of Testing, 2020
The current study examined the relationship between test-taker cognition and psychometric item properties in multiple-selection multiple-choice and grid items. In a study with content-equivalent mathematics items in alternative item formats, adult participants' tendency to respond to an item was affected by the presence of a grid and variations of…
Descriptors: Computer Assisted Testing, Multiple Choice Tests, Test Wiseness, Psychometrics
Suto, Irenka; Greatorex, Jackie; Vitello, Sylvia; Child, Simon – Research Matters, 2020
Educational taxonomies are classification schemes which provide the terminology that educationalists need to describe and work with different areas of knowledge. It is good practice to use taxonomies to formulate and review curricula, learning objectives, and associated assessments. Demonstrating sufficient coverage of each of an adequate range of…
Descriptors: Taxonomy, Secondary School Curriculum, College Curriculum, Educational Objectives
Strait, Julia Englund; Dawson, Peg; Walther, Christine A. P.; Strait, Gerald Gill; Barton, Amy K.; Brunson McClain, Maryellen – Contemporary School Psychology, 2020
Executive functioning (EF) skills are vital for academic success. Along with the recent explosion of interventions targeting these skills comes the need for affordable, efficient, and ecologically valid measures for planning and tailoring interventions and monitoring outcomes. The current study describes the refinement and initial psychometric…
Descriptors: Executive Function, Questionnaires, Rating Scales, Test Items
Villarroel, Verónica; Boud, David; Bloxham, Susan; Bruna, Daniela; Bruna, Carola – Innovations in Education and Teaching International, 2020
Tests and examinations are widely used internationally. Despite their pervasiveness, they tend to measure lower order thinking skills in a decontextualized manner at a time when the literature frequently argues for the benefits of a richer, authentic approach to assessment. The focus of this paper is to improve authenticity in test assessment…
Descriptors: Performance Based Assessment, Testing, Tests, Test Construction
Jimoh, Mohammed Idris; Daramola, Dorcas Sola; Oladele, Jumoke Iyabode; Sheu, Adaramaja Lukman – Anatolian Journal of Education, 2020
The study investigated items that were prone to guessing in Senior School Certificate Examinations (SSCE) Economics multiple-choice tests among students in Kwara State, Nigeria. The 2016 West African Senior Secondary Certificate Examinations (WASSCE) and National Examinations Council (NECO) Economics multiple-choice test items were subjected to…
Descriptors: Foreign Countries, High School Students, Guessing (Tests), Test Items
Remizova, Alisa; Rudnev, Maksim – International Journal of Social Research Methodology, 2020
The justifiability scale (JS) is widely used to measure individual and country differences in moral attitudes. However, the validity of the instrument has been barely assessed. The current study addressed the concurrent and content validity of four popular JS items (justifiability of homosexuality, suicide, prostitution, and euthanasia). A sample…
Descriptors: Moral Values, Content Validity, Attitude Measures, Foreign Countries
Rao, Dhawaleswar; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2020
Automatic multiple choice question (MCQ) generation from a text is a popular research area. MCQs are widely accepted for large-scale assessment in various domains and applications. However, manual generation of MCQs is expensive and time-consuming. Therefore, researchers have been attracted toward automatic MCQ generation since the late 90's.…
Descriptors: Multiple Choice Tests, Test Construction, Automation, Computer Software
Tutar, Miyase; Karamustafaoglu, Orhan – International Journal of Curriculum and Instruction, 2020
This study is aimed to develop an adaptation test to determine the adaptation level of the 6th grade students about the concepts and topics to their daily life included in the unit ''Systems and Health in Our Body''. The test items were prepared by considering the reasons of the problems that they can encounter in their daily life about the…
Descriptors: Test Construction, Grade 6, Human Body, Health Education
Susanti, Yuni; Tokunaga, Takenobu; Nishikawa, Hitoshi – Research and Practice in Technology Enhanced Learning, 2020
The present study focuses on the integration of an automatic question generation (AQG) system and a computerised adaptive test (CAT). We conducted two experiments. In the first experiment, we administered sets of questions to English learners to gather their responses. We further used their responses in the second experiment, which is a…
Descriptors: Computer Assisted Testing, Test Items, Simulation, English Language Learners
Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020
In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al (2020) describes a method…
Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level
Sinharay, Sandip; van Rijn, Peter – Grantee Submission, 2020
Response-time models are of increasing interest in educational and psychological testing. This paper focuses on the lognormal model for response times (van der Linden, 2006), which is one of the most popular response-time models. Several existing statistics for testing normality and the fit of factor-analysis models are repurposed for testing the…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Factor Analysis
Guerreiro, Meg A.; Barker, Elizabeth; Johnson, Janice Lee – AERA Online Paper Repository, 2020
This paper aims to explore the incorporation of embedding items within reading passages as an effort to improve assessment equity, student experience and performance, and engagement within a universal design framework. Reading comprehension items placed within text rather than at the end may remove measurement of confounding constructs such as…
Descriptors: Reading Comprehension, Grade 3, Elementary School Students, Measurement Techniques
Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020
Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…
Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics
Bulut, Okan; Bulut, Hatice Cigdem; Cormier, Damien C.; Ilgun Dibek, Munevver; Sahin Kursad, Merve – Educational Assessment, 2023
Some statewide testing programs allow students to receive corrective feedback and revise their answers during testing. Despite its pedagogical benefits, the effects of providing revision opportunities remain unknown in the context of alternate assessments. Therefore, this study examined student data from a large-scale alternate assessment that…
Descriptors: Error Correction, Alternative Assessment, Feedback (Response), Multiple Choice Tests
Kuang, Huan; Sahin, Fusun – Large-scale Assessments in Education, 2023
Background: Examinees may not make enough effort when responding to test items if the assessment has no consequence for them. These disengaged responses can be problematic in low-stakes, large-scale assessments because they can bias item parameter estimates. However, the amount of bias, and whether this bias is similar across administrations, is…
Descriptors: Test Items, Comparative Analysis, Mathematics Tests, Reaction Time

Peer reviewed
Direct link
