Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Gurdil Ege, Hatice; Demir, Ergul – Eurasian Journal of Educational Research, 2020
Purpose: The present study aims to evaluate how the reliabilities computed using a, Stratified a, Angoff-Feldt, and Feldt-Raju estimators may differ when sample size (500, 1000, and 2000) and item type ratio of dichotomous to polytomous items (2:1; 1:1, 1:2) included in the scale are varied. Research Methods: In this study, Cronbach's a,…
Descriptors: Test Format, Simulation, Test Reliability, Sample Size
Holme, Thomas A.; Bauer, Christopher; Trate, Jaclyn M.; Reed, Jessica J.; Raker, Jeffrey R.; Murphy, Kristen L. – Journal of Chemical Education, 2020
The American Chemical Society, Division of Chemical Education, Examinations Institute has been developing content maps for the undergraduate program based on subdiscipline specifications since 2008. The Anchoring Concepts Content Maps (or ACCM) have been published in four subdisciplines (general, organic, physical, and inorganic chemistry) with…
Descriptors: Undergraduate Students, Chemistry, Scientific Concepts, Concept Mapping
Cole, Brian S.; Lima-Walton, Elia; Brunnert, Kim; Vesey, Winona Burt; Raha, Kaushik – Journal of Applied Testing Technology, 2020
Automatic item generation can rapidly generate large volumes of exam items, but this creates challenges for assembly of exams which aim to include syntactically diverse items. First, we demonstrate a diminishing marginal syntactic return for automatic item generation using a saturation detection approach. This analysis can help users of automatic…
Descriptors: Artificial Intelligence, Automation, Test Construction, Test Items
Krach, Shelley Kathleen; McCreery, Michael P.; Dennis, Lindsay; Guerard, Jessika; Harris, Erica L. – Psychology in the Schools, 2020
Pearson now uses a technology-based testing platform, Q-Interactive, to administer tests previously available in paper versions. The same norms are used for both versions; Pearson's in-house equivalency studies indicated that both versions are equated. The goal of the current study is to independently evaluate equivalency findings. For the current…
Descriptors: Preschool Children, Computer Assisted Testing, Test Items, Scores
Ismail, Yilmaz – Educational Research and Reviews, 2020
This study draws on the understanding that when the correlation between variables is not known yet the non-linear expectation in the correlation between the variables is present, non-linear measurement tools can be used. In education, possibility measurement tools can be used for non-linear measurement. Multiple-choice possibility measurement…
Descriptors: Multiple Choice Tests, Measurement Techniques, Student Evaluation, Test Items
Derek Sauder – ProQuest LLC, 2020
The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…
Descriptors: Item Response Theory, Sample Size, Computation, Test Length
Chengran Wang; Bing Wei – Physical Review Physics Education Research, 2024
The notion of scientific visual literacy has been advocated in recent science curriculum reform documents and related learning outcomes are expected from students. However, few studies have been conducted to determine how it is tested in high-stakes examinations. This study utilized the Visualization Blooming Tool to examine the level of visual…
Descriptors: Physics, Scientific Literacy, Science Tests, Thinking Skills
Amber Dudley; Emma Marsden; Giulia Bovolenta – Language Testing, 2024
Vocabulary knowledge strongly predicts second language reading, listening, writing, and speaking. Yet, few tests have been developed to assess vocabulary knowledge in French. The primary aim of this pilot study was to design and initially validate the Context-Aligned Two Thousand Test (CA-TTT), following open research practices. The CA-TTT is a…
Descriptors: French, Vocabulary Development, Secondary School Students, Language Tests
Yi-Chun Chen; Hsin-Kai Wu; Ching-Ting Hsin – Research in Science & Technological Education, 2024
Background and Purpose: As a growing number of instructional units have been developed to promote young children's scientific and engineering practices (SEPs), understanding how to evaluate and assess children's SEPs is imperative. However, paper-and-pencil assessments would not be suitable for young children because of their limited reading and…
Descriptors: Science Education, Engineering Education, Elementary School Students, Middle School Students
Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024
Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…
Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions
Kokou A. Atitsogbe; Jean-Luc Bernaud – International Journal for Educational and Vocational Guidance, 2024
This manuscript aimed to develop an instrument assessing vocational values among students (VVS-S). The scale was developed in French using three different samples of Togolese participants for item development (N = 140), exploratory (N = 308) and confirmatory analyses (N = 300). It consists of 17 items divided into the five subscales of Power,…
Descriptors: Vocational Interests, Values, Measures (Individuals), Test Construction
Philomina Abena Anyidoho; Rebecca Berenbon; Bridget McHugh – International Journal of Training and Development, 2024
Many workforce development training programmes use learning gains as a measure of programme effectiveness. However, research on K-12 education suggests that posttest scores may be influenced by pretesting effects. Pretesting may improve posttest performance by giving learners preknowledge of posttest content. Alternatively, pretesting may enhance…
Descriptors: Trainees, Trainers, Labor Force Development, High Stakes Tests
Deborah Rivas-Drake; Jozet Channey; Gina McGovern; Bernardette J. Pinetta – AERA Open, 2024
This article delineates the development of a measure to assess teachers' reported engagement in practices that center on issues of racial equity as part of their SEL instruction. An iterative mixed-method approach included theoretical grounding, literature reviews, content expert evaluation, focus groups, cognitive interviews, and multiple survey…
Descriptors: Equal Education, Social Emotional Learning, Measures (Individuals), Test Construction
Stephen Adofo; Sirpa Kärkkäinen; Tuula Keinonen – Science Education International, 2024
This study examined integrated science questions of the Basic Education Certificate Examination (BECE) in Ghana at the end of grade 9. Results from BECE determine which senior high school a student can attend. Science questions (n = 751) over the span of eight years were analyzed in this study and viewed under the lens of the revised Bloom's…
Descriptors: Secondary School Science, Science Tests, Foreign Countries, Grade 9
Alicia A. Stoltenberg – ProQuest LLC, 2024
Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…
Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests

Peer reviewed
Direct link
