Publication Date
In 2025 | 31 |
Since 2024 | 125 |
Since 2021 (last 5 years) | 464 |
Since 2016 (last 10 years) | 869 |
Since 2006 (last 20 years) | 1349 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 195 |
Teachers | 159 |
Researchers | 92 |
Administrators | 49 |
Students | 34 |
Policymakers | 14 |
Parents | 12 |
Counselors | 2 |
Community | 1 |
Media Staff | 1 |
Support Staff | 1 |
More ▼ |
Location
Canada | 62 |
Turkey | 59 |
Germany | 40 |
United Kingdom | 36 |
Australia | 35 |
Japan | 34 |
China | 32 |
United States | 32 |
California | 25 |
United Kingdom (England) | 25 |
Netherlands | 24 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Morphew, Jason W.; Silva, Mariana; Herman, Geoffrey; West, Matthew – Applied Cognitive Psychology, 2020
Laboratory studies have routinely demonstrated that testing often leads to greater learning and retention than repeated studying. In the classroom, this effect has been replicated with memory and application tasks. However, studies of classrooms involving mathematical problem solving are sparse and have had mixed results. This paper presents the…
Descriptors: Mastery Learning, Testing, Undergraduate Students, Engineering Education
Tkatchov, Mary; Hugus, Erin; Barnes, Richard – Journal of Competency-Based Education, 2020
Redundancy in assessment adds unnecessary time to degree completion, which also increases the cost of tuition. In addition, assessment practices that are overly burdensome for faculty can also place too much of a financial burden on an institution and, ultimately, the students. Therefore, competency-based education (CBE) institutions are wise to…
Descriptors: Competency Based Education, Higher Education, Standards, Student Evaluation
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020
Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…
Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making
Kurnaz-Adibatmaz, Fatma Betül; Yildiz, Hüseyin – Journal of Theoretical Educational Science, 2020
In this study logistic regression and Lord's Chi Square methods were used to research the items that have DIF. The study utilized Peabody Picture Vocabulary Test (PPVT). The original form of the PPVT includes four options. Three different forms (A, B and C) were formed by removing one of the distractors respectively. The original form of PPVT was…
Descriptors: Item Analysis, Test Items, Vocabulary, Verbal Ability
Ippel, Lianne; Magis, David – Educational and Psychological Measurement, 2020
In dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic to evaluate the precision of various ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned and new ASE formulas were derived from a general…
Descriptors: Item Response Theory, Error of Measurement, Accuracy, Standards
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
NWEA, 2022
This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…
Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency
Jeffrey Martin – Vocabulary Learning and Instruction, 2022
The functioning of a vocabulary testing instrument rests in part on the test-taking actions made possible for examinees by item format, an aspect of test development that warrants consideration in second-language vocabulary research. For example, although iterations of the written receptive vocabulary levels test (VLT) have integrated improvements…
Descriptors: Test Wiseness, Vocabulary, Vocabulary Development, Second Language Learning
Papenberg, Martin; Diedenhofen, Birk; Musch, Jochen – Journal of Experimental Education, 2021
Testwiseness may introduce construct-irrelevant variance to multiple-choice test scores. Presenting response options sequentially has been proposed as a potential solution to this problem. In an experimental validation, we determined the psychometric properties of a test based on the sequential presentation of response options. We created a strong…
Descriptors: Test Wiseness, Test Validity, Test Reliability, Multiple Choice Tests
Falani, Ilham; Akbar, Maruf; Naga, Dali S. – International Journal of Instruction, 2020
This study compared the precision of ability estimation on different types of item response theory models for mixed-format data. Participants in this study were 1625 Junior High School Students in Depok, Indonesia. The mixed-format test was used to measure the students' ability in mathematics. The test used consists of multiple-choice and…
Descriptors: Foreign Countries, Junior High School Students, Ability, Item Response Theory
Erduran, Sibel; El Masri, Yasmine; Cullinane, Alison; Ng, Y. P. D. – International Journal of Science Education, 2020
High stakes examinations can have profound implications for how science is taught and learned. Limitations of school science such as the 'cookbook problem' can potentially be addressed if high stakes assessments target learning outcomes that are innovative. For example, less mindless procedural engagement and more thoughtful consideration of…
Descriptors: Science Tests, High Stakes Tests, Achievement Tests, Foreign Countries
Akçay, Ahmet; Tunagür, Muhammed; Karabulut, Ahmet – International Journal of Education and Literacy Studies, 2020
This study aims to examine Turkish exam papers of the students, who study in the secondary school of 5th, 6th, 7th and 8th classes. The exam papers have been examined from various aspects, including the number and type of questions, the language expression and distribution of the questions, the cognitive level (according to the Bloom's taxonomy),…
Descriptors: Foreign Countries, Middle School Teachers, Middle School Students, Grade 5
Steiner, Martina; van Loon, Mariëtte H.; Bayard, Natalie S.; Roebers, Claudia M. – Metacognition and Learning, 2020
This study investigated elementary school children's development of monitoring and control when learning from texts. Second (N = 138) and fourth (N = 164) graders were tested in the middle (T[subscript 1]) and end (T[subscript 2]) of the school year. The study focused on the cross-sectional and longitudinal development of monitoring and control,…
Descriptors: Age Differences, Test Format, Children, Elementary School Students
Höhne, Jan Karem – International Journal of Social Research Methodology, 2019
In social research, the use of agree/disagree (A/D) questions is a popular method for measuring attitudes. Research has shown that A/D questions require complex cognitive processing and are susceptible to response bias. Thus, some researchers recommend the use of item-specific (IS) questions. This study examines the processing of A/D and IS…
Descriptors: Eye Movements, Social Science Research, Research Methodology, Online Surveys