Publication Date
| In 2026 | 0 |
| Since 2025 | 190 |
| Since 2022 (last 5 years) | 1057 |
| Since 2017 (last 10 years) | 2567 |
| Since 2007 (last 20 years) | 4928 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peralta, Yadira; Aguilar-Rodriguez, Adriana; González Dávila, Osiel; Miranda, Alfonso – Journal of Psychoeducational Assessment, 2021
According to the literature, the use of the Berkeley Puppet Interview (BPI) to measure Big Five personality traits in children provides reliable and valid scores. However, the implementation of the BPI could be costly, especially when working with large sample sizes. Big Five self-reports were collected from 1118 Mexican children aged 7-8 years…
Descriptors: Personality Measures, Children, Test Reliability, Foreign Countries
Myszkowski, Nils; Storme, Martin – Journal of Creative Behavior, 2021
Fluency tasks are among the most common item formats for the assessment of certain cognitive abilities, such as verbal fluency or divergent thinking. A typical approach to the psychometric modeling of such tasks (e.g., "Intelligence," 2016, 57, 25) is the Rasch Poisson Counts Model (RPCM; "Probabilistic models for some intelligence…
Descriptors: Creative Thinking, Cognitive Measurement, Test Items, Difficulty Level
Hall, Matthew L.; Reidies, Jess A. – Journal of Deaf Studies and Deaf Education, 2021
We tested the utility of two standardized measures of receptive skills in American Sign Language (ASL) in hearing adults who are novice signers: the ASL Comprehension Test (ASL-CT; Hauser, P. C., Paludneviciene, R., Riddle, W., Kurz, K. B., Emmorey, K., & Contreras, J. (2016). American Sign Language Comprehension Test: A tool for sign language…
Descriptors: American Sign Language, Receptive Language, Novices, Adults
Ha, Hung Tan – Language Testing in Asia, 2021
The Listening Vocabulary Levels Test (LVLT) created by McLean et al. (Language Teaching Research 19:741-760, 2015) filled an important gap in the field of second language assessment by introducing an instrument for the measurement of phonological vocabulary knowledge. However, few attempts have been made to provide further validity evidence for the…
Descriptors: Vocabulary, Vietnamese, Test Validity, Test Items
Sinharay, Sandip – Grantee Submission, 2021
Drasgow, Levine, and Zickar (1996) suggested a statistic based on the Neyman-Pearson lemma (e.g., Lehmann & Romano, 2005, p. 60) for detecting preknowledge on a known set of items. The statistic is a special case of the optimal appropriateness indices of Levine and Drasgow (1988) and is the most powerful statistic for detecting item…
Descriptors: Robustness (Statistics), Hypothesis Testing, Statistics, Test Items
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022
Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests--a Z test, likelihood ratio test, and score ratio index--have demonstrated desirable statistical properties in this context, including low false positive rates and high…
Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation
Wittmann, Eveline; Weyland, Ulrike; Seeber, Susan; Warwas, Julia; Strikovic, Aldin; Krebs, Philine; Pohley, Monja; Wilczek, Larissa – Empirical Research in Vocational Education and Training, 2022
The identification of effects of vocational education and training conditions on competence development in nursing education requires longitudinal studies. An important precondition is the availability of a test of nursing competence which is economical in use, measures a homogeneous construct throughout years of nursing education and across…
Descriptors: Nursing Education, Computer Assisted Testing, Nursing Students, Competence
Dmoshinskaia, Natasha; Gijlers, Hannie; de Jong, Ton – Journal of Science Education and Technology, 2022
Giving feedback to peers can be a valuable learning experience for a feedback provider. However, different types of products require different types of feedback, which, in turn, may lead to different learning outcomes. The current study investigates the effect on the learning of feedback providers of reviewing different types of products.…
Descriptors: Peer Evaluation, Feedback (Response), Concept Mapping, Test Items
Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022
Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…
Descriptors: College Students, Student Evaluation, Tests, Test Items
Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022
Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…
Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring
Hilliard, Airlie; Kazim, Emre; Bitsakis, Theodoros; Leutner, Franziska – Journal of Intelligence, 2022
Selection methods are commonly used in talent acquisition to predict future job performance and to find the best candidates, but questionnaire-based assessments can be lengthy and lead to candidate fatigue and poor engagement, affecting completion rates and producing poor data. Gamification can mitigate some of these issues through greater…
Descriptors: Personality Measures, Personality Traits, Gamification, Imagery
Sadoughi, Majid; Hejazi, S. Yahya – Language Testing in Asia, 2022
Teacher support, as an essential type of social support and an important antecedent of many key outcomes in L2 learning, can significantly contribute to foreign language achievement. Although teacher support has received considerable attention in education and educational psychology, it has drawn scanty attention in foreign language and applied…
Descriptors: Test Construction, English (Second Language), Second Language Instruction, Language Teachers
An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure
Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022
This study aims to analyze the differential bundle functioning in multidimensional tests with a specific purpose to detect this effect through differentiating the location of the item with DIF in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…
Descriptors: Correlation, Sample Size, Test Items, Item Analysis
Green, Clare; Hughes, Sarah – Cambridge University Press & Assessment, 2022
The Digital High Stakes Assessment Programme in Cambridge University Press & Assessment is developing digital assessments for UK and global teachers and learners. In one development, the team are making decisions about the assessment models to use to assess computing systems knowledge and understanding. This research took place as part of the…
Descriptors: Test Items, Computer Science, Achievement Tests, Objective Tests

