Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Cintron, Dakota W. – ETS Research Report Series, 2021
The extent to which a test's time limit alters a test taker's performance is known as speededness. The manifestation of speededness, or speeded behavior on a test, can be in the form of random guessing, leaving a substantial proportion of test items unanswered, or rushed test-taking behavior in general. Speeded responses do not depend solely on a…
Descriptors: Classification, Research and Development, Timed Tests, Guessing (Tests)
Lee, HyeSun; Smith, Weldon; Martinez, Angel; Ferris, Heather; Bova, Joe – Applied Measurement in Education, 2021
The aim of the current research was to provide recommendations to facilitate the development and use of anchoring vignettes (AVs) for cross-cultural comparisons in education. Study 1 identified six factors leading to order violations and ties in AV responses based on cognitive interviews with 15-year-old students. The factors were categorized into…
Descriptors: Vignettes, Test Items, Equated Scores, Nonparametric Statistics
Demir, Gönül Türkan – International Journal of Curriculum and Instruction, 2021
The study aimed to scrutinize the practice course exam and the exam questions in Darulmuallim- the first teacher training institution in Turkish education history. In the study, the nature of the theoretical basis of today's teaching practices was also considered by shedding light on the past. The study was carried out via document analysis…
Descriptors: Tests, Teacher Education Programs, Foreign Countries, Test Items
Feinberg, Richard A. – Educational Measurement: Issues and Practice, 2021
Unforeseen complications during the administration of large-scale testing programs are inevitable and can prevent examinees from accessing all test material. For classification tests in which the primary purpose is to yield a decision, such as a pass/fail result, the current study investigated a model-based standard error approach, Bayesian…
Descriptors: High Stakes Tests, Classification, Decision Making, Bayesian Statistics
Kosh, Audra E. – Journal of Applied Testing Technology, 2021
In recent years, Automatic Item Generation (AIG) has increasingly shifted from theoretical research to operational implementation, a shift raising some unforeseen practical challenges. Specifically, generating high-quality answer choices presents several challenges such as ensuring that answer choices blend in nicely together for all possible item…
Descriptors: Test Items, Multiple Choice Tests, Decision Making, Test Construction
Shogren, Karrie A.; Anderson, Mark H.; Raley, Sheida K.; Hagiwara, Mayumi – Exceptionality, 2021
The "Self-Determination Inventory" (SDI) is a suite of tools developed to measure self-determination. The SDI: Student Report was recently validated for adolescents aged 13 to 22 with and without disabilities across diverse racial/ethnic backgrounds. A parallel, proxy report version to be completed by teachers or parents, the SDI:…
Descriptors: Measures (Individuals), Self Determination, Students with Disabilities, Adolescents
Chowdhury, Pinaki – Online Submission, 2021
Collecting data on learners' performance in different chemistry contents and analysing them to identify their knowledge and understanding in related content areas is a major task of Chemistry Education Research. The data collection process on the learners' content knowledge and understanding of content knowledge requires a standard measuring tool.…
Descriptors: Data Collection, Standards, Chemistry, Scientific Concepts
Esdale, Ryan William – ProQuest LLC, 2021
As educators and school leaders work towards building students' capacity throughout a student's academic career, the goal is to facilitate opportunities for students to develop college and career readiness skills that can be applied to novel and challenging problems long after high school graduation. Hess' Cognitive Rigor Matrix (HCRM) is a tool…
Descriptors: Thinking Skills, High Schools, Aptitude Tests, High School Students
Stephanie M. Werner; Ying Chen; Mike Stieff – Journal of Chemical Education, 2021
The Chemistry Self-Concept Inventory (CSCI) is a widely used instrument within chemistry education research. Yet, agreement on its overall reliability and validity is lacking, and psychometric analyses of the instrument remain outstanding. This study examined the psychometric properties of the subscale and item function of the CSCI on 1140 high…
Descriptors: Self Concept Measures, Chemistry, Psychometrics, Item Response Theory
Huggins-Manley, Anne Corinne; Qiu, Yuxi; Penfield, Randall D. – International Journal of Testing, 2018
Score equity assessment (SEA) refers to an examination of population invariance of equating across two or more subpopulations of test examinees. Previous SEA studies have shown that score equity may be present for examinees scoring at particular test score ranges but absent for examinees scoring at other score ranges. No studies to date have…
Descriptors: Equated Scores, Test Bias, Test Items, Difficulty Level
Musa Adekunle Ayanwale; Jamiu Oluwadamilare Amusa; Adekunle Ibrahim Oladejo; Funmilayo Ayedun – Interchange: A Quarterly Review of Education, 2024
The study focuses on assessing the proficiency levels of higher education students, specifically the physics achievement test (PHY 101) at the National Open University of Nigeria (NOUN). This test, like others, evaluates various aspects of knowledge and skills simultaneously. However, relying on traditional models for such tests can result in…
Descriptors: Item Response Theory, Difficulty Level, Item Analysis, Test Items
Raoda Ismail; Heri Retnawati; Sugiman; Novita Intan Arovah; Okky Riswandha Imawan – Journal of Education and e-Learning Research, 2024
The success of a school centers on teachers' ability to create well-aligned educational tools. The Merdeka Belajar curriculum requires adapting assessments, including Higher-Order Thinking Skills (HOTS) questions for real-world problem-solving scenarios. This study explored contexts proposed by teachers in Papua for developing Mathematics HOTS…
Descriptors: Foreign Countries, Mathematics Education, Mathematics Curriculum, Curriculum Development
Muh. Fitrah; Anastasia Sofroniou; Ofianto; Loso Judijanto; Widihastuti – Journal of Education and e-Learning Research, 2024
This research uses Rasch model analysis to identify the reliability and separation index of an integrated mathematics test instrument with a cultural architecture structure in measuring students' mathematical thinking abilities. The study involved 357 students from six eighth-grade public junior high schools in Bima. The selection of schools was…
Descriptors: Mathematics Tests, Item Response Theory, Test Reliability, Indexes
Hong Jiao, Editor; Robert W. Lissitz, Editor – IAP - Information Age Publishing, Inc., 2024
With the exponential increase of digital assessment, different types of data in addition to item responses become available in the measurement process. One of the salient features in digital assessment is that process data can be easily collected. This non-conventional structured or unstructured data source may bring new perspectives to better…
Descriptors: Artificial Intelligence, Natural Language Processing, Psychometrics, Computer Assisted Testing
Peer reviewedSami Baral; Li Lucy; Ryan Knight; Alice Ng; Luca Soldaini; Neil T. Heffernan; Kyle Lo – Grantee Submission, 2024
In real-world settings, vision language models (VLMs) should robustly handle naturalistic, noisy visual content as well as domain-specific language and concepts. For example, K-12 educators using digital learning platforms may need to examine and provide feedback across many images of students' math work. To assess the potential of VLMs to support…
Descriptors: Visual Learning, Visual Perception, Natural Language Processing, Freehand Drawing

Direct link
