Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 9 |
| Since 2017 (last 10 years) | 18 |
| Since 2007 (last 20 years) | 30 |
Descriptor
| Educational Assessment | 98 |
| Test Reliability | 98 |
| Test Validity | 58 |
| Test Construction | 26 |
| Elementary Secondary Education | 24 |
| Evaluation Methods | 20 |
| Foreign Countries | 19 |
| State Programs | 18 |
| Scores | 16 |
| Achievement Tests | 15 |
| Student Evaluation | 14 |
Author
| Burton, Nancy W. | 2 |
| Crowley, Susan L. | 2 |
| Floyd, Randy G. | 2 |
| Parker, Richard | 2 |
| Russell, Nolan F. | 2 |
| Tindal, Gerald | 2 |
| Albano, Anthony D. | 1 |
| Alfonso, Vincent C. | 1 |
| Almond, Pat | 1 |
| Alspaugh, John W. | 1 |
| Anderson, Paul S. | 1 |
Publication Type
| Reports - Research | 98 |
| Journal Articles | 41 |
| Speeches/Meeting Papers | 22 |
| Tests/Questionnaires | 8 |
| Information Analyses | 3 |
| Numerical/Quantitative Data | 3 |
| Reports - Descriptive | 1 |
| Reports - Evaluative | 1 |
Audience
| Researchers | 3 |
| Policymakers | 1 |
| Practitioners | 1 |
| Teachers | 1 |
Location
| Canada | 4 |
| Florida | 2 |
| Illinois | 2 |
| Indonesia | 2 |
| Netherlands | 2 |
| Pennsylvania | 2 |
| South Africa | 2 |
| South Carolina | 2 |
| Alaska | 1 |
| Arizona (Phoenix) | 1 |
| Brazil | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Elementary and Secondary… | 1 |
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
Roberto Brazileio Paixão; Michael C. Rodriguez – Educational Research and Evaluation, 2023
The usefulness of evaluation is critical. Evaluation use occurs when, from its results or process, decisions are made about the program, it changes people's mindsets, or persuasive or legitimation actions happen (instrumental, conceptual, and symbolic uses respectively). Few quantitative evaluation use studies have been conducted in recent years.…
Descriptors: Measures (Individuals), College Faculty, Test Validity, Test Reliability
Mei-Ju Chen; Chao-Yu Guo; Li-Ting Tseng; Ming-Yi Chiu; Hsuan-Jui Weng – Psychology in the Schools, 2025
An interdisciplinary curriculum provides integration of students' subjects and lives, guiding students to understand and engage with changes in the world through real situations and problems, with teachers playing a critical role. However, there is no assessment tool to explore teachers' interdisciplinary role perceptions (TIRP). In this study, an…
Descriptors: Teacher Role, Role Perception, Interdisciplinary Approach, Attitude Measures
Judy R. Wilkerson; W. Steve Lang; LaSonya Moore – Journal of Research in Education, 2025
The DAATS (Dispositions Assessments Aligned with Teacher Standards) battery is a series of five instruments of different item types that measure teachers' consistency with the critical dispositions embedded in the InTASC Standards. The purpose of this study was to continue a 20-year research project on the development and implementation of…
Descriptors: Educational Assessment, National Standards, Teacher Evaluation, Teacher Competencies
Jeffrey Shero; Jessica Logan – Society for Research on Educational Effectiveness, 2024
Background/Context: Previous research in educational assessment has consistently emphasized the importance of reliability as a cornerstone of test quality. Traditional measures of reliability, such as test-retest and split-half reliability, offer a broad view of how internally consistent a measure is but overlook the variability in this internal…
Descriptors: Educational Assessment, Special Education, Students with Disabilities, Learning Disabilities
Wang, Yu; Chiu, Chia-Yi; Köhn, Hans Friedrich – Journal of Educational and Behavioral Statistics, 2023
The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format…
Descriptors: Multiple Choice Tests, Nonparametric Statistics, Test Format, Educational Assessment
Begicevic Redjep, Nina; Balaban, Igor; Zugec, Bojan – Technology, Pedagogy and Education, 2021
The European Commission emphasises the need for educational institutions to integrate digital technologies in their teaching, learning and organisational practices. This study contributes to the field of digital transformation of schools by proposing and validating a Framework for Digitally Mature Schools (FDMS) and an instrument for assessing the…
Descriptors: Technology Integration, Information Technology, Program Evaluation, Educational Assessment
Michael A. Cook; Steven M. Ross – Center for Research and Reform in Education, 2024
This study examined the effectiveness of Flashlight360 by continuing a retrospective, mixed-methods quasi-experimental design of ELLs in Grades 1-12 during the 2023-24 school year in a large western state school district. Outcome measures included composite, speaking, and writing achievement gains on the WIDA ACCESS assessment administered to…
Descriptors: Elementary Secondary Education, Suburban Schools, Hispanic American Students, Evidence Based Practice
Rinat Arviv Elyashiv; Orit Avidov-Ungar – Educational Review, 2024
Large-scale assessments have become a basic national policy for educational improvement encouraging standards, decentralisation and school accountability. The current study focuses on the pedagogical dimension of large-scale assessments, examining its uses as a policy instrument for effecting pedagogical change. The paper presents and discusses…
Descriptors: Teacher Attitudes, Educational Assessment, Educational Policy, National Competency Tests
Ika Zenita Ratnaningsih; Unika Prihatsanti; Anggun Resdasari Prasetyo; Bambang Sumintono – Journal of Applied Research in Higher Education, 2025
Purpose: The present study aimed to validate the Indonesian-language version of the psychological capital questionnaire (PCQ), specifically within the context of higher education, by utilising Rasch analysis to evaluate the reliability and validity aspect such as item-fit statistics, rating scale function, and differential item functioning of the…
Descriptors: Foreign Countries, Indonesian Languages, Test Validity, Psychological Characteristics
Sondergeld, Toni A.; Johnson, Carla C. – School Science and Mathematics, 2019
In response to the call for more rigorously validated educational assessments, this study used an iterative multimethod validation process to develop and validate outcomes from the 21st Century Skills Assessment global rating scale. Qualitative and quantitative data sources were used to inform four types of validity evidence: content, response…
Descriptors: 21st Century Skills, Test Construction, Test Validity, Educational Assessment
Floren, Michael; Hess, Chelsie; Sherman, Valerie J. H.; Sileo, Nancy M. – Journal of Educational Research and Innovation, 2020
The InTASC Candidate Self-Perception Instrument (ICSPI) is an innovative, high-quality educational measurement tool designed to support the assessment and accreditation efforts of a wide variety of educator preparation programs (EPP). The procedures used for the creation and refining of items for the ICSPI are presented, including empirical…
Descriptors: Higher Education, Accreditation (Institutions), Educational Assessment, Teacher Education Programs
Albano, Anthony D.; McConnell, Scott R.; Lease, Erin M.; Cai, Liuhan – Grantee Submission, 2020
Research has shown that the context of practice tasks can have a significant impact on learning, with long-term retention and transfer improving when tasks of different types are mixed by interleaving (abcabcabc) compared with grouping together in blocks (aaabbbccc). This study examines the influence of context via interleaving from a psychometric…
Descriptors: Context Effect, Test Items, Preschool Children, Computer Assisted Testing
Boston, Melissa D.; Candela, Amber G. – ZDM: The International Journal on Mathematics Education, 2018
The Instructional Quality Assessment (IQA) identifies the nature and quality of classroom instruction by considering students' opportunities to engage in cognitively demanding mathematical work and discussions. The IQA assesses ambitious mathematics instruction based on the following dimensions: potential of the task, task implementation, rigor of…
Descriptors: Mathematics Instruction, Educational Assessment, Educational Quality, Scoring Rubrics
van der Lans, Rikkert M.; Maulana, Ridwan; Helms-Lorenz, Michelle; Fernández-García, Carmen-María; Chun, Seyeoung; de Jager, Thelma; Irnidayanti, Yulia; Inda-Caro, Mercedes; Lee, Okhwa; Coetzee, Thys; Fadhilah, Nurul; Jeon, Meae; Moorer, Peter – SAGE Open, 2021
This study examines measurement invariance of student perceptions of teaching quality collected in five countries: Indonesia (n students = 6,331), the Netherlands (n students = 6,738), South Africa (n students = 3,422), South Korea (n students = 6,997) and Spain (n students = 4,676). The administered questionnaire was the My Teacher Questionnaire…
Descriptors: Foreign Countries, Student Attitudes, Student Evaluation of Teacher Performance, Teacher Effectiveness

