Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 68 |
| Since 2017 (last 10 years) | 169 |
| Since 2007 (last 20 years) | 391 |
Descriptor
Source
Author
| Sireci, Stephen G. | 9 |
| Kitao, Kenji | 4 |
| Kitao, S. Kathleen | 4 |
| Papageorgiou, Spiros | 4 |
| Thurlow, Martha L. | 4 |
| Winnick, Joseph P. | 4 |
| van der Linden, Wim J. | 4 |
| Chang, Hua-Hua | 3 |
| Donovan, Jenny | 3 |
| Ewing, Maureen | 3 |
| Hau, Kit-Tai | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Teachers | 68 |
| Practitioners | 59 |
| Administrators | 20 |
| Students | 15 |
| Policymakers | 9 |
| Researchers | 7 |
| Parents | 6 |
| Counselors | 3 |
| Community | 2 |
| Support Staff | 1 |
Location
| Australia | 18 |
| California | 15 |
| Canada | 14 |
| China | 13 |
| United States | 12 |
| Massachusetts | 9 |
| United Kingdom | 9 |
| Europe | 8 |
| Georgia | 8 |
| Japan | 8 |
| Rhode Island | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing
Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi – International Journal of Evaluation and Research in Education, 2016
High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Statistical Analysis
Martin-Raugh, Michelle P.; Reese, Clyde M.; Howell, Heather; Tannenbaum, Richard J.; Steinberg, Jonathan H.; Xu, Jun – Educational Testing Service, 2016
The purpose of this report is to explore the content-related validity evidence supporting the mathematics components of the "ETS"® National Observational Teaching Exam (NOTE) assessment series, a kindergarten through 6th grade teacher licensure assessment. To establish the content knowledge required for the effective teaching of…
Descriptors: Teacher Certification, Teacher Evaluation, Elementary School Teachers, Teacher Competencies
National Assessment Governing Board, 2014
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. Results of these periodic assessments, produced in print and web-based formats, provide valuable information to a wide variety of audiences. They inform citizens about the nature of students' comprehension of the…
Descriptors: National Competency Tests, Mathematics Achievement, Mathematics Skills, Grade 4
Arjoon, Janelle A.; Xu, Xiaoying; Lewis, Jennifer E. – Journal of Chemical Education, 2013
Many of the instruments developed for research use by the chemistry
education community are relatively new. Because psychometric evidence dictates the validity of interpretations made from test scores, gathering and reporting validity and reliability evidence is of utmost importance. Therefore, the purpose of this study was to investigate what…
Descriptors: Science Instruction, Measurement Techniques, Psychometrics, Evidence
Emery, Henry John – Language Assessment Quarterly, 2014
The proceedings of the first Language Testing Forum in 1980 were published in "ELT Documents 111: Issues in Language Testing" (Alderson & Hughes, 1981). Discussants at the 1980 Forum raised a number questions on Language for Specific Purposes (LSP) testing relating, notably, to test specificity, test content, the relationship between…
Descriptors: English for Special Purposes, Aviation Education, Second Language Instruction, Second Language Learning
Bermundo, Cesar B.; Bermundo, Alex B.; Ballester, Rex C. – Australian Association for Research in Education (NJ1), 2012
iBank is a project that utilizes a software to create an item Bank that store quality questions, generate test and print exam. The items are from analyze teacher-constructed test questions that provides the basis for discussing test results, by determining why a test item is or not discriminating between the better and poorer students, and by…
Descriptors: Test Items, Computer Software, Test Results, Test Construction
McGrew, Sean – ProQuest LLC, 2012
This study examines teacher and student talk about tests and test data in a bilingual charter elementary school over two academic years. Considering tests as conveying information, the analysis proposes categories to distinguish different kinds and uses of test information. Kinds of test information include that conveyed by the existence of the…
Descriptors: Bilingualism, Classroom Communication, Charter Schools, Elementary School Students
Adams, Raymond J.; Lietz, Petra; Berezner, Alla – Large-scale Assessments in Education, 2013
Background: While rotated test booklets have been employed in large-scale assessments to increase the content coverage of the assessments, rotation has not yet been applied to the context questionnaires administered to respondents. Methods: This paper describes the development of a methodology that uses rotated context questionnaires in…
Descriptors: Questionnaires, Item Response Theory, Foreign Countries, Achievement Tests
Rhode Island Department of Education, 2014
The purpose of this Guidebook is to describe the process and basic requirements for the student learning measures that are used as part of the building administrator evaluation and support process. For aspects of the process that have room for flexibility and school/district-level discretion, the different options have been clearly separated and…
Descriptors: Guides, Student Evaluation, Evaluation Methods, Public Schools
Rhode Island Department of Education, 2014
The purpose of this Guidebook is to describe the process and basic requirements for the student learning measures that are used as part of the support professional evaluation and support process. For aspects of the process that have room for flexibility and school/district-level discretion, the different options have been clearly separated and…
Descriptors: Guides, Student Evaluation, Evaluation Methods, Public Schools
Burdman, Pamela – Policy Analysis for California Education, PACE, 2015
There is growing concern that the remedial math courses taken by most community college students unnecessarily divert some students from earning a degree. Anecdotes of students who thought they had completed their math requirements in high school only to have remedial courses delay their progress through college are common. In addition, research…
Descriptors: Remedial Instruction, Educational Change, Student Placement, Educational Policy
Rhode Island Department of Education, 2015
Rhode Island is committed to ensuring that all educators receive fair, accurate, and meaningful educator evaluations that provide information that can help all teachers improve and refine their practice. This commitment is an outgrowth of the state's recognition of the influence teachers have on student growth and achievement. Currently, districts…
Descriptors: Guides, Evaluation Methods, Public Schools, Educational Objectives
Crotts, Katrina; Sireci, Stephen G.; Zenisky, April – Journal of Applied Testing Technology, 2012
Validity evidence based on test content is important for educational tests to demonstrate the degree to which they fulfill their purposes. Most content validity studies involve subject matter experts (SMEs) who rate items that comprise a test form. In computerized-adaptive testing, examinees take different sets of items and test "forms"…
Descriptors: Computer Assisted Testing, Adaptive Testing, Content Validity, Test Content
Breakstone, Joel – Theory and Research in Social Education, 2014
This article considers the design process for new formative history assessments. Over the course of 3 years, my colleagues from the Stanford History Education Group and I designed, piloted, and revised dozens of "History Assessments of Thinking" (HATs). As we created HATs, we sought to gather information about their cognitive validity,…
Descriptors: History Instruction, Formative Evaluation, Tests, Correlation

Peer reviewed
Direct link
