Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 68 |
| Since 2017 (last 10 years) | 169 |
| Since 2007 (last 20 years) | 391 |
Descriptor
| Test Content | 826 |
| Test Construction | 284 |
| Test Items | 264 |
| Test Validity | 189 |
| Foreign Countries | 168 |
| Test Format | 157 |
| Student Evaluation | 138 |
| Test Reliability | 136 |
| Elementary Secondary Education | 125 |
| Testing | 111 |
| Standardized Tests | 105 |
Author
| Sireci, Stephen G. | 9 |
| Kitao, Kenji | 4 |
| Kitao, S. Kathleen | 4 |
| Papageorgiou, Spiros | 4 |
| Thurlow, Martha L. | 4 |
| Winnick, Joseph P. | 4 |
| van der Linden, Wim J. | 4 |
| Chang, Hua-Hua | 3 |
| Donovan, Jenny | 3 |
| Ewing, Maureen | 3 |
| Hau, Kit-Tai | 3 |
Audience
| Teachers | 68 |
| Practitioners | 59 |
| Administrators | 20 |
| Students | 15 |
| Policymakers | 9 |
| Researchers | 7 |
| Parents | 6 |
| Counselors | 3 |
| Community | 2 |
| Support Staff | 1 |
Location
| Australia | 18 |
| California | 15 |
| Canada | 14 |
| China | 13 |
| United States | 12 |
| Massachusetts | 9 |
| United Kingdom | 9 |
| Europe | 8 |
| Georgia | 8 |
| Japan | 8 |
| Rhode Island | 8 |
Solheim, Oddny Judith; Lundetrae, Kjersti – Assessment in Education: Principles, Policy & Practice, 2018
Gender differences in reading seem to increase throughout schooling and then decrease or even disappear with age, but the reasons for this are unclear. In this study, we explore whether differences in the way "reading literacy" is operationalised can add to our understanding of varying gender differences in international large-scale…
Descriptors: Achievement Tests, Foreign Countries, Grade 4, Reading Achievement
Haladyna, Thomas M. – IDEA Center, Inc., 2018
Writing multiple-choice test items to measure student learning in higher education is a challenge. Based on extensive scholarly research and experience, the author describes various item formats, offers guidelines for creating these items, and provides many examples of both good and bad test items. He also suggests some shortcuts for developing…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Higher Education
Hildebrandt, Susan A.; Swanson, Pete – Foreign Language Annals, 2019
Implemented in almost 900 teacher education programs across 41 states and the District of Columbia, edTPA is marketed as a content-specific, standardized portfolio assessment of beginning teacher performance. However, concerns about edTPA and its content specificity are pervasive. To that end, the researchers surveyed teacher educators with World…
Descriptors: Language Teachers, Teacher Attitudes, Second Language Learning, Second Language Instruction
Karimova, Könül; Csapó, Beno – Journal of Advanced Academics, 2020
The internal/external (I/E) frame of reference entails high, positive association of mathematics and verbal achievements with matching academic self-concepts but negative or near-zero correlation with their nonmatching self-concepts. This study aimed to extend the traditional I/E model by contrasting the mathematics domain with two foreign…
Descriptors: Mathematics Achievement, Verbal Ability, Self Concept, Academic Ability
Magno, Carlo – UNESCO Bangkok, 2020
The COVID-19 pandemic has disrupted education across the globe, leading countries to adapt how they administer and manage high-stakes examinations and large-scale learning assessments. This thematic review describes the measures that countries have taken, in terms of policies and practices, when learning assessments are disrupted by emergencies and…
Descriptors: High Stakes Tests, COVID-19, Pandemics, Cross Cultural Studies
Bass, Kristin M.; Drits-Esser, Dina; Stark, Louisa A. – CBE - Life Sciences Education, 2016
The credibility of conclusions made about the effectiveness of educational interventions depends greatly on the quality of the assessments used to measure learning gains. This essay, intended for faculty involved in small-scale projects, courses, or educational research, provides a step-by-step guide to the process of developing, scoring, and…
Descriptors: Sciences, Knowledge Level, Educational Research, High School Students
Traxler, Adrienne; Henderson, Rachel; Stewart, John; Stewart, Gay; Papak, Alexis; Lindell, Rebecca – Physical Review Physics Education Research, 2018
Research on the test structure of the Force Concept Inventory (FCI) has largely ignored gender, and research on FCI gender effects (often reported as "gender gaps") has seldom interrogated the structure of the test. These rarely crossed streams of research leave open the possibility that the FCI may not be structurally valid across…
Descriptors: Physics, Science Instruction, Sex Fairness, Gender Differences
Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017
The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…
Descriptors: Tests, Money Management, Literacy, High School Students
Flory, Michael; Sun, Chris – CNA Corporation, 2017
The Every Student Succeeds Act (ESSA) provides greater flexibility in state accountability systems than did previous federal legislation. In response, many states continue to refine their accountability systems to include college readiness tests, including college admissions and placement exams. This paper summarizes perspectives of K-12…
Descriptors: College Readiness, College Entrance Examinations, Student Placement, Educational Legislation
Zou, Min; Wu, Wenxin – English Language Teaching, 2015
Since its first pilot study was launched in 2003, China Accreditation Test for Translators and Interpreters (CATTI) has developed into the most authoritative translation and interpretation proficiency qualification accreditation test in China and played an important role in assessing and cultivating translators and interpreters. Based on the…
Descriptors: Foreign Countries, Translation, Test Validity, Test Reliability
Varcasia, Cecilia – Language Learning in Higher Education, 2019
This paper explores, through the use of the observation checklist and Conversation Analysis (CA), the discourse functions elicited by the dialogic task of the Free University of Bolzano (IT) speaking test. It aims to contribute to content validation, which has been claimed to be especially relevant in paired speaking tests, where interaction is…
Descriptors: Discourse Analysis, Content Validity, Check Lists, Task Analysis
Kheirzadeh, Shiela; Marandi, S. Susan; Tavakoli, Mansoor – International Journal of Assessment Tools in Education, 2019
To investigate the congruence between the requisite postgraduate academic language skills and the language skills measured by the General English section of the Iranian National PhD Entrance exam, field specialist informants, language-specialist informants and post-graduate students were questioned. The informants' data were collected through…
Descriptors: English for Academic Purposes, Second Language Learning, College Entrance Examinations, Doctoral Programs
National Assessment Governing Board, 2016
Having a large-scale national assessment in the arts makes an important statement about the need for all children in our country to obtain the special benefits of learning that only the arts provide. In recognition of the importance of the arts in education, the National Assessment of Educational Progress (NAEP), also known as The Nation's Report…
Descriptors: Art Education, National Competency Tests, Guidelines, Test Content
Roehl, Tobias – Cultural Studies of Science Education, 2015
Drawing on a sociocultural perspective on educational assessment, the empirical examples of Margareta Serder and Anders Jakobsson serve as a starting point for a critical analysis of PISA and the image of science education it perpetuates. While PISA claims to neutrally measure competencies relevant to science education, I argue that the test…
Descriptors: International Assessment, Educational Assessment, Science Education, Measurement
Wolkowitz, Amanda; Davis-Becker, Susan – Practical Assessment, Research & Evaluation, 2015
This study evaluates the impact of common item characteristics on the outcome of equating in credentialing examinations when traditionally recommended representation is not possible. This research used real data sets from several credentialing exams to test the impact of content representation, item statistics, and number of common items on…
Descriptors: Test Items, Equated Scores, Licensing Examinations (Professions), Test Content

