Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 68 |
| Since 2017 (last 10 years) | 169 |
| Since 2007 (last 20 years) | 391 |
Descriptor
| Test Content | 826 |
| Test Construction | 284 |
| Test Items | 264 |
| Test Validity | 189 |
| Foreign Countries | 168 |
| Test Format | 157 |
| Student Evaluation | 138 |
| Test Reliability | 136 |
| Elementary Secondary Education | 125 |
| Testing | 111 |
| Standardized Tests | 105 |
Author
| Sireci, Stephen G. | 9 |
| Kitao, Kenji | 4 |
| Kitao, S. Kathleen | 4 |
| Papageorgiou, Spiros | 4 |
| Thurlow, Martha L. | 4 |
| Winnick, Joseph P. | 4 |
| van der Linden, Wim J. | 4 |
| Chang, Hua-Hua | 3 |
| Donovan, Jenny | 3 |
| Ewing, Maureen | 3 |
| Hau, Kit-Tai | 3 |
Audience
| Teachers | 68 |
| Practitioners | 59 |
| Administrators | 20 |
| Students | 15 |
| Policymakers | 9 |
| Researchers | 7 |
| Parents | 6 |
| Counselors | 3 |
| Community | 2 |
| Support Staff | 1 |
Location
| Australia | 18 |
| California | 15 |
| Canada | 14 |
| China | 13 |
| United States | 12 |
| Massachusetts | 9 |
| United Kingdom | 9 |
| Europe | 8 |
| Georgia | 8 |
| Japan | 8 |
| Rhode Island | 8 |
Baghaei, Samira; Bagheri, Mohammad Sadegh; Yamini, Mortaza – Cogent Education, 2020
The main purpose of this quantitative-qualitative content analysis study was to compare IELTS and TOEFL listening and reading tests based on the representation of the learning objectives of Revised Bloom's taxonomy. To this end, 12 Academic IELTS listening and reading tests and 12 TOEFL iBT listening and reading tests were analyzed qualitatively…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Reading Tests
Randall, David – National Association of Scholars, 2020
Since 2014 the College Board has continued to revise and develop the Advanced Placement European, United States, and World History examinations. It keeps getting in trouble. Many critics have excoriated the College Board for teaching history grossly politicized to the left--history without the history of freedom, history that teaches hatred of…
Descriptors: Advanced Placement Programs, History Instruction, World History, Social Bias
National Assessment Governing Board, 2019
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. The NAEP assessment in mathematics has two components that differ in purpose. One assessment measures long-term trends in achievement among 9-, 13-, and 17-year-old students by using the same basic design each time.…
Descriptors: National Competency Tests, Mathematics Achievement, Grade 4, Grade 8
Arneson, Amy – ProQuest LLC, 2019
This three-paper dissertation explores item cluster-based assessments, first in general as it relates to modeling, and then, specific issues surrounding a particular item cluster-based assessment designed. There should be a reasonable analogy between the structure of a psychometric model and the cognitive theory that the assessment is based upon.…
Descriptors: Item Response Theory, Test Items, Critical Thinking, Cognitive Tests
Oliveri, María Elena; Nastal, Jessica; Slomp, David – ETS Research Report Series, 2020
This report discusses frameworks and assessment development approaches to consider fairness, opportunity to learn, and consequences of test use in the design and use of assessments administered to diverse populations. Examples include the integrated design and appraisal framework and the sociocognitively based evidence-centered design approach.…
Descriptors: Culture Fair Tests, Guidelines, Test Use, Test Construction
Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016
A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…
Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification
Neiro, Jakke; Johansson, Niko – LUMAT: International Journal on Math, Science and Technology Education, 2020
The history and evolution of science assessment remains poorly known, especially in the context of the exam question contents. Here we analyze the Finnish matriculation examination in biology from the 1920s to 1960s to understand how the exam has evolved in both its knowledge content and educational form. Each question was classified according to…
Descriptors: Foreign Countries, Biology, Test Content, Test Format
Bello, Hassan; Abdullah, Nor Athiyah – Electronic Journal of e-Learning, 2021
Computer-based assessment or e-assessment system is an e-learning system where information communication technology is utilized for examination activity, grading, and recording of responses of the examinees. It includes the entire assessment process from the examinees, teachers, institutions, examination agencies, and the public. E-assessment…
Descriptors: Evaluation Methods, Computer Assisted Testing, Technology Uses in Education, Program Effectiveness
Markus, Keith A. – Assessment in Education: Principles, Policy & Practice, 2016
Justification of testing practice involves moving from one state of knowledge about the test to another. Theories of test validity can (a) focus on the beginning of the process, (b) focus on the end, or (c) encompass the entire process. Analyses of four case studies test and illustrate three claims: (a) restrictions on validity entail a supplement…
Descriptors: Vocabulary, Test Validity, Case Studies, Test Construction
Reed, Jessica J.; Villafañe, Sachel M.; Raker, Jeffrey R.; Holme, Thomas A.; Murphy, Kristen L. – Journal of Chemical Education, 2017
General chemistry courses are often the foundation for the study of other science disciplines and upper-level chemistry concepts. Students who take introductory chemistry courses are more often from health and science-related fields than chemistry. As such, the content taught and assessed in general chemistry courses is envisioned as building…
Descriptors: Science Tests, Chemistry, Test Items, Test Content
Sahin, Alper; Ozbasi, Durmus – Eurasian Journal of Educational Research, 2017
Purpose: This study aims to reveal effects of content balancing and item selection method on ability estimation in computerized adaptive tests by comparing Fisher's maximum information (FMI) and likelihood weighted information (LWI) methods. Research Methods: Four groups of examinees (250, 500, 750, 1000) and a bank of 500 items with 10 different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Test Content
North, Brian; Piccardo, Enrica – Language Assessment Quarterly, 2023
This paper presents a methodology for directly aligning 'can do' frameworks to each other. The methodology, inspired by the manual for relating examinations to the "Common European Framework of Reference for Languages: Learning, teaching, assessment" (CEFR) (Council of Europe, 2009) and Kane's (2004, 2013) interpretative argument, takes…
Descriptors: Second Language Learning, Second Language Instruction, Language Proficiency, Rating Scales
Zhao, Xueyu; Solano-Flores, Guillermo; Qian, Ming – International Multilingual Research Journal, 2018
This article addresses test translation review in international test comparisons. We investigated the applicability of the theory of test translation error--a theory of the multidimensionality and inevitability of test translation error--across source language-target language combinations in the translation of PISA (Programme of International…
Descriptors: Translation, Error Patterns, Achievement Tests, Foreign Countries
Yang, Xuexue – International Multilingual Research Journal, 2020
Despite the importance of assessment accommodations, little is known about its use in the context of classroom assessments. To provide guidance for teachers on how to best support their emergent bilinguals during classroom assessments, there may be ideas from large-scale assessments that can be used in the classrooms. This article, a targeted…
Descriptors: Testing Accommodations, Measurement, Bilingualism, Second Language Learning
Scharaschkin, Alex – Assessment in Education: Principles, Policy & Practice, 2017
This issue's featured article, "Assessment and Learning: Fields Apart" (Baird, Andrich, Hopfenbeck, and Stobart 2017) raises issues that are of basic importance for the disciplines of assessment and teaching and learning theory. In this commentary, Alex Scharaschkin restricts his remarks to a few areas. He considers the idea of a…
Descriptors: Educational Assessment, Learning Theories, Test Theory, Psychometrics

