Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 16 |
Descriptor
| Comparative Analysis | 45 |
| Test Construction | 45 |
| Foreign Countries | 15 |
| Test Items | 13 |
| Language Tests | 11 |
| Test Use | 10 |
| Test Reliability | 9 |
| Test Validity | 9 |
| Higher Education | 8 |
| Scores | 8 |
| Testing | 7 |
| More ▼ | |
Source
Author
| Arth, Thomas O. | 1 |
| Baldwin, Peter | 1 |
| Bao, Lei | 1 |
| Bauer, Christopher F. | 1 |
| Bohlen, Michael J. | 1 |
| Briggs, Derek C. | 1 |
| Brown, James Dean | 1 |
| Brown, James Dean, Ed. | 1 |
| Brunsman, Bethany A. | 1 |
| Burston, Monique | 1 |
| Chen, Cheng | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 4 |
| Higher Education | 3 |
| Elementary Education | 1 |
| Secondary Education | 1 |
Audience
| Practitioners | 2 |
| Teachers | 1 |
Location
| Australia | 3 |
| Indonesia | 2 |
| Japan | 2 |
| New Zealand | 2 |
| North America | 2 |
| United States | 2 |
| Afghanistan | 1 |
| Africa | 1 |
| Asia | 1 |
| Bangladesh | 1 |
| Bhutan | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
Papageorgiou, Spiros; Manna, Venessa F. – Language Assessment Quarterly, 2021
The TOEFL iBT test was introduced in 2005 to better reflect the language demands of real-life academic tasks than did previous versions of the test. The task-based design of the test was intended to support the interpretation of its scores as a trustworthy measure of international students' ability to use English in an academic environment. Until…
Descriptors: Academic Language, COVID-19, Pandemics, Scores
Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022
Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…
Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills
Tacket, Wendy L.; Pasatta, Kelley; Pauken, Evan – Journal of College Access, 2018
The Center for Education Policy Research at Harvard University explained, "Across the country, 10- 40% of seemingly college-intending students, particularly those from low-income backgrounds, fail to enroll in college the fall after graduation. This phenomenon is known as "summer melt" (Castleman, Page, and Snowdon, 2013). In order…
Descriptors: Summer Programs, At Risk Students, College Attendance, Low Income Groups
Nowak, Danuta – Teaching English with Technology, 2013
The present article attempts to show how important and easy it is to use authentic material in the classroom. However, the teacher who copies news reports from the Internet may infringe the copyright law. The article offers a comparative analysis of copyright laws in Common Law countries and the EU countries in relation to fair use. The article…
Descriptors: Information Sources, Copyrights, Internet, Comparative Analysis
Detterman, Douglas K. – Intelligence, 2011
Watson's Jeopardy victory raises the question of the similarity of artificial intelligence and human intelligence. Those of us who study human intelligence issue a challenge to the artificial intelligence community. We will construct a unique battery of tests for any computer that would provide an actual IQ score for the computer. This is the same…
Descriptors: Artificial Intelligence, Intelligence, Human Body, Comparative Analysis
Australian Council for Educational Research, 2015
Monitoring Trends in Educational Growth (MTEG) offers a flexible, collaborative approach to developing and implementing an assessment of learning outcomes that yields high-quality, nationally relevant data. MTEG is a service that involves ACER staff working closely with each country to develop an assessment program that meets the country's…
Descriptors: Educational Development, Educational Trends, Progress Monitoring, Educational Quality
Shohamy, Elana – Language and Intercultural Communication, 2013
While much of the work in language testing is concerned with constructing quality tests in order to measure language knowledge in reliable and valid ways, there has been a significant movement in language testing research that examines tests in the context of their use in education and society. This line of research exits from the notion that…
Descriptors: Language Tests, Testing, Evaluation Research, Ideology
Kowal, Julie; Hassel, Emily Ayscue – Public Impact, 2010
For too long, performance measurement systems in education have failed to document and recognize real differences among educators. But a recent national push to use performance evaluations for critical personnel decisions has highlighted the shortcomings of the current systems and increased the urgency to dramatically improve them. As state and…
Descriptors: Teaching (Occupation), Teacher Evaluation, Performance Based Assessment, Comparative Analysis
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests
Mueller Gathercole, Virginia C.; Thomas, Enlli Mon; Hughes, Emma – International Journal of Bilingual Education and Bilingualism, 2008
The purpose of this paper is to propose an applied model for the assessment of bilingual children's language abilities with standardised tests. We discuss the purposes of such tests, especially in relation to vocabulary knowledge, and potential applications of test results for each of those purposes. The specific case to be examined here is that…
Descriptors: Test Results, Language Tests, Monolingualism, Vocabulary Development
Briggs, Derek C. – Partnership for Assessment of Readiness for College and Careers, 2011
There is often confusion about distinctions between growth models and value-added models. The first half of this paper attempts to dispel some of these confusions by clarifying terminology and illustrating by example how the results from a large-scale assessment can and will be used to make inferences about student growth and the value-added…
Descriptors: Value Added Models, Language Usage, Measurement, Inferences
Gantt, Linda M. – Art Therapy: Journal of the American Art Therapy Association, 2009
The Formal Elements Art Therapy Scale (FEATS) is a measurement system for applying numbers to global variables in two-dimensional art (drawing and painting). While it was originally developed for use with the single-picture assessment ("Draw a person picking an apple from a tree" [PPAT]), researchers can also apply many of the 14 scales of the…
Descriptors: Measurement Techniques, Measures (Individuals), Art Therapy, Evaluation Methods
Leach, Mark M.; Oakland, Thomas – International Journal of Testing, 2007
Ethics codes are designed to protect the public by prescribing behaviors professionals are expected to exhibit. Although test use is universal, albeit reflecting strong Western influences, previous studies that examine the degree issues pertaining to test development and use and that are addressed in ethics codes of national psychological…
Descriptors: Test Use, Test Construction, Psychologists, Psychology

Peer reviewed
Direct link
