Publication Date
In 2025 | 34 |
Since 2024 | 128 |
Since 2021 (last 5 years) | 467 |
Since 2016 (last 10 years) | 873 |
Since 2006 (last 20 years) | 1353 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 195 |
Teachers | 159 |
Researchers | 92 |
Administrators | 49 |
Students | 34 |
Policymakers | 14 |
Parents | 12 |
Counselors | 2 |
Community | 1 |
Media Staff | 1 |
Support Staff | 1 |
More ▼ |
Location
Canada | 62 |
Turkey | 59 |
Germany | 40 |
United Kingdom | 36 |
Australia | 35 |
Japan | 35 |
China | 32 |
United States | 32 |
California | 25 |
United Kingdom (England) | 25 |
Netherlands | 24 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Whittle, Rachael J.; Benson, Amanda C.; Ullah, Shahid; Telford, Amanda – Educational Research and Evaluation, 2018
External written examinations are commonly used for determining student academic achievement. The influence of question type and cognitive process on examination performance in senior-secondary physical education is unclear. A secondary data analysis of Victorian Certificate of Education (VCE) Physical Education examination data (2011; n = 9,323,…
Descriptors: Cognitive Processes, Academic Achievement, Physical Education, Test Items
Bijsterbosch, Erik – Geographical Education, 2018
Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…
Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction
Loiseau, Nathalie; Delgado Luchner, Carmen – Interpreter and Translator Trainer, 2021
To date, research into conference interpreting has not produced a definition of the concrete subskills associated with an A, B and C language in interpreters' combinations of working languages. Existing frameworks for performance assessment in foreign languages are not designed to cover the very advanced range of language mastery associated with…
Descriptors: Translation, Language Proficiency, Second Language Learning, Second Language Instruction
Oyzon, Voltaire Q.; Bendulo, Hermabeth O.; Tibus, Erlinda D.; Bande, Rhodora A.; Macalinao, Myrna L. – International Journal of Evaluation and Research in Education, 2016
Schools in the Philippines, especially those that are offering teacher education programs, are advised to construct examinations that are Licensure Examination for Teachers (LET)-like test items. This is because "if any aspect of a test is unfamiliar to candidates, they are likely to perform less well than they would do otherwise on…
Descriptors: Foreign Countries, Student Attitudes, Preferences, Test Format
Scott, Terry F.; Schumayer, Dániel – Physical Review Physics Education Research, 2017
The Force Concept Inventory is one of the most popular and most analyzed multiple-choice concept tests used to investigate students' understanding of Newtonian mechanics. The correct answers poll a set of underlying Newtonian concepts and the coherence of these underlying concepts has been found in the data. However, this inventory was constructed…
Descriptors: World Views, Scientific Concepts, Scientific Principles, Multiple Choice Tests
Tengberg, Michael – Language Testing, 2017
Reading comprehension tests are often assumed to measure the same, or at least similar, constructs. Yet, reading is not a single but a multidimensional form of processing, which means that variations in terms of reading material and item design may emphasize one aspect of the construct at the cost of another. The educational systems in Denmark,…
Descriptors: Foreign Countries, National Competency Tests, Reading Tests, Comparative Analysis
Madya, Suwarsih; Retnawati, Heri; Purnawan, Ari; Putro, Nur Hidayanto Pancoro Setyo; Apino, Ezi – TEFLIN Journal: A publication on the teaching and learning of English, 2019
This explorative-descriptive study set out to examine the equivalence among Test of English Proficiency (TOEP) forms, developed by the Indonesian Testing Service Centre (ITSC) and co-founded by The Association for The Teaching of English as a Foreign Language in Indonesia (TEFLIN) and The Association of Psychology in Indonesia. Using a…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning
Raymond Stubbe; Yumiko Cochrane – Vocabulary Learning and Instruction, 2019
One of the many challenges facing Japanese university students studying English is the multi-word phrase. The English language contains a large number of such multiple-word items, which act as single words with a single meaning. This study is concerned with evaluating the efficacy of yes/no checklist tests to assess knowledge of multi-word units.…
Descriptors: Check Lists, Word Lists, Phrase Structure, English (Second Language)
Solheim, Oddny Judith; Lundetrae, Kjersti – Assessment in Education: Principles, Policy & Practice, 2018
Gender differences in reading seem to increase throughout schooling and then decrease or even disappear with age, but the reasons for this are unclear. In this study, we explore whether differences in the way "reading literacy" is operationalised can add to our understanding of varying gender differences in international large-scale…
Descriptors: Achievement Tests, Foreign Countries, Grade 4, Reading Achievement
Haladyna, Thomas M. – IDEA Center, Inc., 2018
Writing multiple-choice test items to measure student learning in higher education is a challenge. Based on extensive scholarly research and experience, the author describes various item formats, offers guidelines for creating these items, and provides many examples of both good and bad test items. He also suggests some shortcuts for developing…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Higher Education
Bush, Martin – Assessment & Evaluation in Higher Education, 2015
The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time consuming to take, and for some subject areas, it can be…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability
Liang, Tie; Wells, Craig S. – Applied Measurement in Education, 2015
Investigating the fit of a parametric model plays a vital role in validating an item response theory (IRT) model. An area that has received little attention is the assessment of multiple IRT models used in a mixed-format test. The present study extends the nonparametric approach, proposed by Douglas and Cohen (2001), to assess model fit of three…
Descriptors: Nonparametric Statistics, Goodness of Fit, Item Response Theory, Test Format
Menold, Natalja; Tausch, Anja – Sociological Methods & Research, 2016
Effects of rating scale forms on cross-sectional reliability and measurement equivalence were investigated. A randomized experimental design was implemented, varying category labels and number of categories. The participants were 800 students at two German universities. In contrast to previous research, reliability assessment method was used,…
Descriptors: Rating Scales, Test Reliability, Measurement, Classification
Boone, William J. – CBE - Life Sciences Education, 2016
This essay describes Rasch analysis psychometric techniques and how such techniques can be used by life sciences education researchers to guide the development and use of surveys and tests. Specifically, Rasch techniques can be used to document and evaluate the measurement functioning of such instruments. Rasch techniques also allow researchers to…
Descriptors: Item Response Theory, Psychometrics, Science Education, Educational Research
Jiajun Guo – ProQuest LLC, 2016
Divergent thinking (DT) tests are the most frequently used types of creativity assessment and have been administered in traditional paper and pencil format for more than a half century. With the prevalence of computer-based testing and increasing demands for large-scale, faster, and more flexible testing procedures, it is necessary to explore and…
Descriptors: Test Construction, Computer Assisted Testing, Creative Thinking, Creativity Tests