NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Liu, Xiaohua; Read, John – Language Assessment Quarterly, 2021
Expert judgement has been frequently employed with reading assessments to gauge the skills potentially measured by test tasks, for purposes such as construct validation or producing diagnostic information. Despite the critical role it plays in such endeavours, few studies have triangulated its results with other types of data such as reported…
Descriptors: Reading Tests, Reading Skills, Test Items, Expertise
Peer reviewed Peer reviewed
Direct linkDirect link
Schramm, Thilo; Jose, Anika; Schmiemann, Philipp – CBE - Life Sciences Education, 2021
Evolutionary trees are central to learning about evolutionary processes, yet students at all educational levels struggle to read and interpret them. The synthetic tree-reading model (STREAM), based on published and not yet empirically tested models, was tested to determine whether the assumed hierarchy of the model could be substantiated and how…
Descriptors: Undergraduate Students, Graduate Students, Evolution, Visual Aids
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Asquith, Steven – TESL-EJ, 2022
Although an accurate measure of vocabulary size is integral to understanding the proficiency of language learners, the validity of multiple-choice (M/C) vocabulary tests to determine this has been questioned due to users guessing correct answers which inflates scores. In this paper the nature of guessing and partial knowledge used when taking the…
Descriptors: Guessing (Tests), English (Second Language), Second Language Learning, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Hoffmann, Matt D.; Loughead, Todd. M. – Measurement in Physical Education and Exercise Science, 2019
Using a multiphase approach, the purpose of the present study was to develop a psychometrically sound questionnaire to measure protégés' perceptions of peer athlete mentoring functions. Phase 1 consisted of three stages: (a) item development, (b) assessment of content validity via think-aloud interviews with peer mentored athletes, and (c)…
Descriptors: Athletes, Mentors, Questionnaires, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Lehane, Paula; Scully, Darina; O'Leary, Michael – Irish Educational Studies, 2022
In line with the widespread proliferation of digital technology in everyday life, many countries are now beginning to use computer-based exams (CBEs) in their post-primary education systems. To ensure that these CBEs are delivered in a manner that preserves their fairness, validity, utility and credibility, several factors pertaining to their…
Descriptors: Computer Assisted Testing, Secondary School Students, Culture Fair Tests, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Yunjiu, Luo; Wei, Wei; Zheng, Ying – SAGE Open, 2022
Artificial intelligence (AI) technologies have the potential to reduce the workload for the second language (L2) teachers and test developers. We propose two AI distractor-generating methods for creating Chinese vocabulary items: semantic similarity and visual similarity. Semantic similarity refers to antonyms and synonyms, while visual similarity…
Descriptors: Chinese, Vocabulary Development, Artificial Intelligence, Undergraduate Students
Peer reviewed Peer reviewed
Direct linkDirect link
Johnson, Martin; Rushton, Nicky – Educational Research, 2019
Background: The development of a set of questions is a central element of examination development, with the validity of an examination resting to a large extent on the quality of the questions that it comprises. This paper reports on the methods and findings of a project that explores how educational examination question writers engage in the…
Descriptors: Writing (Composition), Test Construction, Specialists, Protocol Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Selleri, Patrizia; Carugati, Felice – European Journal of Psychology of Education, 2018
There is a consensus that the items proposed by the Program for International Student Assessment (PISA) program allow us to focus on the outcomes of the processes of appropriation and transformation of learning tools at the end of compulsory schooling, particularly regarding the key competencies for lifelong learning and citizenship in digital…
Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, International Assessment
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lee, Jia-Ying – Taiwan Journal of TESOL, 2018
This article examines the test-taking strategies of high- and low-scoring Chinese-speaking participants when they answer English multiple-choice reading comprehension questions. Thirty-two participants took a TOEIC reading test, provided think-aloud protocols, and joined a post-task interview. The data come primarily from qualitative analysis and…
Descriptors: Foreign Countries, Test Wiseness, English (Second Language), Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Vahrenhold, Jan; Paul, Wolfgang – Computer Science Education, 2014
We report on the development, validation, and implementation of a collection of test items designed to detect misconceptions related to first-year computer science courses. To this end, we reworked the development scheme proposed by Almstrum et al. ("SIGCSE Bulletin" 38(4):132-145, 2006) to include students' artifacts and to…
Descriptors: Computer Science Education, Introductory Courses, Test Items, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013
The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…
Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests
Gewertz, Catherine – Education Week, 2012
Pondering a math problem while she swings her sneakered feet from a chair, 12-year-old Andrea Guevara is helping researchers design an assessment that will shape the learning of 19 million students. The 8th grader, who came to the United States from Ecuador three years ago, is trying out two ways of providing English-language support on a…
Descriptors: Test Items, Foreign Countries, Feedback (Response), Protocol Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hasse, Sascha; Joachim, Cora; Bögeholz, Susanne; Hammann, Marcus – International Journal of Education in Mathematics, Science and Technology, 2014
In Germany, science education standards for students at the end of grade nine have been in existance since 2005. Some of these standards are dedicated to scientific inquiry (e.g. experimentation). They describe which abilities learners are expected to possess at the end of grade nine. In the USA, several documents describe standards for…
Descriptors: Foreign Countries, Preservice Teachers, Biology, Science Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Roth, Wolff-Michael; Oliveri, Maria Elena; Sandilands, Debra Dallie; Lyons-Thomas, Juliette; Ercikan, Kadriye – International Journal of Science Education, 2013
Even if national and international assessments are designed to be comparable, subsequent psychometric analyses often reveal differential item functioning (DIF). Central to achieving comparability is to examine the presence of DIF, and if DIF is found, to investigate its sources to ensure differentially functioning items that do not lead to bias.…
Descriptors: Test Bias, Evaluation Methods, Protocol Analysis, Science Achievement
Triska, Olive H.; And Others – 1996
The domination of the information processing approach has shifted research from problem solving strategies to the structure and organization of knowledge that characterizes expertise. The purpose of this study was to compare the reasoning processes of 12 clinicians and 40 medical students as they responded to 6 positively stated multiple choice…
Descriptors: Clinical Diagnosis, Cognitive Processes, College Faculty, Foreign Countries