Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Radišic, Jelena; Baucal, Aleksandar – European Journal of Psychology of Education, 2018
The study explores how teachers perceive and go about students' thinking in connection to particular mathematical content and how they frame the notion of applied mathematics in their own classrooms. Teachers' narratives are built around two released PISA 2012 mathematics items, the 'Drip rate' and 'Climbing Mount Fuji' (will be referred to as the…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Verdis, Athanasios; Sotiriou, Christina – International Journal of Music Education, 2018
This study investigates the psychometric characteristics of Gordon's Advanced Measures of Music Audiation (AMMA) in a region with strong non-Western music tradition. It also examines the possibility of measuring audiation with the modern psychometric theory. The AMMA test was administered to 513 students in the city of Ioannina and a number of…
Descriptors: Psychometrics, Music, Correlation, Factor Analysis
Olpak, Yusuf Ziya; Kiliç Çakmak, Ebru – Online Learning, 2018
The aim of this study was to describe the validity and reliability of a Turkish language version of the CoI survey developed by Arbaugh et al. (2008). Data were obtained from 1150 students enrolled in online courses in various departments in three Turkish state universities. The data were randomly divided into two parts: the first part was…
Descriptors: Foreign Countries, Test Reliability, Test Validity, Student Surveys
McClellan, Catherine; Snyder, Rebecca; Woods-Murphy, Maryann; Basset, Katherine – National Network of State Teachers of the Year, 2018
Great teachers recognize great assessments. As policy and education leaders work to make sure state tests are measuring the problem-solving, writing, and critical-thinking skills students need for success, they should convene and rely on teachers to review test quality and help answer the question: Do the questions on our state test reflect…
Descriptors: Student Evaluation, Educational Quality, Standardized Tests, Test Items
Haladyna, Thomas M. – IDEA Center, Inc., 2018
Writing multiple-choice test items to measure student learning in higher education is a challenge. Based on extensive scholarly research and experience, the author describes various item formats, offers guidelines for creating these items, and provides many examples of both good and bad test items. He also suggests some shortcuts for developing…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Higher Education
Agnello, Paul – ProQuest LLC, 2018
Pseudowords (words that are not real but resemble real words in a language) have been used increasingly as a technique to reduce contamination due to construct-irrelevant variance in assessments of verbal fluid reasoning (Gf). However, despite pseudowords being researched heavily in other psychology sub-disciplines, they have received little…
Descriptors: Scores, Intelligence Tests, Difficulty Level, Item Analysis
Anjar Putro Utomo; Erlia Narulita; Kinya Shimizu – Journal of Baltic Science Education, 2018
The aim of this research was to assess the classification of science test items of TIMSS grade 8 based on higher order thinking skills (HOTS) and determine whether those classified-science test items can be an assessment tool in science class. Sixteen sample test items of HOTS were chosen from 37 reasoning items of TIMSS 1999, 2003, and 2011;…
Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, International Assessment
Zhao, Xueyu; Solano-Flores, Guillermo – Journal of Multilingual and Multicultural Development, 2021
We investigated whether consensus-based test translation review procedures can be used effectively in cultural contexts with high social stratification. We staged two test translation review panels in China -- where social stratification may potentially inhibit individuals' ability to express opinion and disagreement. We adapted a consensus-based…
Descriptors: Translation, Test Construction, Error Patterns, Foreign Countries
Gökçe, Semirhan; Berberoglu, Giray; Wells, Craig S.; Sireci, Stephen G. – Journal of Psychoeducational Assessment, 2021
The 2015 Trends in International Mathematics and Science Study (TIMSS) involved 57 countries and 43 different languages to assess students' achievement in mathematics and science. The purpose of this study is to evaluate whether items and test scores are affected as the differences between language families and cultures increase. Using…
Descriptors: Language Classification, Elementary Secondary Education, Mathematics Achievement, Mathematics Tests
Mullis, Ina V. S., Ed.; Martin, Michael O., Ed.; von Davier, Matthias, Ed. – International Association for the Evaluation of Educational Achievement, 2021
TIMSS (Trends in International Mathematics and Science Study) is a long-standing international assessment of mathematics and science at the fourth and eighth grades that has been collecting trend data every four years since 1995. About 70 countries use TIMSS trend data for monitoring the effectiveness of their education systems in a global…
Descriptors: Achievement Tests, International Assessment, Science Achievement, Mathematics Achievement
Huggins, Anne Corinne – Educational and Psychological Measurement, 2014
Invariant relationships in the internal mechanisms of estimating achievement scores on educational tests serve as the basis for concluding that a particular test is fair with respect to statistical bias concerns. Equating invariance and differential item functioning are both concerned with invariant relationships yet are treated separately in the…
Descriptors: Test Bias, Test Items, Equated Scores, Achievement Tests
Shih, Ching-Lin; Liu, Tien-Hsiang; Wang, Wen-Chung – Educational and Psychological Measurement, 2014
The simultaneous item bias test (SIBTEST) method regression procedure and the differential item functioning (DIF)-free-then-DIF strategy are applied to the logistic regression (LR) method simultaneously in this study. These procedures are used to adjust the effects of matching true score on observed score and to better control the Type I error…
Descriptors: Test Bias, Regression (Statistics), Test Items, True Scores
Liu, Ou Lydia; Brew, Chris; Blackmore, John; Gerard, Libby; Madhok, Jacquie; Linn, Marcia C. – Educational Measurement: Issues and Practice, 2014
Content-based automated scoring has been applied in a variety of science domains. However, many prior applications involved simplified scoring rubrics without considering rubrics representing multiple levels of understanding. This study tested a concept-based scoring tool for content-based scoring, c-rater™, for four science items with rubrics…
Descriptors: Science Tests, Test Items, Scoring, Automation
Yeo, Lian-Ming; Tzeng, Yuh-Tsuen – EURASIA Journal of Mathematics, Science and Technology Education, 2019
Previous study has shown that tracing gesture may enhance the worked example-based learning by reducing cognitive load. The present study attempted to replicate the previous results and further explored the individual differences in tracing effect in relation to the learners' working-memory capacity. Specifically, 11- to 13-year-old students…
Descriptors: Demonstrations (Educational), Mathematics Instruction, Instructional Effectiveness, Geometric Concepts
Löwenadler, John – Language Testing, 2019
This study aims to investigate patterns of variation in the interplay of L2 language ability and general reading comprehension skills in L2 reading, by comparing item-level effects of test-takers' results on L1 and L2 reading comprehension tests. The material comes from more than 500,000 people tested on L1 (Swedish) and L2 (English) in the…
Descriptors: Swedish, English (Second Language), Second Language Learning, Second Language Instruction

