Publication Date
| In 2026 | 0 |
| Since 2025 | 55 |
| Since 2022 (last 5 years) | 197 |
| Since 2017 (last 10 years) | 497 |
| Since 2007 (last 20 years) | 745 |
Descriptor
| Test Items | 1189 |
| Test Reliability | 1189 |
| Test Validity | 687 |
| Test Construction | 567 |
| Foreign Countries | 349 |
| Difficulty Level | 280 |
| Item Analysis | 253 |
| Psychometrics | 236 |
| Item Response Theory | 219 |
| Factor Analysis | 184 |
| Multiple Choice Tests | 173 |
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 69 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Baghaei, Purya; Dourakhshan, Alireza – International Journal of Language Testing, 2016
The purpose of the present study is to compare the psychometric qualities of canonical single-response multiple-choice items with their double-response counterparts. Thirty two-response, four-option grammar items for undergraduate students of English were constructed. A second version of the test was constructed by replacing one of the correct…
Descriptors: Language Tests, Multiple Choice Tests, Test Items, Factor Analysis
Alexander, Patricia A.; Singer, Lauren M.; Jablansky, Sophie; Hattan, Courtney – Journal of Educational Psychology, 2016
This study investigated the relational reasoning capabilities of older adolescents and young adults when the focal assessment was a verbal and more schooled measure than one that was figural and more novel in its configuration. To achieve this end, the verbal test of relational reasoning (vTORR) was constructed to parallel the test of relational…
Descriptors: Thinking Skills, Adolescents, Young Adults, Cognitive Ability
Erkan, Senem Seda Sahenk; Dagal, Asude Balaban; Tezcan, Özlem – Journal of Education and Training Studies, 2016
The main purpose of this study was to develop a valid and reliable scale for printed and digital competencies ("The Printed and Digital Reading Habits Scale"). The problem statement of this research can be expressed as: "Is 'The Printed and Digital Reading Habits Scale' a valid and reliable scale?" In this study, the scale…
Descriptors: Reading Habits, Preservice Teacher Education, Preservice Teachers, Teacher Competency Testing
Walker, Grant M.; Schwartz, Myrna F. – American Journal of Speech-Language Pathology, 2012
Purpose: To create two matched short forms of the Philadelphia Naming Test (PNT; Roach, Schwartz, Martin, Grewal, & Brecher, 1996) that yield similar results to the PNT for measuring anomia. Method: In Study 1, archived naming data from 94 individuals with aphasia were used to identify which PNT items should be included in the short forms. The 2…
Descriptors: Naming, Tests, Aphasia, Test Items
Wilcox, Bethany R.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
Standardized conceptual assessment represents a widely used tool for educational researchers interested in student learning within the standard undergraduate physics curriculum. For example, these assessments are often used to measure student learning across educational contexts and instructional strategies. However, to support the large-scale…
Descriptors: Science Instruction, Scientific Concepts, College Science, Physics
Raker, Jeffrey R.; Trate, Jaclyn M.; Holme, Thomas A.; Murphy, Kristen – Journal of Chemical Education, 2013
Experts use their domain expertise and knowledge of examinees' ability levels as they write test items. The expert test writer can then estimate the difficulty of the test items subjectively. However, an objective method for assigning difficulty to a test item would capture the cognitive demands imposed on the examinee as well as be…
Descriptors: Organic Chemistry, Test Items, Item Analysis, Difficulty Level
Sabatini, J.; O'Reilly, T.; Halderman, L.; Bruce, K. – Grantee Submission, 2014
Existing reading comprehension assessments have been criticized by researchers, educators, and policy makers, especially regarding their coverage, utility, and authenticity. The purpose of the current study was to evaluate a new assessment of reading comprehension that was designed to broaden the construct of reading. In light of these issues, we…
Descriptors: Reading Comprehension, Vignettes, Reading Tests, Elementary School Students
Eckes, Thomas – Language Testing, 2014
Testlets are subsets of test items that are based on the same stimulus and are administered together. Tests that contain testlets are in widespread use in language testing, but they also share a fundamental problem: Items within a testlet are locally dependent with possibly adverse consequences for test score interpretation and use. Building on…
Descriptors: Test Items, Language Tests, Listening Comprehension Tests, German
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Palacio, Marcela; Gaviria, Sandra; Brown, James Dean – PROFILE: Issues in Teachers' Professional Development, 2016
Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction
Avcu, Ramazan; Avcu, Seher – Eurasian Journal of Educational Research, 2015
Problem Statement: Among attitude measures, attitude scales are the most common, objective, and effective in gathering attitude data, and there are plenty of scales that measure various factors of attitude towards mathematics. However, there is a need for attitude scales that are content-specific, such as geometry, algebra, probability and…
Descriptors: Foreign Countries, Attitude Measures, Mathematics Instruction, Geometry
Liu, Heidi Han-Ting; Lee, Young-Sun – SAGE Open, 2015
Self-regulation has become a widely discussed subject in education as it facilitates learners' ability to master their own learning. The purpose of the present study is to examine the psychometric properties of self-regulation in second language learning via Rasch measurement. A total of 528 high-school students from an East Asian country…
Descriptors: Self Control, Second Language Learning, English (Second Language), Measures (Individuals)
Avci, Filiz; Acar Sesen, Burçin; Kirbaslar, Fatma Gülay – Online Submission, 2015
Studies on science education show that students have many misunderstandings and misconceptions related to chemistry concepts. Many techniques have been used to determine those misconceptions, such as multiple-choice tests, interviews, and drawings. The two-tier diagnostic test is one of the techniques used to determine misconceptions and their…
Descriptors: Middle School Students, Grade 7, Chemistry, Science Instruction
Edwards, Michael C.; Flora, David B.; Thissen, David – Applied Measurement in Education, 2012
This article describes a computerized adaptive test (CAT) based on the uniform item exposure multi-form structure (uMFS). The uMFS is a specialization of the multi-form structure (MFS) idea described by Armstrong, Jones, Berliner, and Pashley (1998). In an MFS CAT, the examinee first responds to a small fixed block of items. The items comprising…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Test Items
Mishra, Sanjaya; Sharma, Meenu; Sharma, Ramesh Chander; Singh, Alka; Thakur, Atul – Open Praxis, 2016
This paper describes the entire methodology for the development of a scale to measure Attitude towards Open Educational Resources (ATOER). Traditionally, it is observed that some teachers are more willing to share their work than others, indicating the need to understand teachers' psychological and behavioural determinants that influence use of…
Descriptors: Resource Units, Computer Uses in Education, Shared Resources and Services, Test Construction

