Publication Date
| In 2026 | 0 |
| Since 2025 | 55 |
| Since 2022 (last 5 years) | 197 |
| Since 2017 (last 10 years) | 497 |
| Since 2007 (last 20 years) | 745 |
Descriptor
| Test Items | 1189 |
| Test Reliability | 1189 |
| Test Validity | 687 |
| Test Construction | 567 |
| Foreign Countries | 349 |
| Difficulty Level | 280 |
| Item Analysis | 253 |
| Psychometrics | 236 |
| Item Response Theory | 219 |
| Factor Analysis | 184 |
| Multiple Choice Tests | 173 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 69 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Smarter Balanced Assessment Consortium, 2020
The Smarter Balanced Assessment Consortium (Smarter Balanced) strives to provide every student with a positive and productive assessment experience, generating results that are a fair and accurate estimate of each student's achievement. Further, Smarter Balanced is building on a framework of accessibility for all students, including English…
Descriptors: Student Evaluation, Evaluation Methods, English Language Learners, Students with Disabilities
Dutt, Anuradha; Tan, Marilyn; Alagumalai, Sivakumar; Nair, Rahul – Journal of Autism and Developmental Disorders, 2019
Functional Behavior Assessment (FBA) and behavior interventions have been effective in the management of challenging behavior among children with developmental disabilities including autism spectrum disorders. Research suggests the need for valid measurement instruments for verifying, calibrating and scoring competence in FBA and behavior…
Descriptors: Program Development, Program Validation, Functional Behavioral Assessment, Intervention
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
Yao, Yuan – ProQuest LLC, 2019
Under the framework of item response theory (IRT) and generalizability (G-) theory, this study examined the effects of item difficulty on rating reliability and construct validity on both the constructed-response (CR) items and essay items on English examinations. The data collected for this study were students' scores and responses on the two…
Descriptors: Foreign Countries, College Students, Second Language Learning, English (Second Language)
Rowe, Elizabeth; Asbell-Clarke, Jodi; Almeda, Mia; Gasca, Santiago; Edwards, Teon; Bardar, Erin; Shute, Valerie; Ventura, Matthew – International Journal of Computer Science Education in Schools, 2021
The Inclusive Assessment of Computational Thinking (CT) designed for accessibility and learner variability was studied in over 50 classes in US schools (grades 3-8). The validation studies of IACT sampled thousands of students to establish IACT's construct and concurrent validity as well as test-retest reliability. IACT items for each CT practice…
Descriptors: Puzzles, Logical Thinking, Thinking Skills, Construct Validity
Ahmad, Jamilah; Siew, Nyet Moi – Journal of Baltic Science Education, 2021
There are limited research studies about the development of test instrument to assess the level of entrepreneurial thinking among children in STEM education. The purpose of this research was to develop an Entrepreneurial Science Thinking Test (ESTT) for primary school children in STEM Education and evaluate its validity and reliability. The ESTT…
Descriptors: Test Construction, Children, Entrepreneurship, Thinking Skills
McClain, Maryellen Brunson; Harris, Bryn; Schwartz, Sarah E.; Golson, Megan E. – Journal of Psychoeducational Assessment, 2021
Although the racial/ethnic demographics in the United States are changing, few studies evaluate the cultural and linguistic responsiveness of commonly used autism spectrum disorder screening and diagnostic assessment measures. The purpose of this study is to evaluate item and test functioning of the Autism Spectrum Rating Scales (ASRS) in a sample…
Descriptors: Autism, Pervasive Developmental Disorders, Rating Scales, Test Bias
Nebraska Department of Education, 2021
This technical report documents the processes and procedures implemented to support the Spring 2021 Nebraska Student-Centered Assessment System (NSCAS) Phase I Pilot in English Language Arts (ELA), Mathematics, and Science assessments by NWEA® under the supervision of the Nebraska Department of Education (NDE). The technical report shows how the…
Descriptors: Psychometrics, Standard Setting, English, Language Arts
Zaidi, Nikki L.; Swoboda, Christopher M.; Kelcey, Benjamin M.; Manuel, R. Stephen – Advances in Health Sciences Education, 2017
The extant literature has largely ignored a potentially significant source of variance in multiple mini-interview (MMI) scores by "hiding" the variance attributable to the sample of attributes used on an evaluation form. This potential source of hidden variance can be defined as rating items, which typically comprise an MMI evaluation…
Descriptors: Interviews, Scores, Generalizability Theory, Monte Carlo Methods
Byram, Jessica N.; Seifert, Mark F.; Brooks, William S.; Fraser-Cotlin, Laura; Thorp, Laura E.; Williams, James M.; Wilson, Adam B. – Anatomical Sciences Education, 2017
With integrated curricula and multidisciplinary assessments becoming more prevalent in medical education, there is a continued need for educational research to explore the advantages, consequences, and challenges of integration practices. This retrospective analysis investigated the number of items needed to reliably assess anatomical knowledge in…
Descriptors: Anatomy, Science Tests, Test Items, Test Reliability
Denizli, Zeynep Akkurt; Erdogan, Abdulkadir – Journal on Mathematics Education, 2018
This study aimed to develop a three-dimensional geometric thinking test to determine the geometric thinking of early graders in the paper-pencil environment. First, we determined the components of three-dimensional geometric thinking and prepared questions for each component. Then, we conducted the pilot studies of the test at three stages in six…
Descriptors: Geometry, Mathematics Instruction, Spatial Ability, Teaching Methods
Perkins, Kyle; Frank, Eva – Online Submission, 2018
This paper presents item-analysis data to illustrate how to identify a set of internally consistent test items that differentiate or discriminate among examinees who are highly proficient and nonproficient on the construct of interest. Suggestions for analyzing the quality of test items are offered as well as a pedagogical approach to augment the…
Descriptors: Item Analysis, Test Items, Test Reliability, Kinetics
Chamoy, Waritsa – ProQuest LLC, 2018
The main purpose of this study was to conduct a validation analysis of student surveys of teaching effectiveness implemented at Bangkok University, Thailand. This study included three phases; survey development, a pilot study, and a full implementation study. Four sources of validity evidence were collected to support intended interpretations and…
Descriptors: Foreign Countries, Psychometrics, Student Surveys, College Students
Cromley, Jennifer G.; Dai, Ting; Fechter, Tia; Nelson, Frank E.; Van Boekel, Martin; Du, Yang – Grantee Submission, 2021
Making inferences and reasoning with new scientific information is critical for successful performance in biology coursework. Thus, identifying students who are weak in these skills could allow the early provision of additional support and course placement recommendations to help students develop their reasoning abilities, leading to better…
Descriptors: Science Tests, Multiple Choice Tests, Logical Thinking, Inferences
DiStefano, Christine; Barth, Steven G.; Greer, Fred – Journal of Psychoeducational Assessment, 2019
This study investigated the effect of item position on descriptive statistics, psychometric information, and factor structure of the Pediatric Symptoms Checklist 17-item social-emotional screening instrument (PSC-17). The goal was to determine whether item position, either grouped by factor or mixed across constructs, produced similar results.…
Descriptors: Check Lists, Test Items, Factor Structure, Screening Tests

Peer reviewed
Direct link
