Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 195 |
| Since 2017 (last 10 years) | 495 |
| Since 2007 (last 20 years) | 743 |
Descriptor
| Test Items | 1187 |
| Test Reliability | 1187 |
| Test Validity | 685 |
| Test Construction | 566 |
| Foreign Countries | 349 |
| Difficulty Level | 280 |
| Item Analysis | 253 |
| Psychometrics | 234 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 173 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 69 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012
The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…
Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten
Peoples, Shelagh – ProQuest LLC, 2012
The purpose of this study was to determine which of three competing models will provide, reliable, interpretable, and responsive measures of elementary students' understanding of the nature of science (NOS). The Nature of Science Instrument-Elementary (NOSI-E), a 28-item Rasch-based instrument, was used to assess students' NOS…
Descriptors: Scientific Principles, Science Tests, Elementary School Students, Item Response Theory
Grady, Melissa D.; Rose, Roderick A. – Journal of Interpersonal Violence, 2011
This article examines the analysis of the psychometric properties, including the validity and reliability, of the Empathy Index (EI), a new instrument designed to measure empathy deficits of sex offenders. The EI was tested with a sample of 158 sex offenders incarcerated in North Carolina prisons. An exploratory factor analysis yielded three…
Descriptors: Aggression, Correctional Institutions, Test Validity, Factor Analysis
Ellinoudis, Theodoros; Evaggelinou, Christina; Kourtessis, Thomas; Konstantinidou, Zoe; Venetsanou, Fotini; Kambas, Antonis – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
The purpose of this study was to examine specific aspects of the reliability and validity of age band 1 of the Movement Assessment Battery for Children-Second Edition (MABC-2) (Henderson, Sugden, & Barnett, 2007) in Greek preschool children. One hundred and eighty-three children participated in the study; the children ranged in age from 36 to…
Descriptors: Test Validity, Preschool Children, Program Effectiveness, Factor Analysis
Morrow, James R., Jr.; Martin, Scott B.; Jackson, Allen W. – Research Quarterly for Exercise and Sport, 2010
The purpose of this study was to investigate the quality (reliability and validity) of large-scale fitness testing in Texas and determine if reliabilities and validities were related to potential confounding variables. Four test administration scenarios were conducted to investigate the quality of data collected statewide as part of the Texas…
Descriptors: Health Related Fitness, Tests, Test Validity, Test Reliability
Edwards, Michael C.; Cheavens, Jennifer S.; Heiy, Jane E.; Cukrowicz, Kelly C. – Psychological Assessment, 2010
The Center for Epidemiologic Studies Depression Scale (CES-D) is one of the most widely used measures of depressive symptoms in research today. The original psychometric work in support of the CES-D (Radloff, 1977) described a 4-factor model underlying the 20 items on the scale. Despite a long history of evidence supporting this structure,…
Descriptors: Test Items, Factor Structure, Measures (Individuals), Psychometrics
Matthews, Percival; Rittle-Johnson, Bethany; McEldoon, Katherine; Taylor, Roger – Journal for Research in Mathematics Education, 2012
Knowledge of the equal sign as an indicator of mathematical equality is foundational to children's mathematical development and serves as a key link between arithmetic and algebra. The current findings reaffirmed a past finding that diverse items can be integrated onto a single scale, revealed the wide variability in children's knowledge of the…
Descriptors: Symbols (Mathematics), Elementary School Students, Mathematics Tests, Test Items
Koizumi, Rie; Sakai, Hideki; Ido, Takahiro; Ota, Hiroshi; Hayama, Megumi; Sato, Masatoshi; Nemoto, Akiko – Language Assessment Quarterly, 2011
This article reports on the development and validation of the English Diagnostic Test of Grammar (EDiT Grammar) for Japanese learners of English. From among the many aspects of grammar, this test focuses on the knowledge of basic English noun phrases (NPs), especially their internal structures, because previous research has indicated the…
Descriptors: Nouns, Diagnostic Tests, English (Second Language), Second Language Learning
Keller, Christopher M.; Kros, John F. – Marketing Education Review, 2011
Measures of survey reliability are commonly addressed in marketing courses. One statistic of reliability is "Cronbach's alpha." This paper presents an application of survey reliability as a reflexive application of multiple-choice exam validation. The application provides an interactive decision support system that incorporates survey item…
Descriptors: Test Validity, Marketing, Test Reliability, Multiple Choice Tests
Sood, Vishal – Journal on Educational Psychology, 2013
For identifying children with four major kinds of verbal learning disabilities viz. reading disability, speech and language comprehension disability, writing disability and mathematics disability, the present task was undertaken to construct and standardize verbal learning disabilities checklist. This checklist was developed by keeping in view the…
Descriptors: Verbal Learning, Learning Disabilities, Children, Disability Identification
Squires, Jane K.; Waddell, Misti L.; Clifford, Jantina R.; Funk, Kristin; Hoselton, Robert M.; Chen, Ching-I – Topics in Early Childhood Special Education, 2013
Psychometric and utility studies on Social Emotional Assessment Measure (SEAM), an innovative tool for assessing and monitoring social-emotional and behavioral development in infants and toddlers with disabilities, were conducted. The Infant and Toddler SEAM intervals were the study focus, using mixed methods, including item response theory…
Descriptors: Psychometrics, Evaluation Methods, Social Development, Emotional Development
Porter, Andrew C.; Polikoff, Morgan S.; Goldring, Ellen; Murphy, Joseph; Elliott, Stephen N.; May, Henry – Educational Administration Quarterly, 2010
Research has consistently shown that principal leadership matters for successful schools. Evaluating principals on the behaviors shown to improve student learning should be an important leverage point for raising leadership quality. Yet principals are often evaluated with the use of instruments with no theoretical background and little, if any,…
Descriptors: Psychometrics, Instructional Leadership, Principals, Test Construction
Marson, Stephen M.; DeAngelis, Donna; Mittal, Nisha – Research on Social Work Practice, 2010
Objectives: The purpose of this article is to create transparency for the psychometric methods employed for the development of the Association of Social Work Boards' (ASWB) exams. Results: The article includes an assessment of the macro (political) and micro (statistical) environments of testing social work competence. The seven-step process used…
Descriptors: Content Validity, Test Validity, Psychometrics, Social Work
Yoon, So Yoon – ProQuest LLC, 2011
Working under classical test theory (CTT) and item response theory (IRT) frameworks, this study investigated psychometric properties of the Revised Purdue Spatial Visualization Tests: Visualization of Rotations (Revised PSVT:R). The original version, the PSVT:R was designed by Guay (1976) to measure spatial visualization ability in…
Descriptors: Undergraduate Students, Test Bias, Guessing (Tests), Construct Validity
Bulut, Okan; Kan, Adnan – Eurasian Journal of Educational Research, 2012
Problem Statement: Computerized adaptive testing (CAT) is a sophisticated and efficient way of delivering examinations. In CAT, items for each examinee are selected from an item bank based on the examinee's responses to the items. In this way, the difficulty level of the test is adjusted based on the examinee's ability level. Instead of…
Descriptors: Adaptive Testing, Computer Assisted Testing, College Entrance Examinations, Graduate Students

Peer reviewed
Direct link
