Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 195 |
| Since 2017 (last 10 years) | 495 |
| Since 2007 (last 20 years) | 743 |
Descriptor
| Test Items | 1187 |
| Test Reliability | 1187 |
| Test Validity | 685 |
| Test Construction | 566 |
| Foreign Countries | 349 |
| Difficulty Level | 280 |
| Item Analysis | 253 |
| Psychometrics | 234 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 173 |
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 69 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
White, Diana L.; Newton-Curtis, Linda; Lyons, Karen S. – Gerontologist, 2008
Purpose: The purpose of the study was to empirically test items of a new measure designed to assess person-directed care (PDC) practices in long-term care. Design and Methods: After reviewing the literature, we identified five areas related to PDC: personhood, comfort care, autonomy, knowing the person, and support for relationships. We also…
Descriptors: Measures (Individuals), Health Services, Test Items, Test Reliability
Tella, Adeyinka – Journal of Information Technology Education, 2011
The suitability of 52 items for measuring Blackboard course management system success was investigated with the aim of validating the Blackboard CMS success scale in an educational context. Through a survey, the Blackboard course management system (BCMS) success scale was administered to 503 students at the University of Botswana. Data collected…
Descriptors: Electronic Learning, Management Systems, User Satisfaction (Information), Intention
Sato, Edynn; Rabinowitz, Stanley; Gallagher, Carole; Huang, Chun-Wei – National Center for Education Evaluation and Regional Assistance, 2010
This study examined the effect of linguistic modification on middle school students' ability to show what they know and can do on math assessments. REL West's study on middle school math assessment accommodations found that simplifying the language--or linguistic modification--on standardized math test items made it easier for English Language…
Descriptors: Test Items, Standardized Tests, Mathematics Tests, Testing Accommodations
Rae, Gordon – Psychological Methods, 2007
The relationship between stratified alpha (α_s) and the reliability of a test composed of interrelated nonhomogeneous items is examined. It is mathematically demonstrated that when there is congeneric equivalence within the strata or subtests, the difference between the coefficients is a function of the variances of the loadings within…
Descriptors: Test Reliability, Test Items, Computation, Error of Measurement
Siddiek, Ahmed Gumaa – English Language Teaching, 2010
Examinations--among other things--are tools of quality control by which we can measure the attainment of the national educational goals. High-quality examinations are means of evaluation that can help teachers modify their teaching techniques, as well as helping learners adjust their learning strategies. Examinations are also benchmarks that can…
Descriptors: Foreign Countries, Student Certification, Questionnaires, Test Validity
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests
Montgomery, Janine Marie; Newton, Brendan; Smith, Christiane – Journal of Psychoeducational Assessment, 2008
The Gilliam Autism Rating Scale-Second Edition (GARS-2) is a screening tool for autism spectrum disorders for individuals between the ages of 3 and 22. It was designed to help differentiate those with autism from those with severe behavioral disorders as well as from those who are typically developing. It is a norm-referenced instrument that…
Descriptors: Autism, Rating Scales, Test Reviews, Norm Referenced Tests
Matson, Johnny L.; Wilkins, Jonathan; Sevin, Jay A.; Knight, Cheryl; Boisjoli, Jessica A.; Sharp, Brenda – Research in Autism Spectrum Disorders, 2009
The success of early intervention programs has in large part spurred increasing emphasis on identifying children with autism and Pervasive Developmental Disorder-Not Otherwise Specified (PDD-NOS) at the earliest possible ages. National and international professional groups have called for early screening and diagnosis, yet the technology to…
Descriptors: Check Lists, Early Intervention, Autism, Infants
Test-Retest Reliability of a Theory of Mind Task Battery for Children with Autism Spectrum Disorders
Hutchins, Tiffany L.; Prelock, Patricia A.; Chace, Wendy – Focus on Autism and Other Developmental Disabilities, 2008
This study examined for the first time the test-retest reliability of theory-of-mind tasks when administered to children with Autism Spectrum Disorders (ASD). A total of 16 questions within 9 tasks targeting a range of content and complexity were administered at 2 times to 17 children with ASD. In all, 13 questions demonstrated adequate…
Descriptors: Autism, Response Style (Tests), Verbal Ability, Test Reliability
Price, Sarah Kye; Handrick, Sandii Leland – Research on Social Work Practice, 2009
Objectives: This study presents the design, implementation, and evaluation of a culturally relevant and responsive approach to screening for perinatal depression in low-income, predominantly African American women. Method: The study details the development of the community-informed instrument and subsequent evaluation of its psychometric…
Descriptors: Females, Factor Analysis, Psychometrics, Depression (Psychology)
Shujuan, Wang; Meihua, Qian; Jianxin, Zhang – Journal of Psychoeducational Assessment, 2009
This article examines the psychometric structure of the Anxiety Control Questionnaire (ACQ) in Chinese adolescents. With the data collected from 212 senior high school students (94 females, 110 males, 8 unknown), seven models are tested using confirmatory factor analyses in the framework of the multitrait-multimethod strategy. Results indicate…
Descriptors: Multitrait Multimethod Techniques, Factor Structure, Adolescents, Measures (Individuals)
Abedi, Jamal; Kao, Jenny C.; Leon, Seth; Sullivan, Lisa; Herman, Joan L.; Pope, Rita; Nambiar, Veena; Mastergeorge, Ann M. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2008
This study sought to explore factors that affect the accessibility of reading comprehension assessments for students with disabilities. The study consisted of testing students using reading comprehension passages that were broken down into shorter "segments" or "chunks." The results of the segmenting study indicated that: (a)…
Descriptors: Reading Comprehension, Disabilities, Reading Tests, Test Reliability
Ozmen, Haluk – Chemistry Education Research and Practice, 2008
This study aims to determine prospective science student teachers' alternative conceptions of the chemical equilibrium concept. A 13-item pencil and paper, two-tier multiple choice diagnostic instrument, the Test to Identify Students' Alternative Conceptions (TISAC), was developed and administered to 90 second-semester science student teachers…
Descriptors: Foreign Countries, Chemistry, Course Content, Student Teachers
Borsboom, Denny; Romeijn, Jan-Willem; Wicherts, Jelte M. – Psychological Methods, 2008
This article shows that measurement invariance (defined in terms of an invariant measurement model in different groups) is generally inconsistent with selection invariance (defined in terms of equal sensitivity and specificity across groups). In particular, when a unidimensional measurement instrument is used and group differences are present in…
Descriptors: Test Items, Minority Groups, Measurement, Scores
Lee, Won-Chan – Applied Psychological Measurement, 2007
This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…
Descriptors: Simulation, Error of Measurement, Scoring, Test Items

