Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 195 |
| Since 2017 (last 10 years) | 495 |
| Since 2007 (last 20 years) | 743 |
Descriptor
| Test Items | 1187 |
| Test Reliability | 1187 |
| Test Validity | 685 |
| Test Construction | 566 |
| Foreign Countries | 349 |
| Difficulty Level | 280 |
| Item Analysis | 253 |
| Psychometrics | 234 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 173 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 69 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Canel, Azize Nilgun – Educational Sciences: Theory and Practice, 2013
In this study, the process of developing the Marital Satisfaction Scale (MSS) aiming to support studies in the field of marital satisfaction and to obtain information about couples in a short time through psychological counseling is discussed. The scale including 101 yes-no items aiming to reveal couples' opinions about their marriages was…
Descriptors: Measures (Individuals), Marital Satisfaction, Parents, Child Rearing
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards
Razi, Salim – Online Submission, 2012
This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach as conventional reliability and validity measures superficially reveals the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
Descriptors: Foreign Countries, Undergraduate Students, Reading Tests, Test Validity
Kettler, Ryan J.; Rodriguez, Michael C.; Bolt, Daniel M.; Elliott, Stephen N.; Beddow, Peter A.; Kurz, Alexander – Applied Measurement in Education, 2011
Federal policy on alternate assessment based on modified academic achievement standards (AA-MAS) inspired this research. Specifically, an experimental study was conducted to determine whether tests composed of modified items would have the same level of reliability as tests composed of original items, and whether these modified items helped reduce…
Descriptors: Multiple Choice Tests, Test Items, Alternative Assessment, Test Reliability
Tseng, Mei-Hui; Fu, Chung-Pei; Wilson, Brenda N.; Hu, Fu-Chang – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The aim of this study was to adapt and evaluate the Developmental Coordination Disorder Questionnaire (DCDQ) for use in Chinese-speaking countries. A total of 1082 parents completed the DCDQ and 35 parents repeated it after 2 weeks for test-retest reliability. Two items were deleted after examination of test consistency. Cronbach's [alpha] for the…
Descriptors: Test Validity, Measures (Individuals), Psychometrics, Probability
Bozdogan, Aykut Emre; Uzoglu, Mustafa – Online Submission, 2012
The purpose of this study is to develop a reliable and a valid scale to determine the attitudes of the primary students towards tablet PC. The items of the scale were determined by scanning the relevant literature and taking the opinions of the experts. The first draft of the scale including 49 items as a result of content reliability was applied…
Descriptors: Test Construction, Attitude Measures, Student Attitudes, Elementary School Students
Alonzo, Julie; Gonzalez, Magaly; Tindal, Gerald – Behavioral Research and Teaching, 2013
In this study, we describe two studies used to select appropriate assessments to measure phonemic awareness, alphabetic principle, and fluency in the Spanish language for students receiving literacy instruction in Spanish. We first describe two studies in which we use linear regression and correlations to examine the appropriateness of different…
Descriptors: Curriculum Based Assessment, Spanish, Phonemic Awareness, Alphabets
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Dixon, Valerie E. – ProQuest LLC, 2011
The purpose of this study was to develop an instrument to assess probation officers knowledge levels of offenders with intellectual disabilities by utilizing a synthesis of subject matter analysis technique and a comprehensive review of literature. This study was conducted in two phases. The first phase was devoted to establishing the knowledge…
Descriptors: Caseworkers, Measures (Individuals), Mental Retardation, Knowledge Level
Bauer, Daniel; Holzer, Matthias; Kopp, Veronika; Fischer, Martin R. – Advances in Health Sciences Education, 2011
To compare different scoring algorithms for Pick-N multiple correct answer multiple-choice (MC) exams regarding test reliability, student performance, total item discrimination and item difficulty. Data from six 3rd year medical students' end of term exams in internal medicine from 2005 to 2008 at Munich University were analysed (1,255 students,…
Descriptors: Medical Students, Test Reliability, Internal Medicine, Scoring
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
Pae, Hye K.; Greenberg, Daphne; Morris, Robin D. – Language Assessment Quarterly, 2012
The aim of this study was to apply the Rasch model to an analysis of the psychometric properties of the Peabody Picture Vocabulary Test--III Form A (PPVT--IIIA) items with struggling adult readers. The PPVT--IIIA was administered to 229 African American adults whose isolated word reading skills were between third and fifth grades. Conformity of…
Descriptors: African Americans, Test Items, Construct Validity, Test Validity
Tasdemir, Mehmet – Journal of Instructional Psychology, 2010
This study aims at comparing the difficulty levels, discrimination powers and powers of testing achievement of multiple choice tests and true-false tests, and thus revealing the rightness or wrongness of the commonly believed hypothesis that multiple choice tests don't bear the same properties as true-false tests. The research was performed with…
Descriptors: Achievement Tests, Multiple Choice Tests, Objective Tests, Student Evaluation
Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010
In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability
Sandmann, Lorilee R.; Jordan, Jenny W.; Mull, Casey D.; Valentine, Thomas – Journal of Higher Education Outreach and Engagement, 2014
Community engagement professionals and partners serve as, work with, study, and build the capacity of boundary spanners. To augment knowledge about these functions, the Weerts-Sandmann Boundary Spanning Conceptual Framework (2010) has been operationalized through a survey instrument to examine community engagement boundary-spanning behaviors by…
Descriptors: Outreach Programs, Change Agents, Community Involvement, Employee Attitudes

Peer reviewed
Direct link
