Publication Date
| In 2026 | 0 |
| Since 2025 | 40 |
| Since 2022 (last 5 years) | 185 |
| Since 2017 (last 10 years) | 468 |
| Since 2007 (last 20 years) | 916 |
Descriptor
Source
Author
| Sireci, Stephen G. | 10 |
| Briesch, Amy M. | 5 |
| McIntosh, Kent | 5 |
| Prasetyo, Zuhdan Kun | 5 |
| Volpe, Robert J. | 5 |
| Kartowagiran, Badrun | 4 |
| Mardapi, Djemari | 4 |
| Messick, Samuel | 4 |
| Siew, Nyet Moi | 4 |
| Abell, Neil | 3 |
| Barbera, Jack | 3 |
| More ▼ | |
Publication Type
Education Level
Location
| Turkey | 70 |
| Indonesia | 50 |
| Malaysia | 33 |
| Australia | 22 |
| China | 18 |
| Canada | 16 |
| India | 16 |
| United States | 14 |
| Texas | 13 |
| Hong Kong | 12 |
| Taiwan | 12 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Cisneros-Cohernour, Edith J. – Quality of Higher Education, 2005
This paper focuses on the validity of the research conducted under the leading paradigm in the evaluation of teaching in higher education. Messick's framework on validity is used to identify the strengths and limitations of the research, mostly centered on the study of student ratings of instruction. Critical issues that need to be addressed by…
Descriptors: Higher Education, Student Evaluation of Teacher Performance, Teacher Evaluation, Educational Quality
Zhang, James J.; Lam, Eddie T. C.; Smith, Dennis W.; Fleming, David S.; Connaughton, Dan P. – Measurement in Physical Education and Exercise Science, 2006
The purpose of this study was to develop the Scale for Program Facilitators (SPF) to assess the effectiveness of after school achievement programs through four steps: (a) identification of a theoretical framework, (b) formulation of the initial scale, (c) test of content validity, and (d) conducting confirmatory factor analyses (CFA). A…
Descriptors: Content Validity, After School Programs, Measures (Individuals), Program Effectiveness
Siu, Andrew M. H.; Shek, Daniel T. L. – Research on Social Work Practice, 2005
Objectives: Psychometric properties of the Chinese version of the Interpersonal Reactivity Index (C-IRI) for the assessment of empathy in Chinese people were examined. Method: The Interpersonal Reactivity Index (IRI) was translated to Chinese, and an expert panel reviewed its content validity and cultural relevance. The translated instrument…
Descriptors: Cultural Relevance, Content Validity, Construct Validity, Factor Structure
Moradi, Bonnie; Subich, Linda Mezydlo – Counseling Psychologist, 2002
Reliability and validity of three current instruments (Feminist Identity Scale [FIS], Feminist Identity Development Scale [FIDS]J Feminist Identity Composite [FIC]) used to operationalize Downing and Roush's model of feminist identity development were compared. A sample of 245 women completed all three instruments, and a separate sample of 35…
Descriptors: Feminism, Social Desirability, Females, Content Validity
Hula, William; Doyle, Patrick J.; McNeil, Malcolm R.; Mikolic, Joseph M. – Journal of Speech, Language, and Hearing Research, 2006
The purpose of this research was to examine the validity of the 55-item Revised Token Test (RTT) and to compare traditional and Rasch-based scores in their ability to detect group differences and change over time. The 55-item RTT was administered to 108 left- and right-hemisphere stroke survivors, and the data were submitted to Rasch analysis.…
Descriptors: Test Items, Brain Hemisphere Functions, Individual Differences, Difficulty Level
Morrison, James L.; Howell, Scott – Innovate: Journal of Online Education, 2007
Editor-in-Chief James L. Morrison interviews Scott Howell, the co-editor of a three-volume book series entitled "Online Assessment and Measurement" that was published in 2006 by IDEA Group. In discussing his own research, Howell first highlights the value of test blueprints as a valuable tool for ensuring an effective alignment of…
Descriptors: Student Evaluation, Testing, Exhibits, Teaching Methods
Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007
This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…
Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills
Hoover, Sheila J.; Abhaya, P. S. – 1995
Computer-based instruction could have considerable impact on improving the quality of science education. Simulations and interactive problems provide a means for students to explore scientific concepts and experiment without the expense or hazard of using actual materials. This paper focuses on the instructional design process as it relates to the…
Descriptors: Communication (Thought Transfer), Computer Assisted Instruction, Computer Simulation, Computer Software Evaluation
Solomon, Alan – 1987
A panel of expert referees from the Philadelphia school district categorized items from secondary-level standardized mathematics tests according to National Assessment of Educational Progress (NAEP) subobjectives for mathematics. The following tests were covered by the study: (1) California Achievement Tests (Levels 19 and 20); (2) Comprehensive…
Descriptors: Content Validity, Educational Objectives, High Schools, Mathematical Concepts
Tittle, Carol Kehr – 1986
Statewide minimum competency testing programs have emphasized basic skills in reading, mathematics, and writing. However, continuing concerns are expressed in national reports about the level of achievement, particularly in mathematics and science, and increased testing has been suggested as a means of encouraging curriculum change and evaluating…
Descriptors: Content Validity, Elementary School Curriculum, Elementary Secondary Education, Measurement Objectives
Lee, Kwangyhuyn; Weimer, Debbi – Education Policy Center at Michigan State University, 2002
Michigan is designing a new accountability system that combines high standards and statewide testing within a school accreditation framework. Sound assessment techniques are critical if the accountability system is to provide relevant information to schools and policymakers. One important component of a sound assessment system is measurement of…
Descriptors: Testing Programs, Academic Achievement, Program Effectiveness, Accountability
Peer reviewedSmith, I. Leon; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 1990
Implementing measurement specialists' ideas about content validity with licensure examinations and the problem of court litigation are discussed. Validity issues surfacing when sponsors of national licensure examinations conduct validity investigations are considered. Issues include local versus national focus on content validity, job analysis,…
Descriptors: Classification, Content Validity, Court Litigation, Job Analysis
Peer reviewedShea, Judy A.; And Others – Evaluation and the Health Professions, 1988
Item response theory's (IRT) suitability for medical examination data was examined, using data on 2,000 candidates who took the 1980 and 1982 American Board of Internal Medicine Certifying Examinations. Focus was on determining whether the tests met IRT assumptions and applying one-parameter and three-parameter IRT models to the data. (TJH)
Descriptors: Content Validity, Goodness of Fit, Graduate Medical Education, Guessing (Tests)
Howard, Elissa M.; Weiler, Robert M. – American Journal of Health Education, 2003
As America's public school population becomes more diverse, selecting culturally appropriate instructional materials becomes increasingly important. Though several existing scales assess the quality of curricula materials, most do not consider cultural relevance. The Curricula Appropriateness Scale (CAS) was designed to determine the…
Descriptors: Feedback (Response), Cultural Relevance, Content Validity, Measures (Individuals)
Siebert, Darcy Clay; Siebert, Carl F. – Research on Social Work Practice, 2005
Objective: This article reports the validation of the Caregiver Role Identity Scale, designed to measure the prominence of helping professionals' identity as personal and professional caregivers. The authors developed the measure to test its application to burnout, depression, and professional impairment among social workers. Method: Data from a…
Descriptors: Content Validity, Caregiver Role, Identification, Measures (Individuals)

Direct link
