Publication Date
| In 2026 | 0 |
| Since 2025 | 40 |
| Since 2022 (last 5 years) | 185 |
| Since 2017 (last 10 years) | 468 |
| Since 2007 (last 20 years) | 916 |
Descriptor
Source
Author
| Sireci, Stephen G. | 10 |
| Briesch, Amy M. | 5 |
| McIntosh, Kent | 5 |
| Prasetyo, Zuhdan Kun | 5 |
| Volpe, Robert J. | 5 |
| Kartowagiran, Badrun | 4 |
| Mardapi, Djemari | 4 |
| Messick, Samuel | 4 |
| Siew, Nyet Moi | 4 |
| Abell, Neil | 3 |
| Barbera, Jack | 3 |
| More ▼ | |
Publication Type
Education Level
Location
| Turkey | 70 |
| Indonesia | 50 |
| Malaysia | 33 |
| Australia | 22 |
| China | 18 |
| Canada | 16 |
| India | 16 |
| United States | 14 |
| Texas | 13 |
| Hong Kong | 12 |
| Taiwan | 12 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedMessick, Samuel – American Psychologist, 1995
Presents a comprehensive review of validity that includes an empirical evaluation of the actual and potential consequences of score interpretation and use, how those consequences come about, and what determines them. Six distinguishable aspects of construct validity are highlighted as a means of addressing central issues implicit in the notion of…
Descriptors: Concurrent Validity, Construct Validity, Content Validity, Models
Peer reviewedBordage, Georges; And Others – Academic Medicine, 1995
Three related Canadian studies assessed the content validity of 59 clinical problems designed as part of a test of medical decision-making skills. Focus was on the key features, i.e., the critical or essential steps in identification and management of the clinical problem. Results support content validity of the key features. (MSE)
Descriptors: Clinical Teaching (Health Professions), Content Validity, Decision Making, Foreign Countries
Cisneros-Cohernour, Edith J. – Quality of Higher Education, 2005
This paper focuses on the validity of the research conducted under the leading paradigm in the evaluation of teaching in higher education. Messick's framework on validity is used to identify the strengths and limitations of the research, mostly centered on the study of student ratings of instruction. Critical issues that need to be addressed by…
Descriptors: Higher Education, Student Evaluation of Teacher Performance, Teacher Evaluation, Educational Quality
Siu, Andrew M. H.; Shek, Daniel T. L. – Research on Social Work Practice, 2005
Objectives: Psychometric properties of the Chinese version of the Interpersonal Reactivity Index (C-IRI) for the assessment of empathy in Chinese people were examined. Method: The Interpersonal Reactivity Index (IRI) was translated to Chinese, and an expert panel reviewed its content validity and cultural relevance. The translated instrument…
Descriptors: Cultural Relevance, Content Validity, Construct Validity, Factor Structure
Moradi, Bonnie; Subich, Linda Mezydlo – Counseling Psychologist, 2002
Reliability and validity of three current instruments (Feminist Identity Scale [FIS], Feminist Identity Development Scale [FIDS]J Feminist Identity Composite [FIC]) used to operationalize Downing and Roush's model of feminist identity development were compared. A sample of 245 women completed all three instruments, and a separate sample of 35…
Descriptors: Feminism, Social Desirability, Females, Content Validity
Morrison, James L.; Howell, Scott – Innovate: Journal of Online Education, 2007
Editor-in-Chief James L. Morrison interviews Scott Howell, the co-editor of a three-volume book series entitled "Online Assessment and Measurement" that was published in 2006 by IDEA Group. In discussing his own research, Howell first highlights the value of test blueprints as a valuable tool for ensuring an effective alignment of…
Descriptors: Student Evaluation, Testing, Exhibits, Teaching Methods
Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007
This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…
Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills
Fischer, Ronald G.; Fischer, Jerome M. – Alberta Journal of Educational Research, 2006
Based on theories of emotional intelligence, adult education, psychology of reading, and emotions and literature, this study was designed to develop and validate the Affective Response to Literature Survey (ARLS), a psychological instrument used to measure an emotional response to literature. Initially, 27 items were generated by a review of…
Descriptors: Psychometrics, Transformative Learning, Reader Response, Emotional Response
Hoover, Sheila J.; Abhaya, P. S. – 1995
Computer-based instruction could have considerable impact on improving the quality of science education. Simulations and interactive problems provide a means for students to explore scientific concepts and experiment without the expense or hazard of using actual materials. This paper focuses on the instructional design process as it relates to the…
Descriptors: Communication (Thought Transfer), Computer Assisted Instruction, Computer Simulation, Computer Software Evaluation
Solomon, Alan – 1987
A panel of expert referees from the Philadelphia school district categorized items from secondary-level standardized mathematics tests according to National Assessment of Educational Progress (NAEP) subobjectives for mathematics. The following tests were covered by the study: (1) California Achievement Tests (Levels 19 and 20); (2) Comprehensive…
Descriptors: Content Validity, Educational Objectives, High Schools, Mathematical Concepts
Tittle, Carol Kehr – 1986
Statewide minimum competency testing programs have emphasized basic skills in reading, mathematics, and writing. However, continuing concerns are expressed in national reports about the level of achievement, particularly in mathematics and science, and increased testing has been suggested as a means of encouraging curriculum change and evaluating…
Descriptors: Content Validity, Elementary School Curriculum, Elementary Secondary Education, Measurement Objectives
Lee, Kwangyhuyn; Weimer, Debbi – Education Policy Center at Michigan State University, 2002
Michigan is designing a new accountability system that combines high standards and statewide testing within a school accreditation framework. Sound assessment techniques are critical if the accountability system is to provide relevant information to schools and policymakers. One important component of a sound assessment system is measurement of…
Descriptors: Testing Programs, Academic Achievement, Program Effectiveness, Accountability
Peer reviewedSmith, I. Leon; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 1990
Implementing measurement specialists' ideas about content validity with licensure examinations and the problem of court litigation are discussed. Validity issues surfacing when sponsors of national licensure examinations conduct validity investigations are considered. Issues include local versus national focus on content validity, job analysis,…
Descriptors: Classification, Content Validity, Court Litigation, Job Analysis
Peer reviewedShea, Judy A.; And Others – Evaluation and the Health Professions, 1988
Item response theory's (IRT) suitability for medical examination data was examined, using data on 2,000 candidates who took the 1980 and 1982 American Board of Internal Medicine Certifying Examinations. Focus was on determining whether the tests met IRT assumptions and applying one-parameter and three-parameter IRT models to the data. (TJH)
Descriptors: Content Validity, Goodness of Fit, Graduate Medical Education, Guessing (Tests)
Howard, Elissa M.; Weiler, Robert M. – American Journal of Health Education, 2003
As America's public school population becomes more diverse, selecting culturally appropriate instructional materials becomes increasingly important. Though several existing scales assess the quality of curricula materials, most do not consider cultural relevance. The Curricula Appropriateness Scale (CAS) was designed to determine the…
Descriptors: Feedback (Response), Cultural Relevance, Content Validity, Measures (Individuals)

Direct link
