Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022
The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…
Descriptors: Specialists, Language Tests, Test Validity, College Faculty
O'Malley, Fran; Norton, Scott – American Institutes for Research, 2022
This paper provides the National Center for Education Statistics (NCES), National Assessment Governing Board (NAGB), and the National Assessment of Educational Progress (NAEP) community with information that may help maintain the validity and utility of the NAEP assessments for civics and U.S. history as revisions are planned to the NAEP…
Descriptors: National Competency Tests, United States History, Test Validity, Governing Boards
Tonekaboni, Fateme Roohani; Ravand, Hamdollah; Rezvani, Reza – International Journal of Language Testing, 2021
Investigating the processes underlying test performance is a major source of data supporting the explanation inference in the validity argument (Chapelle, 2021). One way of modeling the cognitive processes underlying test performance is by constructing a Q-matrix, which is essentially about summarizing the attributes explaining test-takers'…
Descriptors: Reading Comprehension, Reading Tests, High Stakes Tests, Inferences
Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017
Increasingly, content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…
Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education
Atalmis, Erkan Hasan – Journal of Education and Training Studies, 2016
Multiple-choice (MC) items are commonly used in high-stakes tests. Thus, each item of such tests should be meticulously constructed to increase the accuracy of decisions based on test results. Haladyna and his colleagues (2002) addressed the valid item-writing guidelines to construct high-quality MC items in order to increase test reliability and…
Descriptors: Foreign Countries, Guidelines, Compliance (Psychology), Difficulty Level
Wiley, Colby P.; Wedeking, Travis; Galindo, Addy M. – Journal of Psychoeducational Assessment, 2013
This article reviews the Conners Early Childhood (Conners EC; Conners, 2009), a behavior and development rating scale intended to assess children in early childhood, specifically defined as ages 2 to 6 years. Using multiple informants across multiple settings, the Conners EC is administered for the purpose of early identification of disorders or…
Descriptors: Test Reviews, Rating Scales, Developmental Delays, Disability Identification
Argüelles Álvarez, Irina – International Journal of English Studies, 2013
The new requirement placed on students in tertiary settings in Spain to demonstrate a B1 or a B2 proficiency level of English, in accordance with the Common European Framework of Reference for Languages (CEFRL), has led most Spanish universities to develop a program of certification or accreditation of the required level. The first part of this…
Descriptors: Language Proficiency, Multiple Choice Tests, Second Language Learning, Foreign Countries
Leal, Johanna P. – Latin American Journal of Content and Language Integrated Learning, 2016
Ongoing bilingual programs that disregard needs analysis, little research on the actual effects of CLIL in Colombia, and vague awareness or knowledge of the necessary considerations for effective CLIL programs underpin the need to address a particular curricular issue: summative assessment. This small-scale study takes place in a…
Descriptors: Science Instruction, Second Language Learning, Second Language Instruction, Language Proficiency
Sireci, Stephen G.; Faulkner-Bond, Molly – Review of Research in Education, 2015
Across the globe, educational tests are being used at a rapidly increasing rate. More recently, educational tests are being used to inform educational policy and for holding educators accountable for student learning. One reason educational assessments are used for these important purposes is that they are considered to provide reliable and…
Descriptors: English Language Learners, Accountability, Educational Testing, Student Evaluation
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items
Spool, Mark D. – 1975
Content validity is concerned with three components: (1) the job content; (2) the test content; and (3) the strength of the relationship between the two. A content validation study, to be considered adequate and defensible, should include at least the following four procedures: (1) A thorough and accurate job analysis (to define the job content);…
Descriptors: Content Analysis, Correlation, Evaluation, Guidelines
Conger, Anthony J.; And Others – 1976
A review of the literature on the validity and reliability of survey data is presented prior to an analysis of the reliability of selected questions in the Second Followup Questionnaire of the National Longitudinal Study of the High School Class of 1972 (NLS). The reliability study includes an evaluation of test-retest reliability as a function of…
Descriptors: Academic Ability, Data Analysis, Data Collection, Demography

