Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 12 |
Descriptor
| Test Validity | 10 |
| Test Construction | 5 |
| Foreign Countries | 4 |
| Models | 4 |
| Test Reliability | 4 |
| Validity | 4 |
| Psychometrics | 3 |
| Student Evaluation | 3 |
| Computer Assisted Testing | 2 |
| Construct Validity | 2 |
| Cutting Scores | 2 |
| More ▼ | |
Source
| International Journal of… | 16 |
Author
| Buckendahl, Chad W. | 2 |
| Aidman, Eugene V. | 1 |
| Amy Clark | 1 |
| Arce, Alvaro J. | 1 |
| Bartram, Dave | 1 |
| Beck, Klaus | 1 |
| Bonner, Cavan V. | 1 |
| Carlson, Laurie | 1 |
| Davis-Becker, Susan L. | 1 |
| Faulkner-Bond, Molly | 1 |
| Geisinger, Kurt F. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 16 |
| Reports - Descriptive | 16 |
Education Level
| Higher Education | 4 |
| Elementary Education | 2 |
| Postsecondary Education | 2 |
| Secondary Education | 2 |
| Elementary Secondary Education | 1 |
| Grade 10 | 1 |
| Grade 11 | 1 |
| Grade 12 | 1 |
| High Schools | 1 |
Audience
| Practitioners | 1 |
| Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| International English… | 1 |
| Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Arce, Alvaro J.; Young, Michael J. – International Journal of Testing, 2022
The paper argues that contemporary test validity theory places the consequences of testing on the lives of all college applicants at the back of the test validation argument. It introduces the notion of test efficacy as a process to gather evidence on claims on consequences of testing on all college applicants that can be traced back to validity.…
Descriptors: Test Validity, Test Theory, College Applicants, College Entrance Examinations
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Shavelson, Richard J.; Zlatkin-Troitschanskaia, Olga; Beck, Klaus; Schmidt, Susanne; Marino, Julian P. – International Journal of Testing, 2019
Following employers' criticisms and recent societal developments, policymakers and educators have called for students to develop a range of generic skills such as critical thinking ("twenty-first century skills"). So far, such skills have typically been assessed by student self-reports or with multiple-choice tests. An alternative…
Descriptors: Critical Thinking, Cognitive Tests, Performance Based Assessment, Student Evaluation
Sabatini, John; O'Reilly, Tenaha; Weeks, Jonathan; Wang, Zuowei – International Journal of Testing, 2020
The construct of reading comprehension has changed significantly in the twenty-first century; however, some test designs have not evolved sufficiently to capture these changes. Specifically, the nature of literacy sources and skills required has changed (wrought primarily by widespread use of digital technologies). Modern theories of comprehension…
Descriptors: Reading Comprehension, Reading Tests, Vignettes, Test Construction
Faulkner-Bond, Molly; Sireci, Stephen G. – International Journal of Testing, 2015
Throughout the world, tests are administered to some examinees who are not fully proficient in the language in which they are being tested. It has long been acknowledged that proficiency in the language in which a test is administered often affects examinees' performance on a test. Depending on the context and intended uses for a particular…
Descriptors: Language Minorities, Test Validity, Language Proficiency, Test Construction
Sue Bechard; Amy Clark; Russell Swinburne Romine; Meagan Karvonen; Neal Kingston; Karen Erickson – International Journal of Testing, 2019
Evidence-based approaches to assessment design, development, and administration provide a strong foundation for an assessment's validity argument but can be time consuming, resource intensive, and complex to implement. This article describes an evidence-based approach used for one assessment that addresses these challenges. Evidence-centered…
Descriptors: Evidence Based Practice, Test Construction, Test Validity, Measurement
Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013
A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…
Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores
Lim, Gad S.; Geranpayeh, Ardeshir; Khalifa, Hanan; Buckendahl, Chad W. – International Journal of Testing, 2013
Standard setting theory has largely developed with reference to a typical situation, determining a level or levels of performance for one exam for one context. However, standard setting is now being used with international reference frameworks, where some parameters and assumptions of classical standard setting do not hold. We consider the…
Descriptors: Standard Setting (Scoring), Validity, Models, Language Tests
Roivainen, Eka – International Journal of Testing, 2013
To study the concept of national IQ profile, we compared U.S. and Finnish WAIS, WAIS-R, and WAIS III nonverbal and working memory subtest norms. The U.S. standardization samples had consistently higher scores on the Coding and Digit span subtests, while the Finnish samples had higher scores on the Block design subtest. No stable cross-national…
Descriptors: Intelligence Tests, Profiles, Cultural Influences, Nonverbal Tests
Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012
In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…
Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach
Geisinger, Kurt F. – International Journal of Testing, 2012
This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…
Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach
Govaerts, Sophie; Gregoire, Jacques – International Journal of Testing, 2008
This article describes the development and two studies on the construct validity of the Academic Emotions Scale (AES). The AES is a French self-report questionnaire assessing six emotions in the context of school learning: enjoyment, hope, pride, anxiety, shame and frustration. Its construct validity was studied through exploratory and…
Descriptors: Construct Validity, Test Validity, Factor Structure, Measures (Individuals)
Peer reviewedTurner, Ronna C.; Carlson, Laurie – International Journal of Testing, 2003
Item-objective congruence as developed by R. Rovinelli and R. Hambleton is used in test development for evaluating content validity at the item development stage. Provides a mathematical extension to the Rovinelli and Hambleton index that is applicable for the multidimensional case. (SLD)
Descriptors: Content Validity, Test Construction, Test Content, Test Items
Peer reviewedZimmerman, Donald W.; Zumbo, Bruno D. – International Journal of Testing, 2001
Presents a model of tests and measurement that identifies test scores with Hilbert space vectors and true and error components of scores with linear operators. This geometric point of view brings to light relations among elementary concepts in test theory, including reliability, validity, and parallel tests. (Author/SLD)
Descriptors: Models, Probability, Reliability, Scores
Peer reviewedRuhe, Valerie – International Journal of Testing, 2002
Demonstrates how the framework provided by S. Messick (1988) provides a set of lenses with which to explore issues in the validation of small-scale assessments in new technology-mediated environments. In technology-based distributed learning, the conception of validity will not change, but validation practice will be different. (SLD)
Descriptors: Distance Education, Educational Assessment, Educational Technology, Models
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
