Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 7 |
| Since 2007 (last 20 years) | 20 |
Descriptor
| Classification | 68 |
| Test Construction | 68 |
| Computer Assisted Testing | 21 |
| Testing | 20 |
| Test Items | 18 |
| Test Validity | 13 |
| Testing Problems | 11 |
| Evaluation Methods | 10 |
| Foreign Countries | 10 |
| Educational Testing | 9 |
| Psychometrics | 9 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Elementary Secondary Education | 7 |
| Higher Education | 4 |
| Postsecondary Education | 4 |
| Elementary Education | 1 |
| Grade 2 | 1 |
| Grade 3 | 1 |
| High Schools | 1 |
| Secondary Education | 1 |
Audience
| Teachers | 1 |
Location
| United States | 3 |
| Canada | 2 |
| United Kingdom | 2 |
| Australia | 1 |
| Brazil | 1 |
| China | 1 |
| Japan | 1 |
| Kenya | 1 |
| Mexico | 1 |
| New Jersey | 1 |
| Spain | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
Cai, Liuhan; Albano, Anthony D.; Roussos, Louis A. – Measurement: Interdisciplinary Research and Perspectives, 2021
Multistage testing (MST), an adaptive test delivery mode that involves algorithmic selection of predefined item modules rather than individual items, offers a practical alternative to linear and fully computerized adaptive testing. However, interactions across stages between item modules and examinee groups can lead to challenges in item…
Descriptors: Adaptive Testing, Test Items, Item Response Theory, Test Construction
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Ketabi, Somaye; Alavi, Seyyed Mohammed; Ravand, Hamdollah – International Journal of Language Testing, 2021
Although Diagnostic Classification Models (DCMs) were introduced to education system decades ago, it seems that these models were not employed for the original aims upon which they had been designed. Using DCMs has been mostly common in analyzing large-scale non-diagnostic tests and these models have been rarely used in developing Cognitive…
Descriptors: Diagnostic Tests, Test Construction, Goodness of Fit, Classification
Munoz, Albert; Mackay, Jonathon – Journal of University Teaching and Learning Practice, 2019
Online testing is a popular practice for tertiary educators, largely owing to efficiency in automation, scalability, and capability to add depth and breadth to subject offerings. As with all assessments, designs need to consider whether student cheating may be inadvertently made easier and more difficult to detect. Cheating can jeopardise the…
Descriptors: Cheating, Test Construction, Computer Assisted Testing, Classification
Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021
MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…
Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students
Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018
In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…
Descriptors: Test Validity, Test Use, Test Construction, Language Tests
Maksimovic, Jelena; Petrovic, Jelena; Osmanovic, Jelena – Research in Pedagogy, 2015
The worldwide expansion of higher education introduced the problem of quality of knowledge that graduate students possess, as well as question whether they are competent to fulfill the requirements of their future profession. Education and training for professional work, in our educational system, is realized in various ways: through lectures,…
Descriptors: Teacher Competencies, Teacher Competency Testing, Test Construction, Preservice Teachers
Wyse, Adam E. – Applied Psychological Measurement, 2011
In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…
Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics
Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Gnambs, Timo; Batinic, Bernad – Educational and Psychological Measurement, 2011
Computer-adaptive classification tests focus on classifying respondents in different proficiency groups (e.g., for pass/fail decisions). To date, adaptive classification testing has been dominated by research on dichotomous response formats and classifications in two groups. This article extends this line of research to polytomous classification…
Descriptors: Test Length, Computer Assisted Testing, Classification, Test Items
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Laborda, Jesus Garcia; Bakieva, Margarita; Gonzalez-Such, Jose; Pavon, Ana Sevilla – Online Submission, 2010
Since the Spanish Educational system is changing and promoting the use of online tests, it was necessary to study the transformation of test items in the "Spanish University Entrance Examination" (IB P.A.U.) to diminish the effect of test delivery changes (through its computerization) in order to affect the least the current model. The…
Descriptors: Foreign Countries, College Entrance Examinations, Computer Assisted Testing, Test Items
Carlson, Janet F.; Benson, Nicholas; Oakland, Thomas – School Psychology International, 2010
Implications of the International Classification of Functioning, Disability and Health (ICF) on the development and use of tests in school settings are enumerated. We predict increased demand for behavioural assessments that consider a person's activities, participation and person-environment interactions, including measures that: (a) address…
Descriptors: Classification, Models, Test Construction, Test Use

Direct link
Peer reviewed
