Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 7 |
| Since 2007 (last 20 years) | 11 |
Descriptor
| Guidelines | 18 |
| Test Format | 18 |
| Test Items | 18 |
| Test Construction | 7 |
| Difficulty Level | 5 |
| Student Evaluation | 5 |
| Comparative Analysis | 4 |
| Foreign Countries | 4 |
| Language Tests | 4 |
| Multiple Choice Tests | 4 |
| Rating Scales | 4 |
Author
| Algina, James | 1 |
| Ascalon, M. Evelina | 1 |
| Braem, Penny Boyes | 1 |
| Burton, J. Dylan | 1 |
| Choe, Edison M. | 1 |
| Crespo, L. M. | 1 |
| David, Gergely | 1 |
| Davis, Bruce W. | 1 |
| Ebling, Sarah | 1 |
| Erickson, Harley E. | 1 |
| Galbraith, R. M. | 1 |
Publication Type
| Journal Articles | 11 |
| Reports - Research | 9 |
| Guides - Non-Classroom | 3 |
| Reports - Descriptive | 2 |
| Reports - Evaluative | 2 |
| Dissertations/Theses -… | 1 |
| Guides - General | 1 |
| Reference Materials -… | 1 |
Education Level
| High Schools | 2 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 12 | 1 |
| Grade 4 | 1 |
| Grade 8 | 1 |
| High School Equivalency… | 1 |
| Higher Education | 1 |
| Intermediate Grades | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
Audience
| Practitioners | 5 |
| Administrators | 3 |
| Community | 1 |
| Teachers | 1 |
Location
| China | 1 |
| European Union | 1 |
| Georgia | 1 |
| Hungary | 1 |
| Switzerland | 1 |
| Turkey | 1 |
| Vietnam | 1 |
Laws, Policies, & Programs
| Job Training Partnership Act… | 1 |
Assessments and Surveys
| General Educational… | 1 |
| National Assessment of… | 1 |
Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022
In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…
Descriptors: Scores, Item Response Theory, Test Items, Test Format
Huang, Hung-Yu – Educational and Psychological Measurement, 2023
The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…
Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making
Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023
Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…
Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level
Burton, J. Dylan – Language Assessment Quarterly, 2023
The effects of question or task complexity on second language speaking have traditionally been investigated using complexity, accuracy, and fluency measures. Response processes in speaking tests, however, may manifest in other ways, such as through nonverbal behavior. Eye behavior, in the form of averted gaze or blinking frequency, has been found…
Descriptors: Oral Language, Speech Communication, Language Tests, Eye Movements
Haug, Tobias; Ebling, Sarah; Braem, Penny Boyes; Tissi, Katja; Sidler-Miserez, Sandra – Language Education & Assessment, 2019
In German Switzerland the learning and assessment of Swiss German Sign Language ("Deutschschweizerische Gebärdensprache," DSGS) takes place in different contexts, for example, in tertiary education or in continuous education courses. By way of the still ongoing implementation of the Common European Framework of Reference for DSGS,…
Descriptors: German, Sign Language, Language Tests, Test Items
Karagöl, Efecan – Journal of Language and Linguistic Studies, 2020
Turkish and Foreign Languages Research and Application Center (TÖMER) is one of the important institutions for learning Turkish as a foreign language. In these institutions, proficiency tests are applied at the end of each level. However, test applications in TÖMERs vary between each center as there is no shared program in teaching Turkish as a…
Descriptors: Language Tests, Turkish, Language Proficiency, Second Language Learning
National Assessment Governing Board, 2017
The National Assessment of Educational Progress (NAEP) is the only continuing and nationally representative measure of trends in academic achievement of U.S. elementary and secondary school students in various subjects. For more than four decades, NAEP assessments have been conducted periodically in reading, mathematics, science, writing, U.S.…
Descriptors: Mathematics Achievement, Multiple Choice Tests, National Competency Tests, Educational Trends
GED Testing Service, 2016
This guide is designed to help adult educators and administrators better understand the content of the GED® test. This guide is tailored to each test subject and highlights the test's item types, assessment targets, and guidelines for how items will be scored. This 2016 edition has been updated to include the most recent information about the…
Descriptors: Guidelines, Teaching Guides, High School Equivalency Programs, Test Items
Sunnassee, Devdass – ProQuest LLC, 2011
Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…
Descriptors: Test Length, Test Format, Sample Size, Simulation
Penfield, Randall D.; Algina, James – Journal of Educational Measurement, 2006
One approach to measuring unsigned differential test functioning is to estimate the variance of the differential item functioning (DIF) effect across the items of the test. This article proposes two estimators of the DIF effect variance for tests containing dichotomous and polytomous items. The proposed estimators are direct extensions of the…
Descriptors: Test Bias, Test Format, Test Items, Simulation
Ascalon, M. Evelina; Meyers, Lawrence S.; Davis, Bruce W.; Smits, Niels – Applied Measurement in Education, 2007
This article examined two item-writing guidelines: the format of the item stem and homogeneity of the answer set. Answering the call of Haladyna, Downing, and Rodriguez (2002) for empirical tests of item writing guidelines and extending the work of Smith and Smith (1988) on differential use of item characteristics, a mock multiple-choice driver's…
Descriptors: Guidelines, Difficulty Level, Standard Setting, Driver Education
Mizokawa, Donald T.; Hamlin, Michael D. – Educational Technology, 1984
Suggestions for software design in computer managed testing (CMT) cover instructions to testees, their physical format, provision of practice items, and time limit information; test item presentation, physical format, discussion of task demands, review capabilities, and rate of presentation; pedagogically helpful utilities; typefonts; vocabulary;…
Descriptors: Computer Assisted Testing, Decision Making, Guidelines, Test Construction
David, Gergely – Language Testing, 2007
Some educational contexts almost mandate the application of multiple-choice (MC) testing techniques, even if they are deplored by many practitioners in the field. In such contexts especially, research into how well these types of item perform and how their performance may be characterised is both appropriate and desirable. The focus of this paper…
Descriptors: Student Evaluation, Grammar, Language Tests, Test Items
Wallach, P. M.; Crespo, L. M.; Holtzman, K. Z.; Galbraith, R. M.; Swanson, D. B. – Advances in Health Sciences Education, 2006
Purpose: In conjunction with curricular changes, a process to develop integrated examinations was implemented. Pre-established guidelines were provided favoring vignettes, clinically relevant material, and application of knowledge rather than simple recall. Questions were read aloud in a committee including all course directors, and a reviewer…
Descriptors: Test Items, Rating Scales, Examiners, Guidelines
Stacey, Susan E.; Moyer, Kerry L. – 1982
This handbook was developed for use by individuals with limited experience in performing good survey-based research. The essential procedures in the preparation of a survey are outlined and discussed. The method of survey construction proposed consists of several tasks: (1) survey objectives and research questions are specified; (2) literature is…
Descriptors: Data Collection, Elementary Secondary Education, Evaluation Methods, Guidelines
