ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	7
Since 2007 (last 20 years)	11

Descriptor

Guidelines	18
Test Format	18
Test Items	18
Test Construction	7
Difficulty Level	5
Student Evaluation	5
Comparative Analysis	4
Foreign Countries	4
Language Tests	4
Multiple Choice Tests	4
Rating Scales	4
Simulation	4
Test Validity	4
Computer Assisted Testing	3
Elementary Secondary Education	3
Item Analysis	3
Item Response Theory	3
Scores	3
Second Language Learning	3
Test Length	3
Test Reliability	3
Academic Achievement	2
Behavioral Objectives	2
Correlation	2
Decision Making	2
More ▼

Source

Journal of Educational…	2
Advances in Health Sciences…	1
Applied Measurement in…	1
Educational Technology	1
Educational and Psychological…	1
GED Testing Service	1
Journal of Language and…	1
Language Assessment Quarterly	1
Language Education &…	1
Language Testing	1
National Assessment Governing…	1
ProQuest LLC	1
Quality Assurance in…	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	9
Guides - Non-Classroom	3
Reports - Descriptive	2
Reports - Evaluative	2
Dissertations/Theses -…	1
Guides - General	1
Reference Materials -…	1

Education Level

High Schools	2
Elementary Education	1
Elementary Secondary Education	1
Grade 12	1
Grade 4	1
Grade 8	1
High School Equivalency…	1
Higher Education	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Practitioners	5
Administrators	3
Community	1
Teachers	1

Location

China	1
European Union	1
Georgia	1
Hungary	1
Switzerland	1
Turkey	1
Vietnam	1

Laws, Policies, & Programs

Job Training Partnership Act…

Assessments and Surveys

General Educational…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Constructing a Robust Score Scale from IRT Scores with Informed Boundaries

Peer reviewed

Direct link

Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…

Descriptors: Scores, Item Response Theory, Test Items, Test Format

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Direct link

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level

Gazing into Cognition: Eye Behavior in Online L2 Speaking Tests

Peer reviewed

Direct link

Burton, J. Dylan – Language Assessment Quarterly, 2023

The effects of question or task complexity on second language speaking have traditionally been investigated using complexity, accuracy, and fluency measures. Response processes in speaking tests, however, may manifest in other ways, such as through nonverbal behavior. Eye behavior, in the form of averted gaze or blinking frequency, has been found…

Descriptors: Oral Language, Speech Communication, Language Tests, Eye Movements

Sign Language Learning and Assessment in German Switzerland: Exploring the Potential of Vocabulary Size Tests for Swiss German Sign Language

Peer reviewed
PDF on ERIC

Download full text

Haug, Tobias; Ebling, Sarah; Braem, Penny Boyes; Tissi, Katja; Sidler-Miserez, Sandra – Language Education & Assessment, 2019

In German Switzerland the learning and assessment of Swiss German Sign Language ("Deutschschweizerische Gebärdensprache," DSGS) takes place in different contexts, for example, in tertiary education or in continuous education courses. By way of the still ongoing implementation of the Common European Framework of Reference for DSGS,…

Descriptors: German, Sign Language, Language Tests, Test Items

Proficiency Exams in Teaching Turkish as a Foreign Language in TÖMER (Turkish and Foreign Languages Research and Application Centers)

Peer reviewed
PDF on ERIC

Download full text

Karagöl, Efecan – Journal of Language and Linguistic Studies, 2020

Turkish and Foreign Languages Research and Application Center (TÖMER) is one of the important institutions for learning Turkish as a foreign language. In these institutions, proficiency tests are applied at the end of each level. However, test applications in TÖMERs vary between each center as there is no shared program in teaching Turkish as a…

Descriptors: Language Tests, Turkish, Language Proficiency, Second Language Learning

Abridged Mathematics Framework for the 2017 National Assessment of Educational Progress

Download full text

National Assessment Governing Board, 2017

The National Assessment of Educational Progress (NAEP) is the only continuing and nationally representative measure of trends in academic achievement of U.S. elementary and secondary school students in various subjects. For more than four decades, NAEP assessments have been conducted periodically in reading, mathematics, science, writing, U.S.…

Descriptors: Mathematics Achievement, Multiple Choice Tests, National Competency Tests, Educational Trends

Assessment Guide for Educators: Introduction

Download full text

GED Testing Service, 2016

This guide is designed to help adult educators and administrators better understand the content of the GED® test. This guide is tailored to each test subject and highlights the test's item types, assessment targets, and guidelines for how items will be scored. This 2016 edition has been updated to include the most recent information about the…

Descriptors: Guidelines, Teaching Guides, High School Equivalency Programs, Test Items

Conditions Affecting the Accuracy of Classical Equating Methods for Small Samples under the NEAT Design: A Simulation Study

Direct link

Sunnassee, Devdass – ProQuest LLC, 2011

Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…

Descriptors: Test Length, Test Format, Sample Size, Simulation

A Generalized DIF Effect Variance Estimator for Measuring Unsigned Differential Test Functioning in Mixed Format Tests

Peer reviewed

Direct link

Penfield, Randall D.; Algina, James – Journal of Educational Measurement, 2006

One approach to measuring unsigned differential test functioning is to estimate the variance of the differential item functioning (DIF) effect across the items of the test. This article proposes two estimators of the DIF effect variance for tests containing dichotomous and polytomous items. The proposed estimators are direct extensions of the…

Descriptors: Test Bias, Test Format, Test Items, Simulation

Peer reviewed

Direct link

Ascalon, M. Evelina; Meyers, Lawrence S.; Davis, Bruce W.; Smits, Niels – Applied Measurement in Education, 2007

This article examined two item-writing guidelines: the format of the item stem and homogeneity of the answer set. Answering the call of Haladyna, Downing, and Rodriguez (2002) for empirical tests of item writing guidelines and extending the work of Smith and Smith (1988) on differential use of item characteristics, a mock multiple-choice driver's…

Descriptors: Guidelines, Difficulty Level, Standard Setting, Driver Education

Guidelines for Computer-Managed Testing.

Mizokawa, Donald T.; Hamlin, Michael D. – Educational Technology, 1984

Suggestions for software design in computer managed testing (CMT) cover instructions to testees, their physical format, provision of practice items, and time limit information; test item presentation, physical format, discussion of task demands, review capabilities, and rate of presentation; pedagogically helpful utilities; typefonts; vocabulary;…

Descriptors: Computer Assisted Testing, Decision Making, Guidelines, Test Construction

Investigating the Performance of Alternative Types of Grammar Items

Peer reviewed

Direct link

David, Gergely – Language Testing, 2007

Some educational contexts almost mandate the application of multiple-choice (MC) testing techniques, even if they are deplored by many practitioners in the field. In such contexts especially, research into how well these types of item perform and how their performance may be characterised is both appropriate and desirable. The focus of this paper…

Descriptors: Student Evaluation, Grammar, Language Tests, Test Items

Use of a Committee Review Process to Improve the Quality of Course Examinations

Peer reviewed

Direct link

Wallach, P. M.; Crespo, L. M.; Holtzman, K. Z.; Galbraith, R. M.; Swanson, D. B. – Advances in Health Sciences Education, 2006

Purpose: In conjunction with curricular changes, a process to develop integrated examinations was implemented. Pre-established guidelines were provided favoring vignettes, clinically relevant material, and application of knowledge rather than simple recall. Questions were read aloud in a committee including all course directors, and a reviewer…

Descriptors: Test Items, Rating Scales, Examiners, Guidelines

A Guide to Survey Development: Manual on Writing a Survey.

Stacey, Susan E.; Moyer, Kerry L. – 1982

This handbook was developed for use by individuals with limited experience in performing good survey-based research. The essential procedures in the preparation of a survey are outlined and discussed. The method of survey construction proposed consists of several tasks: (1) survey objectives and research questions are specified; (2) literature is…

Descriptors: Data Collection, Elementary Secondary Education, Evaluation Methods, Guidelines

Previous Page | Next Page »

Pages: 1 | 2

Algina, James	1
Ascalon, M. Evelina	1
Braem, Penny Boyes	1
Burton, J. Dylan	1
Choe, Edison M.	1
Crespo, L. M.	1
David, Gergely	1
Davis, Bruce W.	1
Ebling, Sarah	1
Erickson, Harley E.	1
Galbraith, R. M.	1
Hamlin, Michael D.	1
Han, Kyung T.	1
Haug, Tobias	1
Holtzman, K. Z.	1
Huang, Hung-Yu	1
Karagöl, Efecan	1
Kárász, Judit T.	1
Meyers, Lawrence S.	1
Miller, Patrick W.	1
Mizokawa, Donald T.	1
Moyer, Kerry L.	1
Penfield, Randall D.	1
Sidler-Miserez, Sandra	1
Smits, Niels	1
More ▼