ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	3

Descriptor

Classification	6
Test Format	6
Test Reliability	6
College Students	2
Evaluation Methods	2
Language Tests	2
Statistical Analysis	2
Test Validity	2
Algorithms	1
Attitude Measures	1
Cognitive Processes	1
Criterion Referenced Tests	1
Cutting Scores	1
Diagnostic Tests	1
Educational Assessment	1
English (Second Language)	1
Foreign Countries	1
Goodness of Fit	1
Guidelines	1
Higher Education	1
Item Response Theory	1
Language Proficiency	1
Learning Processes	1
Listening Comprehension	1
Listening Comprehension Tests	1
More ▼

Source

Educational and Psychological…	1
Journal of Educational…	1
Journal of Educational and…	1
Language Testing	1
Sociological Methods &…	1

Author

Aryadoust, Vahid	1
Berk, Ronald A.	1
Chiu, Chia-Yi	1
Köhn, Hans Friedrich	1
Luo, Lan	1
Menold, Natalja	1
Schriesheim, Chester A.	1
Stansfield, Charles W.	1
Tausch, Anja	1
Wang, Yu	1

Publication Type

Journal Articles	5
Information Analyses	3
Reports - Research	3
Guides - Non-Classroom	1
Reports - Evaluative	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Germany

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 6 results Save | Export

The Typology of Second Language Listening Constructs: A Systematic Review

Peer reviewed

Direct link

Aryadoust, Vahid; Luo, Lan – Language Testing, 2023

This study reviewed conceptualizations and operationalizations of second language (L2) listening constructs. A total of 157 peer-reviewed papers published in 19 journals in applied linguistics were coded for (1) publication year, author, source title, location, language, and reliability and (2) listening subskills, cognitive processes, attributes,…

Descriptors: Test Format, Listening Comprehension Tests, Second Language Learning, Second Language Instruction

Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis

Peer reviewed

Direct link

Wang, Yu; Chiu, Chia-Yi; Köhn, Hans Friedrich – Journal of Educational and Behavioral Statistics, 2023

The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format…

Descriptors: Multiple Choice Tests, Nonparametric Statistics, Test Format, Educational Assessment

Measurement of Latent Variables with Different Rating Scales: Testing Reliability and Measurement Equivalence by Varying the Verbalization and Number of Categories

Peer reviewed

Direct link

Menold, Natalja; Tausch, Anja – Sociological Methods & Research, 2016

Effects of rating scale forms on cross-sectional reliability and measurement equivalence were investigated. A randomized experimental design was implemented, varying category labels and number of categories. The participants were 800 students at two German universities. In contrast to previous research, reliability assessment method was used,…

Descriptors: Rating Scales, Test Reliability, Measurement, Classification

A Consumers' Guide to Criterion-Referenced Test Reliability. Reliability.

Peer reviewed

Berk, Ronald A. – Journal of Educational Measurement, 1980

A dozen different approaches that yield 13 reliability indices for criterion-referenced tests were identified and grouped into three categories: threshold loss function, squared-error loss function, and domain score estimation. Indices were evaluated within each category. (Author/RL)

Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Evaluation Methods

The Effect of Grouped versus Randomized Questionnaire Format on Scale Reliability and Validity: A Three-Study Investigation.

Peer reviewed

Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1989

Three studies explored the effects of grouping versus randomized items in questionnaires on internal consistency and test-retest reliability with samples of 80, 80, and 100, respectively, university students and undergraduates. The 2 correlational and 1 experimental studies were reasonably consistent in demonstrating that neither format was…

Descriptors: Classification, College Students, Evaluation Methods, Higher Education

IDEA Oral Language Proficiency Test (IPT II).

Download full text

Stansfield, Charles W. – 1990

The IDEA Oral Language Proficiency Test (IPT II), an individually-administered measure of speaking and listening proficiency in English as a Second Language designed for secondary school students, is described and discussed. The test consists of 91 items and requires 5-25 minutes to administer. Raw scores are converted to one of seven proficiency…

Descriptors: Classification, English (Second Language), Language Proficiency, Language Tests