Showing 181 to 195 of 956 results
Peer reviewed
Atalmis, Erkan Hasan – International Journal of Assessment Tools in Education, 2018
Although multiple-choice items (MCIs) are widely used for classroom assessment, designing MCIs with a sufficient number of plausible distractors is very challenging for teachers. In this regard, previous empirical studies reveal that three-option MCIs offer several advantages over four-option MCIs due to less preparation and…
Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Test Reliability
Peer reviewed
Gyllstad, Henrik; McLean, Stuart; Stewart, Jeffrey – Language Testing, 2021
The last three decades have seen an increase in tests aimed at measuring an individual's vocabulary level or size. The target words used in these tests are typically sampled from word frequency lists, which are in turn based on language corpora. Conventionally, test developers sample items from frequency bands of 1000 words; different tests employ…
Descriptors: Vocabulary Development, Sample Size, Language Tests, Test Items
Peer reviewed
Lopez, Alexis A. – Journal of Latinos and Education, 2023
In this study, I examined how 34 Spanish-speaking English language learners (ELLs) used their linguistic resources (English and Spanish) and language modes (oral and written language) to demonstrate their knowledge of proportional reasoning in a dual language mathematics assessment task. The assessment allows students to see the item in both…
Descriptors: Spanish Speaking, English Language Learners, Language Usage, Mathematics Instruction
Peer reviewed
Arce-Ferrer, Alvaro J.; Bulut, Okan – Journal of Experimental Education, 2019
This study investigated the performance of four widely used data-collection designs in detecting test-mode effects (i.e., computer-based versus paper-based testing). The experimental conditions included four data-collection designs, two test-administration modes, and the availability of an anchor assessment. The test-level and item-level results…
Descriptors: Data Collection, Test Construction, Test Format, Computer Assisted Testing
National Assessment Governing Board, 2019
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. The NAEP assessment in mathematics has two components that differ in purpose. One assessment measures long-term trends in achievement among 9-, 13-, and 17-year-old students by using the same basic design each time.…
Descriptors: National Competency Tests, Mathematics Achievement, Grade 4, Grade 8
Arneson, Amy – ProQuest LLC, 2019
This three-paper dissertation explores item cluster-based assessments, first in general as they relate to modeling, and then through specific issues surrounding a particular item cluster-based assessment design. There should be a reasonable analogy between the structure of a psychometric model and the cognitive theory that the assessment is based upon.…
Descriptors: Item Response Theory, Test Items, Critical Thinking, Cognitive Tests
Peer reviewed
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
Peer reviewed
Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2016
This article examines the possible dependency of composite reliability on presentation format of the elements of a multi-item measuring instrument. Using empirical data and a recent method for interval estimation of group differences in reliability, we demonstrate that the reliability of an instrument need not be the same when polarity of the…
Descriptors: Test Reliability, Test Format, Test Items, Differences
Sinharay, Sandip – Grantee Submission, 2018
Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns
Peer reviewed
Steedle, Jeffrey T.; Morrison, Kristin M. – Educational Assessment, 2019
Assessment items are commonly field tested prior to operational use to observe statistical item properties such as difficulty. Item parameter estimates from field testing may be used to assign scores via pre-equating or computer adaptive designs. This study examined differences between item difficulty estimates based on field test and operational…
Descriptors: Field Tests, Test Items, Statistics, Difficulty Level
Peer reviewed
Masrai, Ahmed – SAGE Open, 2022
Vocabulary size measures serve important functions, not only for placing learners at appropriate levels in language courses but also for examining learners' progress. One of the widely reported formats suitable for these purposes is the Yes/No vocabulary test. The primary aim of this study was to introduce and provide…
Descriptors: Vocabulary Development, Language Tests, English (Second Language), Second Language Learning
Peer reviewed
Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022
Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to interact with the test environment more fully than traditional multiple-choice items (MCIs) do. The interactive nature of TEIs offers improved construct coverage compared with MCIs, but little research exists regarding students' performance on…
Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)
Peer reviewed
O'Grady, Stefan – Innovation in Language Learning and Teaching, 2023
Purpose: The current study applies an innovative approach to the assessment of second language listening comprehension skills. This is an important focus in need of innovation because scores generated through language assessment tasks should reflect variation in the target skill, and the literature broadly suggests that conventional methods of…
Descriptors: Listening Comprehension, Second Language Learning, Correlation, English (Second Language)
Peer reviewed
Höhne, Jan Karem; Schlosser, Stephan; Krebs, Dagmar – Field Methods, 2017
Measuring attitudes and opinions with agree/disagree (A/D) questions is a common method in social research because it appears possible to measure different constructs with identical response scales. However, theoretical considerations suggest that A/D questions require considerable cognitive processing. Item-specific (IS) questions,…
Descriptors: Online Surveys, Test Format, Test Items, Difficulty Level
Peer reviewed
Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017
The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items from a given set (e.g., choosing one item from a pair to answer), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…
Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis