ERIC - Search Results

Publication Date

In 2026	0
Since 2025	18
Since 2022 (last 5 years)	120
Since 2017 (last 10 years)	262
Since 2007 (last 20 years)	435

Descriptor

Test Format	956
Test Items	956
Test Construction	363
Multiple Choice Tests	260
Foreign Countries	227
Difficulty Level	199
Higher Education	179
Computer Assisted Testing	160
Item Response Theory	151
Item Analysis	149
Scores	146
Comparative Analysis	143
Test Validity	127
Test Reliability	119
Language Tests	109
Mathematics Tests	98
English (Second Language)	88
Scoring	88
Achievement Tests	87
Testing	79
Reading Tests	76
Student Evaluation	75
Elementary Secondary Education	72
Science Tests	72
College Students	71
More ▼

Education Level

Higher Education	151
Postsecondary Education	128
Secondary Education	100
Elementary Education	62
Middle Schools	48
Junior High Schools	38
Elementary Secondary Education	36
Grade 8	35
High Schools	32
Intermediate Grades	24
Grade 4	22
Grade 6	10
Grade 7	10
Early Childhood Education	9
Grade 3	9
Grade 5	9
Primary Education	8
Grade 12	7
Grade 9	6
Kindergarten	3
Preschool Education	3
Grade 1	2
Grade 10	2
Grade 2	2
Grade 11	1
More ▼

Audience

Practitioners	62
Teachers	47
Researchers	32
Students	15
Administrators	13
Parents	6
Policymakers	5
Community	1
Counselors	1

Location

Turkey	27
Canada	15
Germany	15
Australia	13
Israel	13
Japan	12
Netherlands	10
United Kingdom	10
United States	9
Arizona	6
Iran	6
Malaysia	6
South Korea	6
Sweden	6
China	5
New Jersey	5
United Kingdom (England)	5
Louisiana	4
Taiwan	4
Belgium	3
Florida	3
Hong Kong	3
Indonesia	3
Nigeria	3
Ohio	3
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	2
Elementary and Secondary…	1
Head Start	1
Job Training Partnership Act…	1
Perkins Loan Program	1

What Works Clearinghouse Rating

Test Items X

Showing 271 to 285 of 956 results Save | Export

Validity and Reliability of Scores Obtained on Multiple-Choice Questions: Why Functioning Distractors Matter

Peer reviewed
PDF on ERIC

Download full text

Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G. – Journal of the Scholarship of Teaching and Learning, 2016

Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on validity and reliability of scores obtained on MCQs. Freeresponse (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…

Descriptors: Scores, Multiple Choice Tests, Test Reliability, Test Validity

EFL Teachers' Formal Assessment Practices Based on Exam Papers

Download full text

Kiliçkaya, Ferit – Online Submission, 2016

This study reports initial findings from a small-scale qualitative study aimed at gaining insights into English language teachers' assessment practices in Turkey by examining the formal exam papers. Based on the technique of content analysis, formal exam papers were analyzed in terms of assessment items, language skills tested as well as the…

Descriptors: English (Second Language), Qualitative Research, Content Analysis, Test Items

Effects of Design Properties on Parameter Estimation in Large-Scale Assessments

Peer reviewed

Direct link

Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015

The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…

Descriptors: Measurement, Computation, Test Format, Test Items

Multiple Mini-Interviews in the Age of the Internet: Does Preparation Help Applicants to Medical School?

Peer reviewed

Direct link

Moshinsky, Avital; Ziegler, David; Gafni, Naomi – International Journal of Testing, 2017

Many medical schools have adopted multiple mini-interviews (MMI) as an advanced selection tool. MMIs are expensive and used to test only a few dozen candidates per day, making it infeasible to develop a different test version for each test administration. Therefore, some items are reused both within and across years. This study investigated the…

Descriptors: Interviews, Medical Schools, Test Validity, Test Reliability

How Do Korean College Students Perform in Two Different Vocabulary Assessments?

Peer reviewed
PDF on ERIC

Download full text

Bokyoung Park – English Teaching, 2017

This study investigated Korean college students' performance as measured by two different vocabulary assessment tools (the Productive Vocabulary Levels Test (PVLT) and the Productive Vocabulary Use Task (PVUT)) and the relationship these assessments have with students' writing proficiency. A total of 72 students participated in the study. The…

Descriptors: Foreign Countries, Vocabulary Development, Language Tests, Second Language Learning

Determining When Single Scoring for Constructed-Response Items Is as Effective as Double Scoring in Mixed-Format Licensure Tests

Peer reviewed

Direct link

Kim, Sooyeon; Moses, Tim – International Journal of Testing, 2013

The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…

Descriptors: Scoring, Test Format, Licensing Examinations (Professions), Test Items

Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016

Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis

The "None of the Above" Option in Multiple-Choice Testing: An Experimental Study

Peer reviewed

Direct link

DiBattista, David; Sinnige-Egger, Jo-Anne; Fortuna, Glenda – Journal of Experimental Education, 2014

The authors assessed the effects of using "none of the above" as an option in a 40-item, general-knowledge multiple-choice test administered to undergraduate students. Examinees who selected "none of the above" were given an incentive to write the correct answer to the question posed. Using "none of the above" as the…

Descriptors: Multiple Choice Tests, Testing, Undergraduate Students, Test Items

A Comparison of Three Test Formats to Assess Word Difficulty

Peer reviewed

Direct link

Culligan, Brent – Language Testing, 2015

This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…

Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary

Modeling Local Item Dependence Due to Common Test Format with a Multidimensional Rasch Model

Peer reviewed

Direct link

Baghaei, Purya; Aryadoust, Vahid – International Journal of Testing, 2015

Research shows that test method can exert a significant impact on test takers' performance and thereby contaminate test scores. We argue that common test method can exert the same effect as common stimuli and violate the conditional independence assumption of item response theory models because, in general, subsets of items which have a shared…

Descriptors: Test Format, Item Response Theory, Models, Test Items

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Guide to English Language Arts/Literacy Released Items: Understanding Scoring. 2015

Download full text

Partnership for Assessment of Readiness for College and Careers, 2015

The Partnership for Assessment of Readiness for College and Careers (PARCC) is a group of states working together to develop a modern assessment that replaces previous state standardized tests. It provides better information for teachers and parents to identify where a student needs help, or is excelling, so they are able to enhance instruction to…

Descriptors: Literacy, Language Arts, Scoring Formulas, Scoring

Assessing the Impact of Characteristics of the Test, Common-Items, and Examinees on the Preservation of Equity Properties in Mixed-Format Test Equating

Direct link

Wolf, Raffaela – ProQuest LLC, 2013

Preservation of equity properties was examined using four equating methods--IRT True Score, IRT Observed Score, Frequency Estimation, and Chained Equipercentile--in a mixed-format test under a common-item nonequivalent groups (CINEG) design. Equating of mixed-format tests under a CINEG design can be influenced by factors such as attributes of the…

Descriptors: Testing, Item Response Theory, Equated Scores, Test Items

Examination of Test and Item Statistics from Visual and Verbal Mathematics Questions

Peer reviewed
PDF on ERIC

Download full text

Alpayar, Cagla; Gulleroglu, H. Deniz – Educational Research and Reviews, 2017

The aim of this research is to determine whether students' test performance and approaches to test questions change based on the type of mathematics questions (visual or verbal) administered to them. This research is based on a mixed-design model. The quantitative data are gathered from 297 seventh grade students, attending seven different middle…

Descriptors: Foreign Countries, Middle School Students, Grade 7, Student Evaluation

Assessment of Computer and Information Literacy in ICILS 2013: Do Different Item Types Measure the Same Construct?

Peer reviewed

Direct link

Ihme, Jan Marten; Senkbeil, Martin; Goldhammer, Frank; Gerick, Julia – European Educational Research Journal, 2017

The combination of different item formats is found quite often in large scale assessments, and analyses on the dimensionality often indicate multi-dimensionality of tests regarding the task format. In ICILS 2013, three different item types (information-based response tasks, simulation tasks, and authoring tasks) were used to measure computer and…

Descriptors: Foreign Countries, Computer Literacy, Information Literacy, International Assessment

« Previous Page | Next Page »

Pages: 1 | ... | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | ... | 64

Educational and Psychological…	48
Journal of Educational…	33
Applied Measurement in…	31
ProQuest LLC	28
ETS Research Report Series	16
Language Testing	16
International Journal of…	15
Educational Measurement:…	14
Practical Assessment,…	13
International Journal of…	11
Journal of Experimental…	11
Applied Psychological…	10
Educational Assessment	10
Language Assessment Quarterly	8
College Board	7
Grantee Submission	7
Online Submission	7
Journal of Psychoeducational…	6
Journal of Technology,…	6
Assessment & Evaluation in…	5
Teaching of Psychology	5
Advances in Health Sciences…	4
Education and Information…	4
Field Methods	4
Journal of Economic Education	4
More ▼

Plake, Barbara S.	12
Kim, Sooyeon	9
Huntley, Renee M.	8
Wainer, Howard	8
Haladyna, Thomas M.	7
Katz, Irvin R.	7
van der Linden, Wim J.	7
Allalouf, Avi	6
DeMars, Christine E.	5
Downing, Steven M.	5
Hambleton, Ronald K.	5
Sykes, Robert C.	5
Walker, Michael E.	5
Anderson, Paul S.	4
Bulut, Okan	4
Goldhammer, Frank	4
Herman, Joan	4
Keehner, Madeleine	4
Lawrence, Ida M.	4
Martinez, Michael E.	4
Pommerich, Mary	4
Sireci, Stephen G.	4
Stansfield, Charles W.	4
Stocking, Martha L.	4
More ▼

Reports - Research	593
Journal Articles	561
Speeches/Meeting Papers	172
Reports - Evaluative	156
Reports - Descriptive	87
Tests/Questionnaires	54
Guides - Non-Classroom	46
Information Analyses	30
Dissertations/Theses -…	29
Opinion Papers	19
Guides - Classroom - Teacher	18
Numerical/Quantitative Data	14
Books	4
Collected Works - General	4
Guides - Classroom - Learner	4
Reference Materials - General	4
ERIC Publications	3
Guides - General	3
Non-Print Media	3
Reference Materials -…	3
Computer Programs	2
ERIC Digests in Full Text	2
Multilingual/Bilingual…	2
Book/Product Reviews	1
Collected Works - Proceedings	1
More ▼

National Assessment of…	21
SAT (College Admission Test)	18
Program for International…	17
Trends in International…	16
ACT Assessment	14
Test of English as a Foreign…	13
Graduate Record Examinations	12
Advanced Placement…	8
Peabody Picture Vocabulary…	5
General Educational…	3
Graduate Management Admission…	3
International English…	3
Preliminary Scholastic…	3
Stanford Achievement Tests	3
State Trait Anxiety Inventory	3
Test of English for…	3
Texas Educational Assessment…	3
Armed Services Vocational…	2
College Level Examination…	2
Cornell Critical Thinking Test	2
Embedded Figures Test	2
Gates MacGinitie Reading Tests	2
Iowa Tests of Basic Skills	2
Law School Admission Test	2
Mathematics Anxiety Rating…	2
More ▼