Showing all 15 results
Peer reviewed
Direct link
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
Peer reviewed
Direct link
Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…
Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format
Peer reviewed
PDF on ERIC Download full text
Yilmaz, Haci Bayram – International Electronic Journal of Elementary Education, 2019
Open-ended and multiple-choice questions are commonly placed on the same tests; however, the effects of using different item types on test and item statistics remain debated. This study aims to compare model and item fit statistics in a mixed-format test where multiple-choice and constructed-response items are used together. In this…
Descriptors: Item Response Theory, Models, Goodness of Fit, Elementary School Science
Al-Jarf, Reima – Online Submission, 2023
This article aims to give a comprehensive guide to planning and designing vocabulary tests, which includes identifying the skills to be covered by the test; outlining the course content covered; preparing a table of specifications that shows the skill, content topics, and number of questions allocated to each; and preparing the test instructions. The…
Descriptors: Vocabulary Development, Learning Processes, Test Construction, Course Content
Peer reviewed
PDF on ERIC Download full text
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
Yun, Joonmo – ProQuest LLC, 2017
Reading comprehension is an essential skill for success in school and post-school life. However, despite the importance of this skill, a considerable number of students in the U.S. have shown difficulties in reading comprehension. According to the 2015 National Assessment of Educational Progress (NAEP) of the National Center for Education…
Descriptors: Reading Comprehension, Factor Analysis, Achievement Tests, Foreign Countries
Peer reviewed
Direct link
Leber, Jasmin; Renkl, Alexander; Nückles, Matthias; Wäschle, Kristin – Learning: Research and Practice, 2018
According to the model of constructive alignment, learners adjust their learning strategies to the announced assessment (backwash effect). Hence, when teaching for understanding, the assessment method should be aligned with this teaching goal to ensure that learners engage in corresponding learning strategies. A quasi-experimental field study with…
Descriptors: Learning Strategies, Testing Problems, Educational Objectives, Learning Motivation
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Peer reviewed
Direct link
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis
Peer reviewed
Direct link
Baghaei, Purya; Aryadoust, Vahid – International Journal of Testing, 2015
Research shows that test method can exert a significant impact on test takers' performance and thereby contaminate test scores. We argue that common test method can exert the same effect as common stimuli and violate the conditional independence assumption of item response theory models because, in general, subsets of items which have a shared…
Descriptors: Test Format, Item Response Theory, Models, Test Items
Peer reviewed
Direct link
Ihme, Jan Marten; Senkbeil, Martin; Goldhammer, Frank; Gerick, Julia – European Educational Research Journal, 2017
The combination of different item formats is found quite often in large scale assessments, and analyses on the dimensionality often indicate multi-dimensionality of tests regarding the task format. In ICILS 2013, three different item types (information-based response tasks, simulation tasks, and authoring tasks) were used to measure computer and…
Descriptors: Foreign Countries, Computer Literacy, Information Literacy, International Assessment
Peer reviewed
Direct link
Debeer, Dries; Janssen, Rianne – Journal of Educational Measurement, 2013
Changing the order of items between alternate test forms to prevent copying and to enhance test security is a common practice in achievement testing. However, these changes in item order may affect item and test characteristics. Several procedures have been proposed for studying these item-order effects. The present study explores the use of…
Descriptors: Item Response Theory, Test Items, Test Format, Models
Peer reviewed
Direct link
Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 2007
In a randomized experiment (n = 515), a computerized test and a computerized adaptive test (CAT) are compared. The item pool consists of 24 polytomous motivation items. Although items are carefully selected, calibration data show that Samejima's graded response model did not fit the data optimally. A simulation study is done to assess possible…
Descriptors: Student Motivation, Simulation, Adaptive Testing, Computer Assisted Testing
Schils, Erik; Weltens, Bert – 1991
A study investigated the retention and loss of school-learned foreign language skills. Subjects were 150 students of Dutch secondary schools with 4 or 6 years of French language instruction and 0, 2, and 4 years of disuse. Nine tests of general language proficiency, listening and reading comprehension, phonology, vocabulary, and grammar were…
Descriptors: Cloze Procedure, Foreign Countries, French, Language Maintenance
Kreeft, Henk; Sanders, Piet – 1983
In the Dutch national examinations, reading comprehension tests are used for all languages. For the native language, reading comprehension is tested with reading passages and related questions to which the test-taker provides his or her own response rather than choosing from a group of alternatives. One problem encountered in testing with these items is…
Descriptors: Dutch, Evaluation Methods, Evaluators, Foreign Countries