Showing 1 to 15 of 31 results
Peer reviewed
Direct link
Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024
This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…
Descriptors: Questionnaires, Test Items, Item Response Theory, Models
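The Rank-2PLM models named in the abstract build on the two-parameter logistic (2PL) item response model. As a minimal sketch of the underlying machinery (the standard dichotomous 2PL and its Fisher item information, not the paper's Rank-2PLM itself; all parameter values illustrative):

```python
import math

def p_2pl(theta, a, b):
    """Two-parameter logistic (2PL) probability of a correct response,
    with ability theta, discrimination a, and difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def info_2pl(theta, a, b):
    """Fisher item information for the 2PL: I(theta) = a^2 * P * (1 - P)."""
    p = p_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

# Information peaks where ability equals item difficulty (theta = b).
print(info_2pl(0.0, 1.5, 0.0))  # → 0.5625, i.e. 1.5^2 * 0.5 * 0.5
```

Test information is then the sum of item information functions across the items on the form.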
Peer reviewed
Direct link
Glazer, Nancy; Wolfe, Edward W. – Applied Measurement in Education, 2020
This introductory article describes how constructed response scoring is carried out, particularly the rater monitoring processes and illustrates three potential designs for conducting rater monitoring in an operational scoring project. The introduction also presents a framework for interpreting research conducted by those who study the constructed…
Descriptors: Scoring, Test Format, Responses, Predictor Variables
Peer reviewed
PDF on ERIC | Download full text
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
Peer reviewed
PDF on ERIC | Download full text
Sharakhimov, Shoaziz; Nurmukhamedov, Ulugbek – English Teaching Forum, 2021
Vocabulary learning is an incremental process. Vocabulary knowledge, especially for second-language learners, may develop across a lifetime. Teachers with experience in providing feedback on their students' vocabulary use in writing or speech might have noticed that it is sometimes difficult to pinpoint one aspect of word knowledge. The reason is…
Descriptors: Vocabulary Development, Second Language Learning, Second Language Instruction, English (Second Language)
Peer reviewed
PDF on ERIC | Download full text
Crowther, Gregory J.; Wiggins, Benjamin L.; Jenkins, Lekelia D. – HAPS Educator, 2020
Many undergraduate biology instructors incorporate active learning exercises into their lessons while continuing to assess students with traditional exams. To better align practice and exams, we present an approach to question-asking that emphasizes templates instead of specific questions. Students and instructors can use these Test Question…
Descriptors: Science Tests, Active Learning, Biology, Undergraduate Students
Peer reviewed
Direct link
Kamber, David N. – Journal of Chemical Education, 2021
In-person interactions between faculty and students personalize the learning experience and are the hallmark of primarily undergraduate institutions. These invaluable student-faculty interactions were disrupted during the 2019 SARS-CoV-2 global pandemic and led to a rapid, unprecedented shift to distance learning. Within the space of virtual…
Descriptors: Biochemistry, Science Instruction, Teaching Methods, Undergraduate Students
Mullis, Ina V. S., Ed.; Martin, Michael O., Ed.; von Davier, Matthias, Ed. – International Association for the Evaluation of Educational Achievement, 2021
TIMSS (Trends in International Mathematics and Science Study) is a long-standing international assessment of mathematics and science at the fourth and eighth grades that has been collecting trend data every four years since 1995. About 70 countries use TIMSS trend data for monitoring the effectiveness of their education systems in a global…
Descriptors: Achievement Tests, International Assessment, Science Achievement, Mathematics Achievement
Peer reviewed
Direct link
Boone, William J. – CBE - Life Sciences Education, 2016
This essay describes Rasch analysis psychometric techniques and how such techniques can be used by life sciences education researchers to guide the development and use of surveys and tests. Specifically, Rasch techniques can be used to document and evaluate the measurement functioning of such instruments. Rasch techniques also allow researchers to…
Descriptors: Item Response Theory, Psychometrics, Science Education, Educational Research
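For readers unfamiliar with the model underlying Rasch analysis, a minimal sketch of the dichotomous Rasch model (a one-parameter logistic: response probability depends only on the gap between person ability and item difficulty; values illustrative):

```python
import math

def rasch_prob(theta, b):
    """Dichotomous Rasch model: probability of a correct response,
    P(X=1 | theta, b) = exp(theta - b) / (1 + exp(theta - b)),
    with person ability theta and item difficulty b on a common logit scale."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

# When ability equals item difficulty, the probability is exactly 0.5.
print(rasch_prob(0.0, 0.0))  # → 0.5
```

Placing persons and items on this common scale is what lets Rasch techniques document how well a survey or test instrument is functioning as a measure.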
Peer reviewed
Direct link
Kubinger, Klaus D. – Educational and Psychological Measurement, 2009
The linear logistic test model (LLTM) breaks down the item parameter of the Rasch model as a linear combination of some hypothesized elementary parameters. Although the original purpose of applying the LLTM was primarily to generate test items with specified item difficulty, there are still many other potential applications, which may be of use…
Descriptors: Models, Test Items, Psychometrics, Item Response Theory
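The decomposition described in the abstract, item difficulty as a linear combination of hypothesized elementary parameters, can be sketched as follows (the Q-matrix row and eta values here are hypothetical, purely for illustration):

```python
def lltm_difficulty(q_row, eta):
    """LLTM decomposition: item difficulty b_i = sum_k q_ik * eta_k,
    a linear combination of elementary-operation parameters eta,
    weighted by the item's row of the design (Q) matrix."""
    return sum(q * e for q, e in zip(q_row, eta))

# Hypothetical design: three elementary operations with assumed parameters;
# this item involves operations 1 and 3.
eta = [0.5, -0.2, 1.1]   # basic parameters for the elementary operations
q_item = [1, 0, 1]       # Q-matrix row: which operations the item requires
print(lltm_difficulty(q_item, eta))  # → 1.6
```

Generating a new item with a chosen difficulty then amounts to picking a Q-matrix row whose weighted sum hits the target value.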
Peer reviewed
Direct link
Holme, Thomas; Murphy, Kristen – Journal of Chemical Education, 2011
In 2005, the ACS Examinations Institute released an exam for first-term general chemistry in which items are intentionally paired with one conceptual and one traditional item. A second-term, paired-questions exam was released in 2007. This paper presents an empirical study of student performances on these two exams based on national samples of…
Descriptors: Chemistry, Science Tests, College Science, Undergraduate Students
Peer reviewed
Direct link
Carr, Nathan T.; Xi, Xiaoming – Language Assessment Quarterly, 2010
This article examines how the use of automated scoring procedures for short-answer reading tasks can affect the constructs being assessed. In particular, it highlights ways in which the development of scoring algorithms intended to apply the criteria used by human raters can lead test developers to reexamine and even refine the constructs they…
Descriptors: Scoring, Automation, Reading Tests, Test Format
Peer reviewed
Direct link
Solheim, Oddny Judith; Skaftun, Atle – Assessment in Education: Principles, Policy & Practice, 2009
During the last three decades the constructed response format has gradually gained entry in large-scale assessments of reading comprehension. In their 1991 Reading Literacy Study, the International Association for the Evaluation of Educational Achievement (IEA) included constructed response items on an exploratory basis. Ten years later, in…
Descriptors: Reading Comprehension, Literacy, Reading Tests, Responses
Peer reviewed
Direct link
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
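A minimal sketch of the equipercentile idea described above: a score on new form X is mapped to the form-Y score holding the same percentile rank. This toy version uses empirical score distributions and a simple linear-interpolation quantile, not the synthetic-population machinery the issue discusses:

```python
import bisect

def equipercentile_equate(scores_x, scores_y, x):
    """Map a score x on form X to the form-Y score with the same
    percentile rank, using the two forms' observed score samples."""
    sx = sorted(scores_x)
    sy = sorted(scores_y)
    # Percentile rank of x within the form-X sample.
    p = bisect.bisect_right(sx, x) / len(sx)
    # Quantile at rank p on form Y, interpolating between order statistics.
    pos = p * (len(sy) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sy) - 1)
    frac = pos - lo
    return sy[lo] + frac * (sy[hi] - sy[lo])
```

If form Y is uniformly 5 points harder-scored than form X (every Y score shifted up by 5), the equated score comes out close to x + 5, as expected.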
Peer reviewed
Direct link
Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007
Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…
Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory
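As one concrete instance of the internal-consistency reliability mentioned above, Cronbach's alpha can be sketched as follows (the data layout, one score list per item, is an assumption of this sketch):

```python
import statistics

def cronbach_alpha(item_scores):
    """Cronbach's alpha, an internal-consistency reliability coefficient:
    alpha = k/(k-1) * (1 - sum of item variances / variance of total scores),
    where item_scores is a list of k per-item score lists of equal length."""
    k = len(item_scores)
    item_vars = sum(statistics.pvariance(item) for item in item_scores)
    totals = [sum(vals) for vals in zip(*item_scores)]
    total_var = statistics.pvariance(totals)
    return k / (k - 1) * (1 - item_vars / total_var)

# Three perfectly parallel items yield an alpha of 1.0.
items = [[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]
print(cronbach_alpha(items))  # → 1.0
```

Other reliability coefficients in classical test theory differ by which source of error (items, occasions, raters) the administration design lets them isolate.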
Peer reviewed
Direct link
Yi, Hyun Sook; Kim, Seonghoon; Brennan, Robert L. – Applied Psychological Measurement, 2007
Large-scale testing programs involving classification decisions typically have multiple forms available and conduct equating to ensure cut-score comparability across forms. A test developer might be interested in the extent to which an examinee who happens to take a particular form would have a consistent classification decision if he or she had…
Descriptors: Classification, Reliability, Indexes, Computation