ERIC - Search Results

Showing all 14 results

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Fu, Jianbin; Tan, Xuan; Kyllonen, Patrick C. – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Rasch Analysis for Instrument Development: Why, When, and How?

Peer reviewed

Boone, William J. – CBE - Life Sciences Education, 2016

This essay describes Rasch analysis psychometric techniques and how such techniques can be used by life sciences education researchers to guide the development and use of surveys and tests. Specifically, Rasch techniques can be used to document and evaluate the measurement functioning of such instruments. Rasch techniques also allow researchers to…

Descriptors: Item Response Theory, Psychometrics, Science Education, Educational Research

Assessing Conceptual and Algorithmic Knowledge in General Chemistry with ACS Exams

Peer reviewed

Holme, Thomas; Murphy, Kristen – Journal of Chemical Education, 2011

In 2005, the ACS Examinations Institute released an exam for first-term general chemistry in which items are intentionally paired with one conceptual and one traditional item. A second-term, paired-questions exam was released in 2007. This paper presents an empirical study of student performances on these two exams based on national samples of…

Descriptors: Chemistry, Science Tests, College Science, Undergraduate Students

On Bias in Linear Observed-Score Equating

Peer reviewed

van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010

The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…

Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends

Applications of the Linear Logistic Test Model in Psychometric Research

Peer reviewed

Kubinger, Klaus D. – Educational and Psychological Measurement, 2009

The linear logistic test model (LLTM) breaks down the item parameter of the Rasch model as a linear combination of some hypothesized elementary parameters. Although the original purpose of applying the LLTM was primarily to generate test items with specified item difficulty, there are still many other potential applications, which may be of use…

Descriptors: Models, Test Items, Psychometrics, Item Response Theory

Reliability: A Rasch Perspective

Peer reviewed

Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007

Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…

Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory

A Method for Estimating Classification Consistency Indices for Two Equated Forms

Peer reviewed

Yi, Hyun Sook; Kim, Seonghoon; Brennan, Robert L. – Applied Psychological Measurement, 2007

Large-scale testing programs involving classification decisions typically have multiple forms available and conduct equating to ensure cut-score comparability across forms. A test developer might be interested in the extent to which an examinee who happens to take a particular form would have a consistent classification decision if he or she had…

Descriptors: Classification, Reliability, Indexes, Computation

An NCME Instructional Module on Booklet Designs in Large-Scale Assessments of Student Achievement: Theory and Practice

Peer reviewed

Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009

In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…

Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design

Test Equating under the Multiple-Choice Model.

Peer reviewed

Kim, Jee-Seon; Hanson, Bradley A. – Applied Psychological Measurement, 2002

Presents a characteristic curve procedure for comparing transformations of the item response theory ability scale assuming the multiple-choice model. Illustrates the use of the method with an example equating American College Testing mathematics tests. (SLD)

Descriptors: Ability, Equated Scores, Item Response Theory, Mathematics Tests

Testing the Equivalence among Different Item Response Formats in Personality Measurement: A Structural Equation Modeling Approach.

Peer reviewed

Ferrando, Pere J. – Structural Equation Modeling, 2000

Discusses a procedure for testing the equivalence among different item response formats used in personality and attitude measurement. The procedure is based on the assumption that latent response variables underlie the observed item responses. It uses a nested series of confirmatory factor analysis models based on K. Joreskog's (1971) method for…

Descriptors: Attitude Measures, Correlation, Item Response Theory, Personality Assessment

Adaptive Testing with Equated Number-Correct Scoring. Research Report 99-02.

van der Linden, Wim J. – 1999

A constrained computerized adaptive testing (CAT) algorithm is presented that automatically equates the number-correct scores on adaptive tests. The algorithm can be used to equate number-correct scores across different administrations of the same adaptive test as well as to an external reference test. The constraints are derived from a set of…

Descriptors: Ability, Adaptive Testing, Algorithms, Computer Assisted Testing

Evaluating Computer-Based and Paper-Based Versions of an English-Language Listening Test

Peer reviewed

Coniam, David – ReCALL, 2006

This paper describes an English language listening test intended as computer-based testing material for secondary school students in Hong Kong, where considerable attention is being invested in online and computer-based testing. As well as providing a school-based testing facility, the study aims to contribute to the knowledge base regarding the…

Descriptors: Listening Comprehension Tests, Computer Assisted Testing, Foreign Countries, Grade 12

The SAT: Four Major Modifications of the 1970-85 Era.

Valley, John R. – 1992

From 1970 to 1985, the Scholastic Aptitude Test (SAT) underwent major modifications caused by: (1) the addition of the Test of Standard Written English (TSWE) to the College Board's Admissions Testing Program (ATP); (2) the passage of test disclosure legislation; (3) the institution of test sensitivity reviews; and (4) the use of item response…

Descriptors: Achievement Tests, College Entrance Examinations, Educational History, Equated Scores

Differential Item Functioning and Male-Female Differences on Multiple-Choice Tests in Economics.

Peer reviewed

Walstad, William B.; Robson, Denise – Journal of Economic Education, 1997

Applies Item Response Theory methods to data from the national norming of the Test of Economic Literacy to identify test questions with large male-female differences. Regression analysis showed a significant decrease in the magnitude of gender difference, although a difference was still present. (MJP)

Descriptors: Academic Aptitude, Comparative Testing, Economics, Economics Education
