Showing all 15 results
Thapa, Kritika – ProQuest LLC, 2023
Measurement invariance is crucial for making valid comparisons across different groups (Kline, 2016; Vandenberg, 2002). To address challenges associated with invariance testing, such as large sample size requirements and model complexity, applied researchers have incorporated parcels. Parcels have been shown to alleviate skewness,…
Descriptors: Elementary Secondary Education, Achievement Tests, Foreign Countries, International Assessment
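Parceling is typically implemented by averaging (or summing) subsets of items into composite indicators before fitting the measurement model. A minimal sketch in Python, assuming a hypothetical 12-item scale split into three 4-item parcels (the item-to-parcel assignment is illustrative, not the author's):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical responses: 200 examinees x 12 Likert-type items (1-5).
items = rng.integers(1, 6, size=(200, 12))

# Assign items to three 4-item parcels (illustrative grouping only).
parcel_index = [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11]]

# Each parcel score is the mean of its constituent items; the parcels
# then serve as indicators of the latent factor in place of single items.
parcels = np.column_stack([items[:, idx].mean(axis=1) for idx in parcel_index])

print(parcels.shape)  # (200, 3)
```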
Peer reviewed
Ockey, Gary J.; Wagner, Elvis – Language Learning & Language Teaching, 2018
This book is relevant for language testers, listening researchers, and oral proficiency teachers, in that it explores four broad themes related to the assessment of L2 listening ability: the use of authentic, real-world spoken texts; the effects of different speech varieties of listening inputs; the use of audio-visual texts; and assessing…
Descriptors: Listening Comprehension, Second Language Learning, Second Language Instruction, Listening Comprehension Tests
Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010
This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…
Descriptors: Guides, Item Response Theory, Test Items, Correlation
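In a polytomous model such as Samejima's graded response model (one of the families a handbook like this covers), the probability of responding in category k is the difference between adjacent cumulative boundary curves. A small illustrative sketch with made-up parameter values:

```python
import numpy as np

def grm_category_probs(theta, a, b):
    """Samejima graded response model.

    theta : latent trait value
    a     : item discrimination
    b     : increasing list of K-1 category boundary locations
    Returns the probabilities of the K response categories.
    """
    b = np.asarray(b)
    # Cumulative boundary curves P(X >= k), padded with 1 and 0 at the ends.
    p_star = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    cum = np.concatenate(([1.0], p_star, [0.0]))
    return cum[:-1] - cum[1:]  # P(X = k) = P(X >= k) - P(X >= k+1)

# Example: a 4-category item with boundaries at -1, 0, 1.5 (hypothetical).
print(grm_category_probs(theta=0.5, a=1.2, b=[-1.0, 0.0, 1.5]))
```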
Chevalaz, Gerard M.; Tatsuoka, Kikumi K. – 1983
Two order-theoretic techniques were presented and compared: the ordering theory of Krus and Bart (1974) and Tatsuoka and Tatsuoka's (1981) extension of Takeya's item relational structure analysis (IRS). Both were used to extract the hierarchical item structure from three datasets. Directed graphs were constructed, and both methods were assessed as to how…
Descriptors: Comparative Analysis, Computer Simulation, Instructional Design, Item Analysis
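In ordering-theoretic approaches of this kind, a prerequisite relation between items is inferred from how rarely examinees answer a "harder" item correctly while missing the "easier" one; the resulting edges form the directed graph. A rough sketch of the idea, assuming a binary response matrix and an arbitrary tolerance (a generic illustration, not either paper's exact algorithm):

```python
import numpy as np

def prerequisite_edges(responses, tol=0.03):
    """Infer directed edges i -> j ("i is prerequisite to j").

    responses : (n_examinees, n_items) binary 0/1 matrix
    tol       : maximum tolerated proportion of disconfirming
                patterns, i.e. examinees who fail i but pass j.
    """
    n, k = responses.shape
    edges = []
    for i in range(k):
        for j in range(k):
            if i == j:
                continue
            # Disconfirming pattern for i -> j: item i wrong, item j right.
            disconfirming = np.mean((responses[:, i] == 0) & (responses[:, j] == 1))
            if disconfirming <= tol:
                edges.append((i, j))
    return edges
```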
Subkoviak, Michael J.; Harris, Deborah J. – 1984
This study examined three statistical methods for selecting items for mastery tests. One is the pretest-posttest method due to Cox and Vargas (1966); it is computationally simple, but has a number of serious limitations. The second is a latent trait method recommended by van der Linden (1981); it is computationally complex, but has a number of…
Descriptors: Comparative Analysis, Elementary Secondary Education, Item Analysis, Latent Trait Theory
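The Cox and Vargas (1966) index illustrates why the pretest-posttest method is computationally simple: an item's value for a mastery test is summarized as the difference between the proportions passing it after and before instruction. A minimal sketch with hypothetical data:

```python
import numpy as np

def pretest_posttest_index(pre, post):
    """Cox-Vargas difference index D = p_post - p_pre.

    pre, post : binary (n_examinees, n_items) matrices of item scores
                collected before and after instruction.
    Items with large positive D are sensitive to instruction and are
    candidates for a mastery test.
    """
    return post.mean(axis=0) - pre.mean(axis=0)

rng = np.random.default_rng(1)
pre = (rng.random((50, 10)) < 0.3).astype(int)   # hypothetical pretest
post = (rng.random((50, 10)) < 0.8).astype(int)  # hypothetical posttest
print(pretest_posttest_index(pre, post))
```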
Holland, Paul W.; Thayer, Dorothy T. – 1986
The Mantel-Haenszel procedure (MH) is a practical, inexpensive, and powerful way to detect test items that function differently in two groups of examinees. MH is a natural outgrowth of previously suggested chi square methods, and it is also related to methods based on item response theory. The study of items that function differently for two…
Descriptors: Comparative Analysis, Hypothesis Testing, Item Analysis, Latent Trait Theory
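The MH procedure aggregates 2x2 tables (group by correct/incorrect) across matched score strata. A compact sketch of the common odds-ratio estimate and the continuity-corrected chi-square, following the standard Mantel-Haenszel formulas (the table layout and variable names are mine):

```python
import numpy as np

def mantel_haenszel(tables):
    """tables: list of 2x2 count arrays, one per matched score stratum,
    laid out as [[A, B], [C, D]] with rows = reference/focal group
    and columns = correct/incorrect."""
    A = np.array([t[0][0] for t in tables], dtype=float)
    B = np.array([t[0][1] for t in tables], dtype=float)
    C = np.array([t[1][0] for t in tables], dtype=float)
    D = np.array([t[1][1] for t in tables], dtype=float)
    T = A + B + C + D

    # Common odds ratio: values far from 1 suggest differential functioning.
    alpha_mh = np.sum(A * D / T) / np.sum(B * C / T)

    # Continuity-corrected MH chi-square (1 df).
    n_ref, n_foc = A + B, C + D          # group sizes per stratum
    m1, m0 = A + C, B + D                # correct / incorrect totals
    expected = n_ref * m1 / T
    variance = n_ref * n_foc * m1 * m0 / (T**2 * (T - 1))
    chi2 = (abs(np.sum(A - expected)) - 0.5) ** 2 / np.sum(variance)
    return alpha_mh, chi2
```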
Skaggs, Gary; Stevenson, Jose – 1986
This study assesses the accuracy of ASCAL, a microcomputer-based program for estimating item parameters for the three-parameter logistic model in item response theory. Item responses are generated from a three-parameter model, and item parameter estimates from ASCAL are compared to the generating item parameters and to estimates produced by…
Descriptors: Algorithms, Comparative Analysis, Computer Software, Estimation (Mathematics)
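The simulation design rests on the three-parameter logistic model, under which the probability of a correct response is P(θ) = c + (1 − c) / (1 + exp(−1.7a(θ − b))). A minimal sketch of generating item responses from that model (parameter ranges are arbitrary):

```python
import numpy as np

def p_3pl(theta, a, b, c):
    """Three-parameter logistic item response function
    (with the conventional 1.7 scaling constant)."""
    return c + (1.0 - c) / (1.0 + np.exp(-1.7 * a * (theta - b)))

rng = np.random.default_rng(2)
theta = rng.normal(size=1000)               # examinee abilities
a = rng.uniform(0.5, 2.0, size=30)          # discriminations
b = rng.normal(size=30)                     # difficulties
c = rng.uniform(0.0, 0.25, size=30)         # pseudo-guessing

# Probability matrix (examinees x items), then simulated 0/1 responses.
P = p_3pl(theta[:, None], a, b, c)
responses = (rng.random(P.shape) < P).astype(int)
```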
Peer reviewed
Jacobs, Stanley S. – Research in Higher Education, 1995
Comparison of college freshman performance on two different forms of the California Critical Thinking Skills Test (n = 684 and 692) found a lack of equivalence between forms and low internal consistency reliability. It is suggested that, although the test may be useful for research, it is not appropriate for decision making about individual students.…
Descriptors: College Freshmen, Comparative Analysis, Critical Thinking, Educational Research
Kingsbury, G. Gage – 1985
This study examined a procedure that uses response function discrepancies (RFD) to assess content-area and total-test dimensionality. Three different versions of the RFD procedure were compared to Bejar's principal-axis content-area procedure and Indow and Samejima's exploratory factor analytic technique. The procedures were compared in terms of the…
Descriptors: Achievement Tests, Comparative Analysis, Elementary Education, Estimation (Mathematics)
Muraki, Eiji – 1984
The TESTFACT computer program and full-information factor analysis of test items were used in a computer simulation conducted to correct for the guessing effect. Full-information factor analysis also corrects for omitted items. The present version of TESTFACT handles up to five factors and 150 items. A preliminary smoothing of the tetrachoric…
Descriptors: Comparative Analysis, Computer Simulation, Computer Software, Correlation
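Full-information item factor analysis starts from the matrix of tetrachoric correlations between item pairs. As a quick illustration of what a tetrachoric correlation is, the classical "cosine-pi" approximation for a 2x2 table of two dichotomous items can be sketched as follows (a rough approximation only, not what TESTFACT itself computes):

```python
import math

def tetrachoric_cos_pi(a, b, c, d):
    """Cosine-pi approximation to the tetrachoric correlation.

    a, d : counts of concordant responses (both right / both wrong)
    b, c : counts of discordant responses
    """
    if b == 0 or c == 0:
        return 1.0  # degenerate table: perfect positive association
    return math.cos(math.pi / (1.0 + math.sqrt((a * d) / (b * c))))

# Example 2x2 table for two items (hypothetical counts).
print(tetrachoric_cos_pi(a=40, b=10, c=15, d=35))
```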
Marco, Gary L.; And Others – 1985
Three item response models were evaluated for estimating item parameters and equating test scores. The models, which approximated the traditional three-parameter model, included: (1) the Rasch one-parameter model, operationalized in the BICAL computer program; (2) an approximate three-parameter logistic model based on coarse group data divided…
Descriptors: College Entrance Examinations, Comparative Analysis, Computer Software, Equated Scores
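Equating with IRT requires placing item parameter estimates from separate calibrations on a common scale; one standard choice is the mean-sigma linking transformation estimated from anchor items. A short sketch (this linking method is a generic illustration, not necessarily the one used in the study):

```python
import numpy as np

def mean_sigma_link(b_from, b_to):
    """Linear linking constants putting scale 'from' onto scale 'to',
    estimated from difficulty values of common (anchor) items.

    Returns (A, B) such that b* = A * b + B and a* = a / A.
    """
    A = np.std(b_to, ddof=1) / np.std(b_from, ddof=1)
    B = np.mean(b_to) - A * np.mean(b_from)
    return A, B

# Hypothetical anchor-item difficulties from two calibration runs.
b_x = np.array([-1.2, -0.4, 0.1, 0.8, 1.5])
b_y = np.array([-1.0, -0.2, 0.3, 1.1, 1.9])
A, B = mean_sigma_link(b_x, b_y)
print(A, B, A * b_x + B)  # b_x rescaled onto the Y metric
```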
Buhr, Dianne C.; Algina, James – 1986
The focus of this study is the estimation procedures implemented in the BILOG computer program. One purpose is to compare the item parameter estimates produced by the various procedures available in BILOG. Four different models are used: the one-, two-, and three-parameter models, and a three-parameter model with a common guessing parameter. The results…
Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Computer Oriented Programs
Hambleton, Ronald K.; And Others – 1987
The study compared two promising item response theory (IRT) item-selection methods, optimal and content-optimal, with two non-IRT item selection methods, random and classical, for use in fixed-length certification exams. The four methods were used to construct 20-item exams from a pool of approximately 250 items taken from a 1985 certification…
Descriptors: Comparative Analysis, Content Validity, Cutting Scores, Difficulty Level
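"Optimal" IRT item selection for a fixed-length certification exam typically means choosing the items with maximum information at the cut score. A sketch using the standard 3PL item information function (the cut score and parameter ranges are hypothetical):

```python
import numpy as np

def info_3pl(theta, a, b, c):
    """3PL item information function at ability theta."""
    p = c + (1 - c) / (1 + np.exp(-1.7 * a * (theta - b)))
    return (1.7 * a) ** 2 * ((1 - p) / p) * ((p - c) / (1 - c)) ** 2

rng = np.random.default_rng(3)
a = rng.uniform(0.5, 2.0, 250)   # pool of ~250 items, as in the study
b = rng.normal(size=250)
c = rng.uniform(0.0, 0.25, 250)

cut = 0.0                                # hypothetical cut score
info = info_3pl(cut, a, b, c)
chosen = np.argsort(info)[::-1][:20]     # the 20 most informative items
print(chosen)
```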
Phillips, Gary W. – 1982
This paper presents an introduction to the use of latent trait models for the estimation of domain scores. It was shown that these models provided an advantage over classical test theory and binomial error models in that unbiased estimates of true domain scores could be obtained even when items were not randomly selected from a universe of items.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Estimation (Mathematics), Goodness of Fit
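Under a latent trait model, the true domain score is the expected proportion correct over the item domain, i.e. the average of the item response functions at the examinee's ability. A minimal sketch (the 3PL response function is redefined here so the block stands alone; parameters are hypothetical):

```python
import numpy as np

def p_3pl(theta, a, b, c):
    return c + (1 - c) / (1 + np.exp(-1.7 * a * (theta - b)))

def domain_score(theta, a, b, c):
    """Estimated true domain score: mean item response
    probability across the items representing the domain."""
    return np.mean(p_3pl(theta, a, b, c))

rng = np.random.default_rng(4)
a, b, c = rng.uniform(0.5, 2, 40), rng.normal(size=40), rng.uniform(0, 0.25, 40)
print(domain_score(theta=0.3, a=a, b=b, c=c))  # expected proportion correct
```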
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests
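For criterion-referenced mastery tests, item discrimination is usually defined against the mastery cut rather than the total-score continuum, e.g. Brennan's B-index: the difference in item difficulty between examinees who passed and failed the test. A minimal sketch of that one index (the abstract does not specify which four indices the study compared):

```python
import numpy as np

def b_index(responses, total_cut):
    """Brennan's B-index for each item: p(masters) - p(nonmasters),
    where mastery is defined by a cut on the total test score.

    responses : (n_examinees, n_items) binary matrix
    total_cut : minimum total score counted as mastery
    """
    total = responses.sum(axis=1)
    masters = responses[total >= total_cut]
    nonmasters = responses[total < total_cut]
    return masters.mean(axis=0) - nonmasters.mean(axis=0)

rng = np.random.default_rng(5)
resp = (rng.random((110, 20)) < 0.7).astype(int)  # 110 simulated examinees
print(b_index(resp, total_cut=14))                # 70% mastery cut
```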