ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	15

Descriptor

Mathematical Models	367
Test Items	367
Latent Trait Theory	143
Test Construction	110
Item Analysis	109
Difficulty Level	102
Item Response Theory	102
Estimation (Mathematics)	87
Equations (Mathematics)	80
Goodness of Fit	61
Statistical Analysis	54
Maximum Likelihood Statistics	50
Comparative Analysis	46
Computer Assisted Testing	45
Test Theory	42
Adaptive Testing	41
Computer Simulation	41
Simulation	41
Achievement Tests	38
Measurement Techniques	37
Testing Problems	37
Factor Analysis	35
Higher Education	35
Item Banks	35
Multiple Choice Tests	35
More ▼

Education Level

Higher Education	2
Grade 4	1
Grade 8	1
Postsecondary Education	1

Audience

Researchers	49
Practitioners	1

Location

Netherlands	5
United States	3
Australia	2
Belgium	2
Italy	2
California	1
China	1
Denmark	1
Florida	1
Georgia	1
Hungary	1
Israel	1
Japan	1
Ohio	1
South Carolina	1
Taiwan	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Test Items X

Showing 166 to 180 of 367 results Save | Export

An Alternative Interpretation of Three Stability Models. Measurement and Methodology, Work Unit 2: Technical Adequacy of Tests.

Wilcox, Rand R. – 1978

Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…

Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models

Theoretical and Practical Consequences of the Use of Standardized Residuals as Rasch Model Fit Statistics.

Download full text

George, Archie A. – 1979

The appropriateness of the use of the standardized residual (SR) to assess congruence between sample test item responses and the one parameter latent trait (Rasch) item characteristic curve is investigated. Latent trait theory is reviewed, as well as theory of the SR, the apparent error in calculating the expected distribution of the SR, and…

Descriptors: Academic Ability, Computer Programs, Difficulty Level, Goodness of Fit

Methods of Assessing Bias and Fairness in Tests.

Merz, William R. – 1980

Several methods of assessing test item bias are described, and the concept of fair use of tests is examined. A test item is biased if individuals of equal ability have different probabilities of attaining the item correct. The following seven general procedures used to examine test items for bias are summarized and discussed: (1) analysis of…

Descriptors: Comparative Analysis, Evaluation Methods, Factor Analysis, Mathematical Models

Estimating Latent Distributions.

Peer reviewed

Mislevy, Robert J. – Psychometrika, 1984

Assuming vectors of item responses depend on ability through a fully specified item response model, this paper presents maximum likelihood equations for estimating the population parameters without estimating an ability parameter for each subject. Asymptotic standard errors, tests of fit, computing approximations, and details of four special cases…

Descriptors: Bayesian Statistics, Estimation (Mathematics), Goodness of Fit, Latent Trait Theory

The One-, Two- and Modified Two-Parameter Latent Trait Models: An Empirical Study of Relative Fit.

Peer reviewed

Albanese, Mark A.; Forsyth, Robert A. – Educational and Psychological Measurement, 1984

The purpose of this study was to compare the relative robustness of the one-, two-, and modified two-parameter latent trait logistic models for the Iowa Tests of Educational Development. Results suggest that the modified two-parameter model may provide the best representation of the data. (Author/BW)

Descriptors: Achievement Tests, Comparative Analysis, Goodness of Fit, Item Analysis

Rasch Measurement Transactions, Part 1.

Linacre, John M., Ed. – 1995

This volume and its companion, "Part 2," bring together transactions of the Rasch measurement special interest group of the American Educational Research Association. This volume opens with a discussion of the early years in Rasch measurement and then presents the "transactions" in chronological order, from a 1987 discussion…

Descriptors: Educational Assessment, Educational Research, Elementary Secondary Education, Item Response Theory

Rasch Measurement Transactions, Part 2.

Linacre, John M., Ed. – 1996

This volume and its companion, "Part 1," bring together transactions of the Rasch measurement special interest group of the American Educational Research Association. It presents "transactions" in chronological order, from a 1992 discussion through the winter 1995 volume. Four issues of the "Transactions" are…

Descriptors: Educational Assessment, Educational Research, Elementary Secondary Education, Item Response Theory

A Note on Vertical Equating via the Rasch Model for Groups of Quite Different Ability and Tests of Quite Different Difficulty.

Peer reviewed

Slinde, Jeffrey A.; Linn, Robert L. – Journal of Educational Measurement, 1979

The Rasch model was used to equate reading comprehension tests of widely different difficulty for three groups of fifth grade students of widely different ability. Under these extreme circumstances, the Rasch model equating was unsatisfactory. (Author/CTM)

Descriptors: Academic Ability, Bias, Difficulty Level, Equated Scores

Item Bias Detection Using Loglinear IRT.

Peer reviewed

Kelderman, Henk – Psychometrika, 1989

A method is proposed for the detection of item bias with respect to observed or unobserved subgroups, using a loglinear item response theory model assuming a Rasch model for ability and difficulty. A simulation study was performed with 200 sets of data to check the robustness of the method. (SLD)

Descriptors: Equations (Mathematics), Foreign Countries, Higher Education, Item Response Theory

Testing for DIF in a Model with Single Peaked Item Characteristic Curves: The PARELLA Model.

Peer reviewed

Hoijtink, Herbert; Molenaar, Ivo W. – Psychometrika, 1992

The PARallELogram Analysis (PARELLA) model is a probabilistic parallelogram model that can be used for the measurement of latent attitudes or latent preferences. A method is presented for testing for differential item functioning (DIF) for the PARELLA model using the approach of D. Thissen and others (1988). (SLD)

Descriptors: Attitude Measures, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)

Interpreting Scales through Scale Anchoring.

Peer reviewed

Beaton, Albert E.; Allen, Nancy L. – Journal of Educational Statistics, 1992

The National Assessment of Educational Progress (NAEP) makes possible comparison of groups of students and provides information about what these groups know and can do. The scale anchoring techniques described in this chapter address the latter purpose. The direct method and the smoothing method of scale anchoring are discussed. (SLD)

Descriptors: Comparative Testing, Educational Assessment, Elementary Secondary Education, Knowledge Level

Constructing the Exact Significance Level for a Person-Fit Statistic.

Peer reviewed

Liou, Michelle; Chang, Chih-Hsin – Psychometrika, 1992

An extension is proposed for the network algorithm introduced by C.R. Mehta and N.R. Patel to construct exact tail probabilities for testing the general hypothesis that item responses are distributed according to the Rasch model. A simulation study indicates the efficiency of the algorithm. (SLD)

Descriptors: Algorithms, Computer Simulation, Difficulty Level, Equations (Mathematics)

Ordinal Test Fidelity Estimated by an Item Sampling Model.

Peer reviewed

Cliff, Norman; Donoghue, John R. – Psychometrika, 1992

A test theory using only ordinal assumptions is presented, based on the idea that the test items are a sample from a universe of items. The sum across items of the ordinal relations for a pair of persons on the universe items is analogous to a true score. (SLD)

Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Response Theory, Item Sampling

Item Selection Using an Average Growth Approximation of Target Information Functions.

Peer reviewed

Luecht, Richard M.; Hirsch, Thomas M. – Applied Psychological Measurement, 1992

Derivations of several item selection algorithms for use in fitting test items to target information functions (IFs) are described. These algorithms, which use an average growth approximation of target IFs, were tested by generating six test forms and were found to provide reliable fit. (SLD)

Descriptors: Algorithms, Computer Assisted Testing, Equations (Mathematics), Goodness of Fit

A Cluster-Based Method for Test Construction.

Peer reviewed

Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1990

A new test construction model based on the Rasch model is proposed. This model, the cluster-based method, considers groups of interchangeable items rather than individual items and uses integer programing. Results for six test construction problems indicate that the method produces accurate results in small amounts of time. (SLD)

Descriptors: Cluster Analysis, Computer Assisted Testing, Equations (Mathematics), Item Banks

« Previous Page | Next Page »

Pages: 1 | ... | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | ... | 25

Journal of Educational…	31
Applied Psychological…	30
Psychometrika	21
Educational and Psychological…	10
Journal of Educational…	9
Applied Measurement in…	4
Journal of Educational and…	3
Journal of Experimental…	3
ProQuest LLC	3
Measurement:…	2
Multivariate Behavioral…	2
Online Submission	2
Australian Mathematics Teacher	1
Chemical Engineering Education	1
Contemporary Educational…	1
Developmental Review	1
Educational Measurement:…	1
Educational Technology &…	1
Evaluation in Education:…	1
Hacettepe University Journal…	1
Intelligence	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of Education…	1
Journal of Education for…	1
More ▼

Reckase, Mark D.	23
Samejima, Fumiko	18
Ackerman, Terry A.	11
McKinley, Robert L.	11
Hambleton, Ronald K.	8
Wilcox, Rand R.	8
Kelderman, Henk	7
Mislevy, Robert J.	7
Spray, Judith A.	7
Douglass, James B.	6
Nandakumar, Ratna	6
Wright, Benjamin D.	6
van der Linden, Wim J.	6
Boekkooi-Timminga, Ellen	5
Cohen, Allan S.	5
Kim, Seock-Ho	5
Adema, Jos J.	4
De Ayala, R. J.	4
Dorans, Neil J.	4
Gustafsson, Jan-Eric	4
Muraki, Eiji	4
Rudner, Lawrence M.	4
Stocking, Martha L.	4
Wainer, Howard	4
More ▼

Reports - Research	207
Journal Articles	129
Reports - Evaluative	126
Speeches/Meeting Papers	117
Reports - Descriptive	12
Guides - Non-Classroom	7
Numerical/Quantitative Data	7
Books	5
Dissertations/Theses -…	4
Opinion Papers	4
Reports - General	4
Collected Works - General	3
Information Analyses	3
Non-Print Media	2
Book/Product Reviews	1
Collected Works - Proceedings	1
Computer Programs	1
ERIC Publications	1
Guides - Classroom - Teacher	1
Guides - General	1
More ▼

SAT (College Admission Test)	11
ACT Assessment	7
National Assessment of…	6
Armed Services Vocational…	3
Comprehensive Tests of Basic…	3
Graduate Record Examinations	3
California Achievement Tests	2
Iowa Tests of Educational…	2
Program for International…	2
Stanford Achievement Tests	2
Graduate Management Admission…	1
Iowa Tests of Basic Skills	1
Medical College Admission Test	1
Minnesota Multiphasic…	1
National Merit Scholarship…	1
Pre Professional Skills Tests	1
Preliminary Scholastic…	1
Raven Progressive Matrices	1
School and College Ability…	1
Stanford Binet Intelligence…	1
Stanford Diagnostic Reading…	1
Stanford Early School…	1
Students Evaluation of…	1
More ▼