ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	24
Since 2007 (last 20 years)	61

Descriptor

Comparative Analysis	99
Test Length	99
Item Response Theory	42
Test Items	40
Sample Size	31
Computer Assisted Testing	28
Simulation	27
Adaptive Testing	20
Test Format	20
Error of Measurement	17
Scores	17
Statistical Analysis	16
Test Reliability	16
Item Analysis	14
Models	14
Correlation	13
Monte Carlo Methods	13
Test Validity	13
Accuracy	12
Difficulty Level	12
Higher Education	12
Computation	11
Mathematical Models	11
Maximum Likelihood Statistics	11
Classification	10
More ▼

Publication Type

Reports - Research	66
Journal Articles	56
Speeches/Meeting Papers	20
Reports - Evaluative	19
Dissertations/Theses -…	12
Numerical/Quantitative Data	2
Tests/Questionnaires	2
Information Analyses	1
Reports - Descriptive	1

Education Level

Higher Education	8
Postsecondary Education	6
Elementary Secondary Education	3
Secondary Education	3
Elementary Education	2
High Schools	2
Grade 6	1
Grade 7	1
Intermediate Grades	1
Middle Schools	1

Audience

Researchers

Location

Turkey	4
Asia	1
Canada	1
China	1
Michigan	1
Netherlands	1
Singapore	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	3
Wechsler Adult Intelligence…	3
Kaufman Brief Intelligence…	2
Minnesota Multiphasic…	2
ACTFL Oral Proficiency…	1
Advanced Placement…	1
Center for Epidemiologic…	1
Law School Admission Test	1
Marlowe Crowne Social…	1
NEO Five Factor Inventory	1
Program for International…	1
SAT (College Admission Test)	1
School and College Ability…	1
Sensation Seeking Scale	1
Trends in International…	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Comparative Analysis X

Showing 76 to 90 of 99 results Save | Export

Examining Replication Effects in Rasch Fit Statistics.

Download full text

Schumacker, Randall E.; And Others – 1994

Rasch between and total weighted and unweighted fit statistics were compared using varying test lengths and sample sizes. Two test lengths (20 and 50 items) and three sample sizes (150, 500, and 1,000 were crossed. Each of the six combinations were replicated 100 times. In addition, power comparisons were made. Results indicated that there were no…

Descriptors: Comparative Analysis, Goodness of Fit, Item Response Theory, Power (Statistics)

Comparison of Item Targeting Strategies for Pass/Fail Computer Adaptive Tests.

Download full text

Bergstrom, Betty A.; Gershon, Richard – 1992

The most useful method of item selection for making pass-fail decisions with a Computerized Adaptive Test (CAT) was studied. Medical technology students (n=86) took a computer adaptive test in which items were targeted to the ability of the examinee. The adaptive algorithm that selected items and estimated person measures used the Rasch model and…

Descriptors: Adaptive Testing, Algorithms, Comparative Analysis, Computer Assisted Testing

A Comparison of Equating Methods under the Graded Response Model.

Download full text

Cohen, Allan S.; Kim, Seock-Ho – 1993

Equating tests from different calibrations under item response theory (IRT) requires calculation of the slope and intercept of the appropriate linear transformation. Two methods have been proposed recently for equating graded response items under IRT, a test characteristic curve method and a minimum chi-square method. These two methods are…

Descriptors: Chi Square, Comparative Analysis, Computer Simulation, Equated Scores

A Comparison of an Expert Systems Approach to Computerized Adaptive Testing and an Item Response Theory Model.

Download full text

Frick, Theodore W. – 1991

Expert systems can be used to aid decisionmaking. A computerized adaptive test is one kind of expert system, although not commonly recognized as such. A new approach, termed EXSPRT, was devised that combines expert systems reasoning and sequential probability ratio test stopping rules. Two versions of EXSPRT were developed, one with random…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Expert Systems

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Download full text

Kim, Seock-Ho; And Others – 1992

Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…

Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)

The Use of the Sequential Probability Ratio Test in Making Grade Classifications in Conjunction with Tailored Testing.

Download full text

Reckase, Mark D. – 1981

This report describes a study comparing the classification results obtained from a one-parameter and three-parameter logistic based tailored testing procedure used in conjunction with Wald's sequential probability ratio test (SPRT). Eighty-eight college students were classified into four grade categories using achievement test results obtained…

Descriptors: Adaptive Testing, Classification, Comparative Analysis, Computer Assisted Testing

Thin versus Thick Matching in the Mantel-Haenszel Procedure for Detecting DIF.

Peer reviewed

Donoghue, John R.; Allen, Nancy L. – Journal of Educational Statistics, 1993

Forming the matching variable for the Mantel-Haenszel differential item functioning (DIF) procedure through use of the total score as the matching variable (thin) and forming the matching variable by pooling total score levels (thick) were compared in a Monte Carlo study. Reasons thick matching is superior are discussed. (SLD)

Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Graphs

Comparison of Difficulties and Reliabilities of Math-Completion and Multiple-Choice Item Formats.

Download full text

Oosterhof, Albert C.; Coats, Pamela K. – 1981

Instructors who develop classroom examinations that require students to provide a numerical response to a mathematical problem are often very concerned about the appropriateness of the multiple-choice format. The present study augments previous research relevant to this concern by comparing the difficulty and reliability of multiple-choice and…

Descriptors: Comparative Analysis, Difficulty Level, Grading, Higher Education

Effects of Test Length and Advancement Score on Several Criterion-Referenced Test Reliability and Validity Indices. Laboratory of Psychometric and Evaluation Research Report No. 86.

Download full text

Eignor, Daniel R.; Hambleton, Ronald K. – 1979

The purpose of the investigation was to obtain some relationships among (1) test lengths, (2) shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of variables under study were set to be…

Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores

An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

Download full text

Wainer, Howard; And Others – 1990

The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing

Methods for Equating Mental Tests. Interim Report for Period March 1982-October 1984.

Download full text

Gialluca, Kathleen A.; And Others – 1984

In this study, simulated and actual Air Force test data were used to compare the different procedures for equating mental tests: conventional (equipercentile and linear), Item Response Theory (IRT), and strong true-score theory (STST); data collection designs used were single-group, equivalent-groups, and anchor test. Equating transformations were…

Descriptors: Adults, Cognitive Ability, Cognitive Tests, Comparative Analysis

A Comparison of Reliability Estimates from Single and Double Administrations of Criterion-Referenced Tests.

Schaefer, Mary M.; Gross, Susan K. – 1983

Viewing the reliability for criterion-referenced tests as that of mastery classification decisions, three models for determining reliability were examined using two test administrations so that two estimates could be compared to a standard. A major purpose of the research was to determine how several reliability coefficients (coefficient kappa, an…

Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores

Optimal Item Selection with Credentialing Examinations.

Download full text

Hambleton, Ronald K.; And Others – 1987

The study compared two promising item response theory (IRT) item-selection methods, optimal and content-optimal, with two non-IRT item selection methods, random and classical, for use in fixed-length certification exams. The four methods were used to construct 20-item exams from a pool of approximately 250 items taken from a 1985 certification…

Descriptors: Comparative Analysis, Content Validity, Cutting Scores, Difficulty Level

A Validity Comparison of Adaptive and Conventional Strategies for Mastery Testing.

Download full text

Kingsbury, G. Gage; Weiss, David J. – 1981

Conventional mastery tests designed to make optimal mastery classifications were compared with fixed-length and variable-length adaptive mastery tests. Comparisons between the testing procedures were made across five content areas in an introductory biology course from tests administered to volunteers. The criterion was the student's standing in…

Descriptors: Achievement Tests, Adaptive Testing, Biology, Comparative Analysis

A Comparison of a Bayesian and a Maximum Likelihood Tailored Testing Procedure.

Download full text

McKinley, Robert L.; Reckase, Mark D. – 1981

A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…

Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

ProQuest LLC	12
Educational and Psychological…	9
Applied Psychological…	8
ETS Research Report Series	5
Psychological Assessment	4
ACT Education Corp.	3
Applied Measurement in…	3
Educational Sciences: Theory…	2
International Journal of…	2
Journal of Educational…	2
Psychometrika	2
Asia Pacific Education Review	1
College Entrance Examination…	1
Education and Information…	1
Educational Research and…	1
European Journal of Special…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Psychoeducational…	1
Language Testing	1
Learning Disabilities: A…	1
Measurement in Physical…	1
Measurement:…	1
More ▼

Hambleton, Ronald K.	3
Dogan, Nuri	2
Drasgow, Fritz	2
Eggen, Theo J. H. M.	2
Frick, Theodore W.	2
Gessaroli, Marc E.	2
Kelecioglu, Hülya	2
Kim, Seock-Ho	2
Lee, Yi-Hsuan	2
Paek, Insu	2
Reckase, Mark D.	2
Schumacker, Randall E.	2
Weiss, David J.	2
Zhang, Jinming	2
Allan S. Cohen	1
Allen, Nancy L.	1
Allspach, Jill R.	1
Ann Arthur	1
Arsan, Nihan	1
Atalay Kabasakal, Kübra	1
Bazaldua, Diego A. Luna	1
Bejar, Isaac I.	1
Benton, Tom	1
Bergstrom, Betty A.	1
More ▼