Sijtsma, Klaas – International Journal of Testing, 2009
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Descriptors: Construct Validity, Reliability, Classification, Test Theory
Bulut, Okan; Kan, Adnan – Eurasian Journal of Educational Research, 2012
Problem Statement: Computerized adaptive testing (CAT) is a sophisticated and efficient way of delivering examinations. In CAT, items for each examinee are selected from an item bank based on the examinee's responses to the items. In this way, the difficulty level of the test is adjusted based on the examinee's ability level. Instead of…
Descriptors: Adaptive Testing, Computer Assisted Testing, College Entrance Examinations, Graduate Students
Huo, Yan – ProQuest LLC, 2009
Variable-length computerized adaptive testing (CAT) can provide examinees with tailored test lengths. With the fixed standard error of measurement ("SEM") termination rule, variable-length CAT can achieve predetermined measurement precision by using relatively shorter tests compared to fixed-length CAT. To explore the application of…
Descriptors: Test Length, Test Items, Adaptive Testing, Item Analysis
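The fixed-SEM termination rule this abstract refers to can be illustrated with a minimal sketch. This is not the authors' implementation; it assumes a Rasch model, a hypothetical `sem_target` of 0.30, and a maximum test length as the truncation point. The test stops once the standard error of measurement at the current ability estimate drops below the target.

```python
import math

def rasch_info(theta, b):
    """Fisher information of a Rasch item with difficulty b at ability theta."""
    p = 1.0 / (1.0 + math.exp(-(theta - b)))
    return p * (1.0 - p)

def variable_length_stop(theta, administered_bs, sem_target=0.30, max_items=40):
    """Fixed-SEM termination rule (illustrative): stop once the standard
    error of measurement at the current ability estimate falls at or below
    sem_target, or when the maximum test length is reached."""
    info = sum(rasch_info(theta, b) for b in administered_bs)
    if info == 0.0:
        return False
    sem = 1.0 / math.sqrt(info)
    return sem <= sem_target or len(administered_bs) >= max_items
```

With well-targeted items (difficulty near ability), information accrues fastest, which is why variable-length CAT reaches the precision target with fewer items than a fixed-length test of average targeting.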
DeMars, Christine E. – Journal of Educational and Behavioral Statistics, 2009
The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes. When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…
Descriptors: Regression (Statistics), Test Bias, Error of Measurement, True Scores
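As background for the MH procedure discussed above, here is a minimal sketch of the Mantel-Haenszel common odds ratio for a single item, computed over strata of examinees matched on observed score. The data layout and the ETS delta transform are standard; the function names are my own, and this is illustrative rather than the study's code.

```python
import math

def mh_odds_ratio(strata):
    """Mantel-Haenszel common odds ratio for one item.

    strata: list of (A, B, C, D) tuples, one per matched-score level,
    where A/B are reference-group correct/incorrect counts and C/D are
    focal-group correct/incorrect counts.  A ratio near 1.0 indicates
    no DIF at this item."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return num / den

def mh_delta(alpha_mh):
    """ETS delta scale: values near 0 indicate negligible DIF."""
    return -2.35 * math.log(alpha_mh)
```

The inflation DeMars describes arises because the matching variable (observed number-correct score) is an imperfect proxy for true score when groups differ in mean ability, so even DIF-free items can show apparently non-null odds ratios.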
Cheng, Ying-Yao; Wang, Wen-Chung; Ho, Yi-Hui – Educational and Psychological Measurement, 2009
Educational and psychological tests are often composed of multiple short subtests, each measuring a distinct latent trait. Unfortunately, short subtests suffer from low measurement precision, which makes the bandwidth-fidelity dilemma inevitable. In this study, the authors demonstrate how a multidimensional Rasch analysis can be employed to take…
Descriptors: Item Response Theory, Measurement, Correlation, Measures (Individuals)
NCME 2008 Presidential Address: The Impact of Anchor Test Configuration on Student Proficiency Rates
Fitzpatrick, Anne R. – Educational Measurement: Issues and Practice, 2008
Examined in this study were the effects of reducing anchor test length on student proficiency rates for 12 multiple-choice tests administered in an annual, large-scale, high-stakes assessment. The anchor tests contained 15 items, 10 items, or 5 items. Five content representative samples of items were drawn at each anchor test length from a…
Descriptors: Test Length, Multiple Choice Tests, Item Sampling, Student Evaluation
Liang, Tie; Wells, Craig S. – Educational and Psychological Measurement, 2009
Investigating the fit of a parametric model is an important part of the measurement process when implementing item response theory (IRT), but research examining it is limited. A general nonparametric approach for detecting model misfit, introduced by J. Douglas and A. S. Cohen (2001), has exhibited promising results for the two-parameter logistic…
Descriptors: Sample Size, Nonparametric Statistics, Item Response Theory, Goodness of Fit
de la Torre, Jimmy; Song, Hao – Applied Psychological Measurement, 2009
Assessments consisting of different domains (e.g., content areas, objectives) are typically multidimensional in nature but are commonly assumed to be unidimensional for estimation purposes. The different domains of these assessments are further treated as multi-unidimensional tests for the purpose of obtaining diagnostic information. However, when…
Descriptors: Ability, Tests, Item Response Theory, Data Analysis
Ackerman, Phillip L.; Kanfer, Ruth – Journal of Experimental Psychology: Applied, 2009
Person and situational determinants of cognitive ability test performance and subjective reactions were examined in the context of tests with different time-on-task requirements. Two hundred thirty-nine first-year university students participated in a within-participant experiment, with completely counterbalanced treatment conditions and test…
Descriptors: Test Length, Fatigue (Biology), Cognitive Ability, College Students
Willse, John T.; Goodman, Joshua T. – Educational and Psychological Measurement, 2008
This research provides a direct comparison of effect size estimates based on structural equation modeling (SEM), item response theory (IRT), and raw scores. Differences between the SEM, IRT, and raw score approaches are examined under a variety of data conditions (IRT models underlying the data, test lengths, magnitude of group differences, and…
Descriptors: Test Length, Structural Equation Models, Effect Size, Raw Scores
Klockars, Alan J.; Lee, Yoonsun – Journal of Educational Measurement, 2008
Monte Carlo simulations with 20,000 replications are reported to estimate the probability of rejecting the null hypothesis regarding DIF using SIBTEST when there is DIF present and/or when impact is present due to differences on the primary dimension to be measured. Sample sizes are varied from 250 to 2000 and test lengths from 10 to 40 items.…
Descriptors: Test Bias, Test Length, Reference Groups, Probability
Choi, Namok; Fuqua, Dale R.; Newman, Jody L. – Educational and Psychological Measurement, 2009
The short form of the Bem Sex Role Inventory (BSRI) contains half as many items as the long form and yet has often demonstrated better reliability and validity. This study uses exploratory and confirmatory factor analytic methods to examine the structure of the short form of the BSRI. A structure noted elsewhere also emerged here, consisting of…
Descriptors: Sex Role, Measures (Individuals), Test Length, Gender Differences
Finch, Holmes – Applied Psychological Measurement, 2010
The accuracy of item parameter estimates in the multidimensional item response theory (MIRT) model context is one that has not been researched in great detail. This study examines the ability of two confirmatory factor analysis models specifically for dichotomous data to properly estimate item parameters using common formulae for converting factor…
Descriptors: Item Response Theory, Computation, Factor Analysis, Models
Pennsylvania Department of Education, 2010
This handbook describes the responsibilities of district and school assessment coordinators in the administration of the Pennsylvania System of School Assessment (PSSA). This updated guidebook contains the following sections: (1) General Assessment Guidelines for All Assessments; (2) Writing Specific Guidelines; (3) Reading and Mathematics…
Descriptors: Guidelines, Guides, Educational Assessment, Writing Tests
Finkelman, Matthew – Journal of Educational and Behavioral Statistics, 2008
Sequential mastery testing (SMT) has been researched as an efficient alternative to paper-and-pencil testing for pass/fail examinations. One popular method for determining when to cease examination in SMT is the truncated sequential probability ratio test (TSPRT). This article introduces the application of stochastic curtailment in SMT to shorten…
Descriptors: Mastery Tests, Sequential Approach, Computer Assisted Testing, Adaptive Testing
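The truncated sequential probability ratio test (TSPRT) underlying Finkelman's work can be sketched as follows. This is an illustrative binomial SPRT with hypothetical cut points (`p0`, `p1`) and error rates, not the article's procedure: after each item the log-likelihood ratio is compared against Wald's bounds, and the test is truncated (forced to decide) at `n_max` items.

```python
import math

def tsprt_decision(correct, n, n_max,
                   p0=0.6, p1=0.8, alpha=0.05, beta=0.05):
    """Truncated SPRT for a pass/fail (mastery) decision on binary scores.

    Returns 'pass', 'fail', or 'continue'.  p0/p1 are the non-mastery and
    mastery proportion-correct hypotheses (illustrative values); alpha and
    beta are the nominal error rates.  At n_max the decision is forced
    using the midpoint of p0 and p1 as the cut score."""
    llr = (correct * math.log(p1 / p0)
           + (n - correct) * math.log((1 - p1) / (1 - p0)))
    upper = math.log((1 - beta) / alpha)   # pass boundary
    lower = math.log(beta / (1 - alpha))   # fail boundary
    if llr >= upper:
        return "pass"
    if llr <= lower:
        return "fail"
    if n >= n_max:  # truncation: force a decision at the cutoff midpoint
        return "pass" if correct / n >= (p0 + p1) / 2 else "fail"
    return "continue"
```

Stochastic curtailment, the refinement the abstract introduces, would additionally stop early whenever the eventual TSPRT decision is already (nearly) certain given the responses so far.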