Showing 301 to 315 of 636 results
Peer reviewed
Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…
Descriptors: Test Length, Test Content, Simulation, Computation
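As a point of reference for the general technique, here is a minimal Python sketch of a nonparametric bootstrap standard error for an equipercentile-equated score at a single score point; the function names, score distributions, and sample sizes are invented for illustration and are not taken from the article. The parametric variant would instead resample from fitted (e.g., smoothed) score distributions.

    import numpy as np

    def equipercentile(x_scores, y_scores, x_point):
        """Map x_point on form X to the form-Y score with the same percentile rank."""
        p = np.mean(x_scores <= x_point)        # percentile rank of x_point on form X
        return np.quantile(y_scores, p)         # form-Y score at that percentile

    def bootstrap_se(x_scores, y_scores, x_point, reps=1000, seed=None):
        """Nonparametric bootstrap SE: resample examinees, re-equate, take the SD."""
        rng = np.random.default_rng(seed)
        eq = []
        for _ in range(reps):
            xb = rng.choice(x_scores, size=len(x_scores), replace=True)
            yb = rng.choice(y_scores, size=len(y_scores), replace=True)
            eq.append(equipercentile(xb, yb, x_point))
        return np.std(eq, ddof=1)               # SD over replications = bootstrap SE

    # toy data: simulated number-correct scores on two 40-item forms
    rng = np.random.default_rng(0)
    x = rng.binomial(40, 0.60, size=1000)
    y = rng.binomial(40, 0.55, size=1000)
    print(bootstrap_se(x, y, x_point=25, seed=1))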
Peer reviewed
Alsawalmeh, Yousef M.; Feldt, Leonard S. – Educational and Psychological Measurement, 1999
Develops a statistical test of the hypothesis that alpha'(1) = alpha'(2), where alpha'(1) is the Spearman-Brown extrapolated value of Cronbach's alpha reliability for test 1 and alpha'(2) is the unadjusted coefficient for test 2. The test is shown to exercise tight control of Type I error. (Author/SLD)
Descriptors: Reliability, Test Length
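For context, the Spearman-Brown extrapolation that defines alpha'(1) is the standard one; a minimal Python sketch follows (the function name and numbers are illustrative, and the article's test statistic itself is not reproduced here):

    def spearman_brown(alpha, k):
        """Project reliability alpha onto a test k times as long (k may be below 1)."""
        return k * alpha / (1 + (k - 1) * alpha)

    # e.g., alpha = .70 on a 20-item test, extrapolated to a 30-item length
    print(spearman_brown(0.70, k=30 / 20))   # about 0.78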
Peer reviewed
Weissman, Alexander – Educational and Psychological Measurement, 2007
A general approach for item selection in adaptive multiple-category classification tests is provided. The approach uses mutual information (MI), a special case of the Kullback-Leibler distance, or relative entropy. MI works efficiently with the sequential probability ratio test and alleviates the difficulties encountered with using other local-…
Descriptors: Scientific Concepts, Probability, Test Length, Item Analysis
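To illustrate the general idea of mutual-information item selection (not the article's exact formulation), here is a minimal Python sketch for dichotomous items and a posterior over classification categories; all array names and values are invented.

    import numpy as np

    def mutual_information(posterior, p_correct):
        """posterior: shape (C,) over categories; p_correct: shape (C,) = P(correct | category)."""
        p_u_given_c = np.stack([p_correct, 1 - p_correct], axis=1)   # (C, 2)
        p_u = posterior @ p_u_given_c                                # marginal P(u), shape (2,)
        joint = posterior[:, None] * p_u_given_c                     # joint P(c, u)
        with np.errstate(divide="ignore", invalid="ignore"):
            terms = joint * np.log(p_u_given_c / p_u)
        return np.nansum(terms)                                      # MI in nats

    def select_item(posterior, pool_p_correct):
        """Pick the unadministered item with the largest MI; pool_p_correct is (items, C)."""
        mis = [mutual_information(posterior, p) for p in pool_p_correct]
        return int(np.argmax(mis))

    # toy example: 3 categories, 4 candidate items
    post = np.array([0.2, 0.5, 0.3])
    pool = np.array([[0.2, 0.5, 0.8],
                     [0.4, 0.5, 0.6],
                     [0.1, 0.4, 0.9],
                     [0.5, 0.5, 0.5]])
    print(select_item(post, pool))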
Peer reviewed
Full text available on ERIC (PDF)
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2008
The method of maximum likelihood is typically applied to item response theory (IRT) models when the ability parameter is estimated while conditioning on the true item parameters. In practice, the item parameters are unknown and need to be estimated first from a calibration sample. Lewis (1985) and Zhang and Lu (2007) proposed the expected response…
Descriptors: Item Response Theory, Comparative Analysis, Computation, Ability
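A minimal Python sketch of the conventional step the report builds on: maximum-likelihood estimation of ability under a 2PL model with the item parameters treated as known (in practice they are calibration-sample estimates, which is the source of the additional error the report studies). All parameter values below are invented.

    import numpy as np

    def p_correct(theta, a, b):
        """2PL item response function."""
        return 1.0 / (1.0 + np.exp(-a * (theta - b)))

    def mle_theta(responses, a, b, grid=np.linspace(-4, 4, 801)):
        """Grid-search MLE of theta given a 0/1 response vector and item parameters a, b."""
        p = p_correct(grid[:, None], a, b)                           # (grid points, items)
        loglik = (responses * np.log(p) + (1 - responses) * np.log(1 - p)).sum(axis=1)
        return grid[np.argmax(loglik)]

    a = np.array([1.2, 0.8, 1.5, 1.0])    # discriminations (illustrative)
    b = np.array([-0.5, 0.0, 0.7, 1.2])   # difficulties (illustrative)
    u = np.array([1, 1, 0, 0])            # observed responses
    print(mle_theta(u, a, b))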
Bentley-Williams, Robyn; Forbes, Anne – Australian Association for Research in Education (NJ1), 2012
This investigation examined the course experiences of Bachelor of Education Primary students across each year of the course. The aims of the study were to identify gaps in what we know about our students; to identify relevant domains in student experiences and to assist with course improvements. A reflective inquiry paradigm was adopted for…
Descriptors: Foreign Countries, Bachelors Degrees, Preservice Teachers, Student Teacher Attitudes
Wei, Youhua – ProQuest LLC, 2008
Scale linking is the process of developing the connection between scales of two or more sets of parameter estimates obtained from separate test calibrations. It is the prerequisite for many applications of IRT, such as test equating and differential item functioning analysis. Unidimensional scale linking methods have been studied and applied…
Descriptors: Test Length, Test Items, Sample Size, Simulation
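As background, a minimal Python sketch of one standard unidimensional linking method (mean/sigma) applied to common-item difficulty estimates; the multidimensional methods that are the dissertation's focus are not shown, and the numbers below are invented.

    import numpy as np

    def mean_sigma_link(b_old, b_new):
        """Return (A, B) so that b_old is approximated by A * b_new + B, putting the new scale on the old one."""
        A = np.std(b_old, ddof=1) / np.std(b_new, ddof=1)
        B = np.mean(b_old) - A * np.mean(b_new)
        return A, B

    b_old = np.array([-1.1, -0.3, 0.2, 0.9, 1.5])   # common-item difficulties, old calibration
    b_new = np.array([-0.8, -0.1, 0.3, 1.1, 1.6])   # same items, new calibration
    A, B = mean_sigma_link(b_old, b_new)
    print(A, B)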
Peer reviewed
Woods, Carol M. – Applied Psychological Measurement, 2008
In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…
Descriptors: Test Length, Computation, Item Response Theory, Maximum Likelihood Statistics
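For reference, the three-parameter logistic (3PL) item response function that RC-IRT pairs with a Ramsay-curve estimate of the latent density; a minimal Python sketch with invented parameter values (the density-estimation step itself is not reproduced here).

    import numpy as np

    def p_3pl(theta, a, b, c):
        """P(correct | theta) under the 3PL: guessing floor c plus a logistic in a * (theta - b)."""
        return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

    print(p_3pl(theta=0.0, a=1.3, b=-0.2, c=0.2))   # illustrative values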
Peer reviewed
Glas, Cees A. W.; Pimentel, Jonald L. – Educational and Psychological Measurement, 2008
In tests with time limits, items at the end are often not reached. Usually, the pattern of missing responses depends on the ability level of the respondents; therefore, missing data are not ignorable in statistical inference. This study models data using a combination of two item response theory (IRT) models: one for the observed response data and…
Descriptors: Intelligence Tests, Statistical Inference, Item Response Theory, Modeling (Psychology)
Wu, Margaret – OECD Publishing (NJ1), 2010
This paper makes an in-depth comparison of the PISA (OECD) and TIMSS (IEA) mathematics assessments conducted in 2003. First, a comparison of survey methodologies is presented, followed by an examination of the mathematics frameworks in the two studies. The methodologies and the frameworks in the two studies form the basis for providing…
Descriptors: Mathematics Achievement, Foreign Countries, Gender Differences, Comparative Analysis
Peer reviewed
Wells, Craig S.; Bolt, Daniel M. – Applied Measurement in Education, 2008
Tests of model misfit are often performed to validate the use of a particular model in item response theory. Douglas and Cohen (2001) introduced a general nonparametric approach for detecting misfit under the two-parameter logistic model. However, the statistical properties of their approach, and empirical comparisons to other methods, have not…
Descriptors: Test Length, Test Items, Monte Carlo Methods, Nonparametric Statistics
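To illustrate the general flavor of a nonparametric misfit check under the two-parameter logistic model (an illustration only, not Douglas and Cohen's 2001 statistic), a minimal Python sketch that compares a kernel-smoothed empirical item characteristic curve with the fitted 2PL curve; the data below are simulated.

    import numpy as np

    def p_2pl(theta, a, b):
        return 1.0 / (1.0 + np.exp(-a * (theta - b)))

    def kernel_icc(theta_hat, u, grid, bandwidth=0.3):
        """Nadaraya-Watson estimate of P(correct | theta) at each grid point."""
        w = np.exp(-0.5 * ((grid[:, None] - theta_hat) / bandwidth) ** 2)   # Gaussian kernel weights
        return (w * u).sum(axis=1) / w.sum(axis=1)

    def rmsd_misfit(theta_hat, u, a, b, grid=np.linspace(-3, 3, 61)):
        """Root-mean-square difference between empirical and model-implied curves."""
        return np.sqrt(np.mean((kernel_icc(theta_hat, u, grid) - p_2pl(grid, a, b)) ** 2))

    rng = np.random.default_rng(1)
    theta_hat = rng.normal(size=2000)                       # ability estimates (simulated)
    u = rng.binomial(1, p_2pl(theta_hat, a=1.0, b=0.0))     # responses generated from a well-fitting 2PL
    print(rmsd_misfit(theta_hat, u, a=1.0, b=0.0))          # small value suggests little misfit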
Peer reviewed
Edwards, Roger H. – Educational and Psychological Measurement, 1981
Under certain conditions, the validity Coefficient of Effective Length (CEL) can produce highly misleading results. A modified coefficient is suggested for use when empirical studies indicate that underlying assumptions have been violated. (Author/BW)
Descriptors: Mathematical Formulas, Test Construction, Test Length
Peer reviewed
Bridgeman, Brent; Trapani, Catherine; Curley, Edward – Journal of Educational Measurement, 2004
The impact on SAT I: Reasoning Test scores of allowing more time for each question was estimated by embedding sections with a reduced number of questions into the standard 30-minute equating section of two national test administrations. Thus, for example, questions were deleted from a verbal section that contained 35 questions to produce forms…
Descriptors: College Entrance Examinations, Test Length, Scores
Peer reviewed
Full text available on ERIC (PDF)
Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007
This study explored the effects of external anchor test length on final equating results for several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) post-stratification equating (PSE) with optimal bandwidths, and KE PSE linear (large bandwidths), when using the nonequivalent groups anchor test…
Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length
Peer reviewed
Woods, Carol M. – Applied Psychological Measurement, 2007
Ramsay curve item response theory (RC-IRT) was recently developed to detect and correct for nonnormal latent variables when unidimensional IRT models are fitted to data using maximum marginal likelihood estimation. The purpose of this research is to evaluate the performance of RC-IRT for Likert-type item responses with varying test lengths, sample…
Descriptors: Test Length, Item Response Theory, Sample Size, Comparative Analysis
Peer reviewed
Frazier, Thomas W.; Youngstrom, Eric A. – Intelligence, 2007
A historical increase in the number of factors purportedly measured by commercial tests of cognitive ability may result from four distinct pressures, including increasingly complex models of intelligence, test publishers' desires to provide clinically useful assessment instruments with greater interpretive value, test publishers' desires to…
Descriptors: Evaluation Criteria, Factor Structure, Cognitive Ability, Intelligence Tests