ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	20
Since 2007 (last 20 years)	50

Descriptor

Correlation	68
Test Length	68
Item Response Theory	32
Test Items	30
Sample Size	25
Simulation	17
Statistical Analysis	15
Accuracy	14
Comparative Analysis	13
Computation	12
Models	12
Test Reliability	12
Difficulty Level	11
Factor Analysis	11
Item Analysis	9
Test Validity	9
Computer Assisted Testing	8
Adaptive Testing	7
Monte Carlo Methods	7
Test Construction	7
Evaluation Methods	6
Foreign Countries	6
Scores	6
Test Format	6
Equated Scores	5
More ▼

Publication Type

Journal Articles	52
Reports - Research	47
Reports - Evaluative	12
Speeches/Meeting Papers	9
Dissertations/Theses -…	4
Information Analyses	1

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Education	3
Elementary Secondary Education	2
Early Childhood Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Intermediate Grades	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Researchers

Location

Turkey	3
Germany	1
Ghana	1
Singapore	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Wechsler Intelligence Scale…	3
General Educational…	1
Kaufman Brief Intelligence…	1
MacArthur Communicative…	1
McCarthy Scales of Childrens…	1
Self Description Questionnaire	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scales…	1

What Works Clearinghouse Rating

Correlation X

Showing 31 to 45 of 68 results Save | Export

Assessing Dimensionality of Noncompensatory Multidimensional Item Response Theory with Complex Structures

Peer reviewed

Direct link

Svetina, Dubravka – Educational and Psychological Measurement, 2013

The purpose of this study was to investigate the effect of complex structure on dimensionality assessment in noncompensatory multidimensional item response models using dimensionality assessment procedures based on DETECT (dimensionality evaluation to enumerate contributing traits) and NOHARM (normal ogive harmonic analysis robust method). Five…

Descriptors: Item Response Theory, Statistical Analysis, Computation, Test Length

Mutual Information Item Selection Method in Cognitive Diagnostic Computerized Adaptive Testing with Short Test Length

Peer reviewed

Direct link

Wang, Chun – Educational and Psychological Measurement, 2013

Cognitive diagnostic computerized adaptive testing (CD-CAT) purports to combine the strengths of both CAT and cognitive diagnosis. Cognitive diagnosis models aim at classifying examinees into the correct mastery profile group so as to pinpoint the strengths and weakness of each examinee whereas CAT algorithms choose items to determine those…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Diagnostic Tests

Equating Multidimensional Tests under a Random Groups Design: A Comparison of Various Equating Procedures

Direct link

Lee, Eunjung – ProQuest LLC, 2013

The purpose of this research was to compare the equating performance of various equating procedures for the multidimensional tests. To examine the various equating procedures, simulated data sets were used that were generated based on a multidimensional item response theory (MIRT) framework. Various equating procedures were examined, including…

Descriptors: Equated Scores, Tests, Comparative Analysis, Item Response Theory

Indexing Creativity Fostering Teacher Behaviour: Replication and Modification

Download full text

Dikici, Ayhan; Soh, Kaycheng – Online Submission, 2015

Many measurement tools on creativity are available in the literature. One of these scales is Creativity Fostering Teacher Behaviour Index (CFTIndex) developed for Singaporean teacher originally. It was then translated into Turkish and trialled on teachers in Nigde province with acceptable reliability and factorial validity. The main purpose of…

Descriptors: Creativity, Teacher Behavior, Comparative Analysis, Turkish

An Investigation of Sample Size Splitting on ATFIND and DIMTEST

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013

Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…

Descriptors: Sample Size, Test Length, Correlation, Test Format

A Comparison of Four Methods of IRT Subscoring

Peer reviewed

Direct link

de la Torre, Jimmy; Song, Hao; Hong, Yuan – Applied Psychological Measurement, 2011

Lack of sufficient reliability is the primary impediment for generating and reporting subtest scores. Several current methods of subscore estimation do so either by incorporating the correlational structure among the subtest abilities or by using the examinee's performance on the overall test. This article conducted a systematic comparison of four…

Descriptors: Item Response Theory, Scoring, Methods, Comparative Analysis

Performance of the S - [chi][squared] Statistic for Full-Information Bifactor Models

Peer reviewed

Direct link

Li, Ying; Rupp, Andre A. – Educational and Psychological Measurement, 2011

This study investigated the Type I error rate and power of the multivariate extension of the S - [chi][squared] statistic using unidimensional and multidimensional item response theory (UIRT and MIRT, respectively) models as well as full-information bifactor (FI-bifactor) models through simulation. Manipulated factors included test length, sample…

Descriptors: Test Length, Item Response Theory, Statistical Analysis, Error Patterns

Checking Dimensionality in Item Response Models with Principal Component Analysis on Standardized Residuals

Peer reviewed

Direct link

Chou, Yeh-Tai; Wang, Wen-Chung – Educational and Psychological Measurement, 2010

Dimensionality is an important assumption in item response theory (IRT). Principal component analysis on standardized residuals has been used to check dimensionality, especially under the family of Rasch models. It has been suggested that an eigenvalue greater than 1.5 for the first eigenvalue signifies a violation of unidimensionality when there…

Descriptors: Test Length, Sample Size, Correlation, Item Response Theory

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Formulation of a DIMTEST Effect Size Measure (DESM) and Evaluation of the DESM Estimator Bias

Peer reviewed

Direct link

Seo, Minhee; Roussos, Louis A. – Journal of Educational Measurement, 2010

DIMTEST is a widely used and studied method for testing the hypothesis of test unidimensionality as represented by local item independence. However, DIMTEST does not report the amount of multidimensionality that exists in data when rejecting its null. To provide more information regarding the degree to which data depart from unidimensionality, a…

Descriptors: Effect Size, Statistical Bias, Computation, Test Length

A Short German Version of the Self Description Questionnaire I: Theoretical and Empirical Comparability

Peer reviewed

Direct link

Arens, A. Katrin; Yeung, Alexander Seeshing; Craven, Rhonda G.; Hasselhorn, Marcus – International Journal of Research & Method in Education, 2013

This study aims to develop a short German version of the Self Description Questionnaire (SDQ I-GS) in order to present a robust economical instrument for measuring German preadolescents' multidimensional self-concept. A full German version of the SDQ I (SDQ I-G) that maintained the original structure and thus length of the English original SDQ I…

Descriptors: Foreign Countries, Questionnaires, Test Construction, Test Length

The Effects of Young EFL Learners' Perceptions of Tests on Test Anxiety

Peer reviewed

Direct link

Aydin, Selami – Education 3-13, 2012

Studies conducted so far have mainly focused on the relationships between perceptions of tests and test anxiety among adult foreign language learners, while there is a lack of research focusing on young learners on the above-mentioned issue. Thus, this study aims to examine the relationship between test anxiety among young learners who study…

Descriptors: Test Length, Content Validity, Validity, Measures (Individuals)

Improving IRT Parameter Estimates with Small Sample Sizes: Evaluating the Efficacy of a New Data Augmentation Technique

Direct link

Foley, Brett Patrick – ProQuest LLC, 2010

The 3PL model is a flexible and widely used tool in assessment. However, it suffers from limitations due to its need for large sample sizes. This study introduces and evaluates the efficacy of a new sample size augmentation technique called Duplicate, Erase, and Replace (DupER) Augmentation through a simulation study. Data are augmented using…

Descriptors: Test Length, Sample Size, Simulation, Item Response Theory

Comparing Accuracy of Parameter Estimation Using IRT Models in the Presence of Guessing

Direct link

Fu, Qiong – ProQuest LLC, 2010

This research investigated how the accuracy of person ability and item difficulty parameter estimation varied across five IRT models with respect to the presence of guessing, targeting, and varied combinations of sample sizes and test lengths. The data were simulated with 50 replications under each of the 18 combined conditions. Five IRT models…

Descriptors: Item Response Theory, Guessing (Tests), Accuracy, Computation

Application of Computerized Adaptive Testing to Entrance Examination for Graduate Studies in Turkey

Peer reviewed
PDF on ERIC

Download full text

Bulut, Okan; Kan, Adnan – Eurasian Journal of Educational Research, 2012

Problem Statement: Computerized adaptive testing (CAT) is a sophisticated and efficient way of delivering examinations. In CAT, items for each examinee are selected from an item bank based on the examinee's responses to the items. In this way, the difficulty level of the test is adjusted based on the examinee's ability level. Instead of…

Descriptors: Adaptive Testing, Computer Assisted Testing, College Entrance Examinations, Graduate Students

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Educational and Psychological…	16
ProQuest LLC	4
Applied Psychological…	3
ETS Research Report Series	3
Eurasian Journal of…	3
Journal of Educational…	3
Educational Measurement:…	2
Journal of Speech, Language,…	2
Asia Pacific Education Review	1
College Student Journal	1
Contemporary Educational…	1
Education 3-13	1
Educational Sciences: Theory…	1
Field Methods	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Clinical Psychology	1
Journal of Creative Behavior	1
Journal of Educational and…	1
Journal of Experimental…	1
Journal of Learning in Higher…	1
Journal of Psychoeducational…	1
Measurement:…	1
Online Submission	1
More ▼

Wang, Wen-Chung	3
Bulut, Okan	2
Song, Hao	2
Svetina, Dubravka	2
Uysal, Ibrahim	2
Wang, Chun	2
de la Torre, Jimmy	2
Acar, Selcuk	1
Ackerman, Terry	1
Allan S. Cohen	1
Allison, Paul A.	1
Ang, Cheng	1
Anil, Duygu	1
Arens, A. Katrin	1
Arikan, Serkan	1
Atar, Burcu	1
Aybek, Eren Can	1
Aydin, Selami	1
Baris Pekmezci, Fulya	1
Bazaldua, Diego A. Luna	1
Berk, Ronald A.	1
Bleses, Dorthe	1
Boyd, Thomas A.	1
Brown, Joel M.	1
More ▼