ERIC - Search Results

Publication Date

In 2026	0
Since 2025	5
Since 2022 (last 5 years)	10
Since 2017 (last 10 years)	33
Since 2007 (last 20 years)	51

Descriptor

Test Length	133
Test Reliability	133
Test Validity	63
Test Items	44
Test Construction	42
Scores	24
Test Format	23
Computer Assisted Testing	21
Error of Measurement	20
Foreign Countries	20
Item Response Theory	19
Comparative Analysis	16
Statistical Analysis	16
Psychometrics	15
Difficulty Level	14
Item Analysis	14
Adaptive Testing	13
Language Tests	13
Testing Problems	13
Correlation	12
Higher Education	12
Mathematical Models	12
Testing	12
Mastery Tests	11
Cutting Scores	10
More ▼

Publication Type

Reports - Research	91
Journal Articles	74
Speeches/Meeting Papers	18
Reports - Evaluative	16
Reports - Descriptive	6
Tests/Questionnaires	4
Guides - Non-Classroom	3
Information Analyses	2
Opinion Papers	2
Reference Materials -…	2
Collected Works - Serials	1
Guides - General	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Higher Education	12
Postsecondary Education	11
Elementary Education	9
Secondary Education	6
Early Childhood Education	4
Grade 6	4
Intermediate Grades	4
Middle Schools	4
Primary Education	4
Grade 3	3
Grade 5	3
Grade 7	3
Junior High Schools	3
Elementary Secondary Education	2
Grade 2	2
Grade 4	2
Grade 8	2
High Schools	2
Grade 1	1
Grade 9	1
Kindergarten	1
More ▼

Audience

Researchers	4
Practitioners	2
Community	1
Support Staff	1

Location

China	4
Turkey	3
Australia	2
Canada	2
Ireland	2
Netherlands	2
Singapore	2
United Kingdom	2
Alabama	1
California	1
Germany	1
Illinois (Chicago)	1
Indiana	1
Japan	1
Kenya	1
Maryland	1
New Jersey	1
New Zealand	1
Pennsylvania	1
Peru	1
Poland	1
Portugal	1
South Korea	1
Spain	1
Taiwan	1
More ▼

Laws, Policies, & Programs

Job Training Partnership Act…

What Works Clearinghouse Rating

Test Reliability X

Showing 31 to 45 of 133 results Save | Export

Refining Change Measure with the Rasch Model

Peer reviewed

Direct link

Zaporozhets, Olga; Fox, Christine M.; Beltyukova, Svetlana A.; Laux, John M.; Piazza, Nick J.; Salyers, Kathleen – Measurement and Evaluation in Counseling and Development, 2015

This study was to develop a linear measure of change using University of Rhode Island Change Assessment items that represented Prochaska and DiClemente's theory. The resulting Toledo Measure of Change is short, is easy to use, and provides reliable scores for identification of individuals' stage of change and progression within that stage.

Descriptors: Item Response Theory, Change, Measures (Individuals), Test Construction

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Student Outcomes on MAP Growth: Comparison of Virtual and In-Person Administrations

Download full text

James, Syretta R.; Liu, Shihching Jessica; Maina, Nyambura; Wade, Julie; Wang, Helen; Wilson, Heather; Wolanin, Natalie – Montgomery County Public Schools, 2021

The impact of the COVID-19 pandemic continues to overwhelm the functioning and outcomes of educational systems throughout the nation. The public education system is under particular scrutiny given that students, families, and educators are under considerable stress to maintain academic progress. Since the beginning of the crisis, school-systems…

Descriptors: Achievement Tests, COVID-19, Pandemics, Public Schools

Examination of Polytomous Items' Psychometric Properties According to Nonparametric Item Response Theory Models in Different Test Conditions

Peer reviewed
PDF on ERIC

Download full text

Sengul Avsar, Asiye; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2017

This study analysed polytomous items' psychometric properties according to nonparametric item response theory (NIRT) models. Thus, simulated datasets--three different test lengths (10, 20 and 30 items), three sample distributions (normal, right and left skewed) and three samples sizes (100, 250 and 500)--were generated by conducting 20…

Descriptors: Test Items, Psychometrics, Nonparametric Statistics, Item Response Theory

Identifying Sets of Maximally Efficient Items from the Academic Competence Evaluation Scales-Teacher Form

Peer reviewed

Direct link

Anthony, Christopher James; DiPerna, James Clyde – School Psychology Quarterly, 2017

The Academic Competence Evaluation Scales-Teacher Form (ACES-TF; DiPerna & Elliott, 2000) was developed to measure student academic skills and enablers (interpersonal skills, engagement, motivation, and study skills). Although ACES-TF scores have demonstrated psychometric adequacy, the length of the measure may be prohibitive for certain…

Descriptors: Test Items, Efficiency, Item Response Theory, Test Length

Test Review: C. Mardell & D. S. Goldenberg. "Speed Developmental Indicators for the Assessment of Learning-Fourth Edition" ("Speed DIAL-4")

Peer reviewed

Direct link

Doskey, Elena M.; Lagunas, Brenda; SooHoo, Michelle; Lomax, Amanda; Bullick, Stephanie – Journal of Psychoeducational Assessment, 2013

The Speed DIAL-4 was developed from the Developmental Indicators for the Assessment of Learning, Fourth Edition (DIAL-4), a screening designed to identify children between the ages of 2 years, 6 months through 5 years, 11 months "who are in need of intervention or diagnostic assessment in the following areas: motor, concepts, language,…

Descriptors: Screening Tests, Young Children, Test Length, Scoring

On the Shortcomings of Shortened Tests: A Literature Review

Peer reviewed

Direct link

Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2013

To efficiently assess multiple psychological constructs and to minimize the burden on respondents, psychologists increasingly use shortened versions of existing tests. However, compared to the longer test, a shorter test version may have a substantial impact on the reliability and the validity of the test scores in psychological research and…

Descriptors: Test Length, Psychological Testing, Test Use, Test Validity

Broadening the Scope of Reading Comprehension Using Scenario-Based Assessments: Preliminary Findings and Challenges

Peer reviewed
PDF on ERIC

Download full text

Sabatini, J.; O'Reilly, T.; Halderman, L.; Bruce, K. – Grantee Submission, 2014

Existing reading comprehension assessments have been criticized by researchers, educators, and policy makers, especially regarding their coverage, utility, and authenticity. The purpose of the current study was to evaluate a new assessment of reading comprehension that was designed to broaden the construct of reading. In light of these issues, we…

Descriptors: Reading Comprehension, Vignettes, Reading Tests, Elementary School Students

Validity and Reliability of Teacher-Made Tests: Case Study of Year 11 Physics in Nyahururu District of Kenya

Peer reviewed
PDF on ERIC

Download full text

Kinyua, Kiragu; Okunya, Luke Odiemo – African Educational Research Journal, 2014

This study was carried out to establish the factors influencing the validity and reliability of teacher made tests in Kenya. It was conducted in Nyahururu District of Laikipia County in Kenya. The study involved 42 teachers and 15 key informants selected from teachers holding various positions of academic responsibilities in their schools in…

Descriptors: Tests, Test Validity, Test Reliability, Physics

Comparing the Performance of Five Multidimensional CAT Selection Procedures with Different Stopping Rules

Peer reviewed

Direct link

Yao, Lihua – Applied Psychological Measurement, 2013

Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Indexing Creativity Fostering Teacher Behaviour: Replication and Modification

Download full text

Dikici, Ayhan; Soh, Kaycheng – Online Submission, 2015

Many measurement tools on creativity are available in the literature. One of these scales is Creativity Fostering Teacher Behaviour Index (CFTIndex) developed for Singaporean teacher originally. It was then translated into Turkish and trialled on teachers in Nigde province with acceptable reliability and factorial validity. The main purpose of…

Descriptors: Creativity, Teacher Behavior, Comparative Analysis, Turkish

The Psychometric Properties of the Short and Long Versions of the Coach-Athlete Relationship Questionnaire

Peer reviewed

Direct link

Yang, Sophie Xin; Jowett, Sophia – Measurement in Physical Education and Exercise Science, 2013

The Coach-Athlete Relationship Questionnaire was developed to effectively measure affective, cognitive, and behavioral aspects, represented by the interpersonal constructs of closeness, commitment, and complementarity, of the quality of the relationship within the context of sport coaching. The current study sought to determine the internal…

Descriptors: Foreign Countries, Athletes, Athletic Coaches, Interpersonal Relationship

Relating Unidimensional IRT Parameters to a Multidimensional Response Space: A Review of Two Alternative Projection IRT Models for Scoring Subscales

Peer reviewed

Direct link

Kahraman, Nilufer; Thompson, Tony – Journal of Educational Measurement, 2011

A practical concern for many existing tests is that subscore test lengths are too short to provide reliable and meaningful measurement. A possible method of improving the subscale reliability and validity would be to make use of collateral information provided by items from other subscales of the same test. To this end, the purpose of this article…

Descriptors: Test Length, Test Items, Alignment (Education), Models

Development of the Career Indecision Profile: Factor Structure, Reliability, and Validity

Peer reviewed

Direct link

Hacker, Jason; Carr, Andrea; Abrams, Matthew; Brown, Steven D. – Journal of Career Assessment, 2013

Prior research using a 167-item measure of career indecision (Career Indecision Profile-167 [CIP-167]) has suggested that career choice difficulties may be associated with four major sources of career indecision: neuroticism/negative affectivity, choice/commitment anxiety, lack of readiness, and interpersonal conflicts. The purpose of this study…

Descriptors: Career Choice, Decision Making, Measures (Individuals), Factor Structure

Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

Peer reviewed

Direct link

Yao, Lihua – Psychometrika, 2012

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…

Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Educational and Psychological…	13
Journal of Psychoeducational…	8
Applied Psychological…	5
Journal of Educational…	5
Psychometrika	4
Applied Measurement in…	3
Language Testing	3
Assessment & Evaluation in…	2
ETS Research Report Series	2
International Journal of…	2
Journal of Personality…	2
Psychological Assessment	2
Research Matters	2
ACT Education Corp.	1
AERA Online Paper Repository	1
African Educational Research…	1
Anatomical Sciences Education	1
Assessment	1
Assessment and Evaluation in…	1
College Student Journal	1
Contemporary Educational…	1
Education and Information…	1
Educational Research and…	1
Educational Sciences: Theory…	1
Eurasian Journal of…	1
More ▼

Hambleton, Ronald K.	4
Burton, Richard F.	3
Cliff, Norman	2
Gilmer, Jerry S.	2
Huynh, Huynh	2
Lee, Yi-Hsuan	2
Leite, Walter L.	2
Livingston, Samuel A.	2
Marcoulides, Katerina M.	2
Raborn, Anthony W.	2
Reckase, Mark D.	2
Wilcox, Rand R.	2
Yao, Lihua	2
Zhang, Jinming	2
de Jong, John H. A. L.	2
Abrams, Matthew	1
Allison, Paul A.	1
Almeida, Leandro S.	1
Anderson, Judith A.	1
Andrea Fuster	1
Andy Rick Sánchez-Villena	1
Anthony, Christopher J.	1
Anthony, Christopher James	1
Arens, A. Katrin	1
More ▼

Wechsler Adult Intelligence…	3
McCarthy Scales of Childrens…	2
Peabody Picture Vocabulary…	2
Test of English as a Foreign…	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Adaptive Behavior Scale	1
Armed Forces Qualification…	1
Comprehensive Tests of Basic…	1
Developmental Indicators for…	1
Draw a Person Test	1
Fennema Sherman Mathematics…	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Matching Familiar Figures Test	1
Measures of Academic Progress	1
Medical College Admission Test	1
Minnesota Multiphasic…	1
Multidimensional…	1
National Assessment of…	1
Positive and Negative Affect…	1
School and College Ability…	1
Self Description Questionnaire	1
Stanford Binet Intelligence…	1
More ▼