Publication Date
| Date range | Records |
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Audience
| Audience | Records |
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Location | Records |
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
What Works Clearinghouse Rating
| Rating | Records |
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewed
Schwarz, Shirley P.; And Others – Journal of Educational Measurement, 1991
Interviews were conducted with 104 students in master's-level classes to determine their reasons for changing test answers. Subjects previously had been instructed in answer-changing strategies. Most changes were made for thought-out reasons; few were because of clerical errors. Reconsideration of test items is probably underestimated in…
Descriptors: Achievement Gains, Graduate Students, Guessing (Tests), Higher Education
Stoneall, Linda – Training and Development, 1991
Describes questioning methods trainers can use to uncover training needs (interviews, surveys, test questions, program evaluations). Illustrates the use of questions at the beginning, middle, and end of training sessions. (SK)
Descriptors: Adult Education, Discussion (Teaching Technique), Evaluation Methods, Interviews
Peer reviewed
Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1990
A new test construction model based on the Rasch model is proposed. This model, the cluster-based method, considers groups of interchangeable items rather than individual items and uses integer programming. Results for six test construction problems indicate that the method produces accurate results in a small amount of time. (SLD)
Descriptors: Cluster Analysis, Computer Assisted Testing, Equations (Mathematics), Item Banks
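The cluster-based method in the record above selects counts of items from groups of interchangeable items rather than picking individual items. A minimal sketch of that selection idea, using hypothetical cluster data and a brute-force search over integer counts in place of the paper's integer-programming formulation:

```python
from itertools import product

# Hypothetical clusters of interchangeable items:
# (items available in cluster, Rasch information per item at the target ability).
clusters = [(10, 0.42), (8, 0.35), (12, 0.51), (6, 0.28)]
TEST_LENGTH = 20  # total number of items the assembled test must contain

best_counts, best_info = None, -1.0
# Decision variable: how many items to draw from each cluster (an integer
# bounded by the cluster size). Brute force stands in for an integer program.
for counts in product(*(range(size + 1) for size, _ in clusters)):
    if sum(counts) != TEST_LENGTH:
        continue
    info = sum(n * per_item for n, (_, per_item) in zip(counts, clusters))
    if info > best_info:
        best_counts, best_info = counts, info

print(f"items per cluster: {best_counts}, total information: {best_info:.2f}")
```

A real assembly model would add content constraints and a target information curve, and would hand the resulting integer program to a MILP solver instead of enumerating.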
Peer reviewed
Frary, Robert B. – Applied Measurement in Education, 1991
The use of the "none-of-the-above" option (NOTA) in 20 college-level multiple-choice tests was evaluated for classes with 100 or more students. Eight academic disciplines were represented, and 295 NOTA and 724 regular test items were used. It appears that the NOTA can be compatible with good classroom measurement. (TJH)
Descriptors: College Students, Comparative Testing, Difficulty Level, Discriminant Analysis
Peer reviewed
Rost, Jurgen – Applied Psychological Measurement, 1990
Combining Rasch and latent class models is presented as a way to overcome deficiencies and retain the positive features of both. An estimation algorithm is outlined, providing conditional maximum likelihood estimates of item parameters for each class. The model is illustrated with simulated data and real data (n=869 adults). (SLD)
Descriptors: Adults, Algorithms, Computer Simulation, Equations (Mathematics)
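In Rost's mixed Rasch model, each latent class has its own item-difficulty vector, and a response pattern's likelihood is a class-weighted mixture. A toy illustration with hypothetical parameters (the paper's conditional maximum likelihood estimation of these parameters is not reproduced here):

```python
import math

def rasch_p(theta, b):
    """Rasch probability of a correct response at ability theta, difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

# Two latent classes, each with its own difficulties for three items (hypothetical).
class_difficulties = {"class_1": [-1.0, 0.0, 1.0], "class_2": [0.5, 0.5, 0.5]}
class_weights = {"class_1": 0.6, "class_2": 0.4}

def pattern_likelihood(pattern, theta):
    """Mixture likelihood of a 0/1 response pattern at ability theta."""
    total = 0.0
    for g, difficulties in class_difficulties.items():
        p = 1.0
        for x, b in zip(pattern, difficulties):
            p_correct = rasch_p(theta, b)
            p *= p_correct if x == 1 else 1.0 - p_correct
        total += class_weights[g] * p
    return total

print(pattern_likelihood([1, 1, 0], theta=0.0))
```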
Peer reviewed
Hills, John R. – Educational Measurement: Issues and Practice, 1993
A scenario and accompanying questions and answers are posed to help educators examine possible problems in interpreting a student's test score profile. Profiles developed and used soundly are very helpful, but possible pitfalls in test interpretation must be recognized. (SLD)
Descriptors: Academic Achievement, Educational Assessment, Elementary Secondary Education, Performance
Peer reviewed
Gardner, Donald G.; Cummings, L. L.; Dunham, Randall B.; Pierce, Jon L. – Educational and Psychological Measurement, 1998
Whether traditional Likert-type focus-of-attention-at-work scales would outperform the one-item scales developed by D. Gardner and others (1989) was studied with responses of 492 automobile-services-club employees. Confirmatory factor analysis did not show either method to be empirically better. Situations in which the one-item scale might be…
Descriptors: Attention, Comparative Analysis, Employees, Likert Scales
Peer reviewed
Ercikan, Kadriye; Schwartz, Richard D.; Julian, Marc W.; Burket, George R.; Weber, Melba M.; Link, Valerie – Journal of Educational Measurement, 1998
Discusses and demonstrates combining scores from multiple-choice (MC) and constructed-response (CR) items to create a common scale using Item Response Theory methodology. Provides empirical results using a set of tests in reading, language, mathematics, and science in three grades. (SLD)
Descriptors: Constructed Response, Elementary Secondary Education, Item Response Theory, Language Arts
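Placing multiple-choice and constructed-response items on a common scale means both item types contribute to one likelihood over the same ability θ. A minimal sketch, assuming a Rasch model for the MC items and a partial credit model for the CR items with hypothetical parameters (the paper's specific IRT models are not reproduced):

```python
import math

def rasch_p(theta, b):
    """Dichotomous Rasch probability for a multiple-choice item."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def pcm_probs(theta, deltas):
    """Partial credit model category probabilities (categories 0..len(deltas))
    for a constructed-response item with step difficulties `deltas`."""
    numerators = [1.0]  # category 0 has numerator exp(0)
    s = 0.0
    for d in deltas:
        s += theta - d
        numerators.append(math.exp(s))
    z = sum(numerators)
    return [n / z for n in numerators]

def joint_loglik(theta, mc_resp, mc_b, cr_resp, cr_deltas):
    """Log-likelihood of one examinee's MC and CR responses on one theta scale."""
    ll = 0.0
    for x, b in zip(mc_resp, mc_b):
        p = rasch_p(theta, b)
        ll += math.log(p if x == 1 else 1.0 - p)
    for k, deltas in zip(cr_resp, cr_deltas):
        ll += math.log(pcm_probs(theta, deltas)[k])
    return ll

# Three MC items, one 3-category CR item (all parameters hypothetical).
print(joint_loglik(0.3, [1, 0, 1], [-0.5, 0.2, 0.8], [2], [[-0.4, 0.6]]))
```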
Peer reviewed
Enright, Mary K.; Rock, Donald A.; Bennett, Randy Elliot – Journal of Educational Measurement, 1998
Examined alternative item types and section configurations for improving the discriminant and convergent validity of the Graduate Record Examination (GRE) General Test, using a computer-based test given to 388 examinees who had taken the GRE previously. Adding new variations of logical meaning appeared to decrease discriminant validity. (SLD)
Descriptors: Admission (School), College Entrance Examinations, College Students, Computer Assisted Testing
Peer reviewed
Stricker, Lawrence J.; Emmerich, Walter – Journal of Educational Measurement, 1999
Examined the connection between gender differences in examinees' familiarity, interest, and negative emotional reactions to items on the College Board's Advanced Placement Psychology Examination and the items' differential item functioning (DIF). For a sample of 717 students, gender differences on the three variables were substantially related to the…
Descriptors: Advanced Placement, Correlation, Emotional Response, Familiarity
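One common way to quantify the DIF referenced above is the Mantel-Haenszel statistic, which pools the reference-versus-focal odds of a correct answer across matched total-score strata (the abstract does not say which DIF method the authors used; the counts below are hypothetical):

```python
import math

# Per total-score stratum, counts for one item:
# (reference correct, reference wrong, focal correct, focal wrong)
strata = [(30, 10, 25, 15), (45, 5, 40, 10), (20, 20, 15, 25)]

num = sum(rc * fw / (rc + rw + fc + fw) for rc, rw, fc, fw in strata)
den = sum(rw * fc / (rc + rw + fc + fw) for rc, rw, fc, fw in strata)
alpha_mh = num / den                   # common odds ratio across strata
delta_mh = -2.35 * math.log(alpha_mh)  # ETS delta scale; negative favors the reference group

print(f"MH odds ratio = {alpha_mh:.2f}, MH D-DIF = {delta_mh:.2f}")
```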
Peer reviewed
Wester, Anita; Henriksson, Widar – Studies in Educational Evaluation, 2000
Examined whether changes in format of mathematics items in the Third International Mathematics and Science Study (TIMSS) had any effect on gender differences in performance using a Swedish sample of 8,851 sixth, seventh, and eighth graders. Results show no significant changes in gender differences when item format is altered. (SLD)
Descriptors: Interaction, International Studies, Junior High School Students, Junior High Schools
Peer reviewed
Barnette, J. Jackson – Educational and Psychological Measurement, 2000
Used a design in which item stem direction and item response pattern were crossed to determine effects on internal consistency reliability. Results from high school and college students and teachers (150 individuals per test form) suggest using directly worded items with half of the response items going in one direction, and half in the other…
Descriptors: College Students, High School Students, High Schools, Higher Education
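Before computing internal consistency for a scale that mixes item wording directions, the negatively worded items have to be reverse-scored. A minimal sketch with Cronbach's alpha on hypothetical 7-point data (independent random responses, so alpha will sit near zero; real scale data are correlated):

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for an (n_respondents, n_items) score matrix."""
    k = scores.shape[1]
    item_var_sum = scores.var(axis=0, ddof=1).sum()
    total_var = scores.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1.0 - item_var_sum / total_var)

rng = np.random.default_rng(0)
responses = rng.integers(1, 8, size=(150, 10)).astype(float)  # 7-point scale
reversed_items = [5, 6, 7, 8, 9]  # half the items worded in the other direction

responses[:, reversed_items] = 8 - responses[:, reversed_items]  # reverse-score
print(f"alpha = {cronbach_alpha(responses):.3f}")
```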
Peer reviewed
Cohen, Steve; And Others – Journal of Educational and Behavioral Statistics, 1996
A detailed multisite evaluation of instructional software, the ConStatS package, designed to help students conceptualize introductory probability and statistics, yielded patterns of error on several assessment items. Results from 739 college students demonstrated 10 misconceptions that may be among the most difficult concepts to teach. (SLD)
Descriptors: College Students, Computer Assisted Instruction, Computer Software Evaluation, Educational Assessment
Peer reviewed
Lundeberg, Mary A.; And Others – Journal of Educational Psychology, 1994
Gender differences in item-specific confidence judgments were studied for 70 male and 181 female college students. Gender differences in confidence were dependent on context and the domain being tested. Both men and women were overconfident, but men were especially overconfident when incorrect. (SLD)
Descriptors: College Students, Confidence Testing, Context Effect, Difficulty Level
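Item-specific confidence judgments like those above are usually scored by comparing mean confidence against actual accuracy; a positive gap is overconfidence. A minimal sketch with hypothetical judgments:

```python
# Each pair: (confidence judgment in [0, 1], whether the answer was correct).
judgments = [(0.9, True), (0.8, False), (0.95, True), (0.7, False), (0.85, True)]

mean_confidence = sum(c for c, _ in judgments) / len(judgments)
accuracy = sum(correct for _, correct in judgments) / len(judgments)
bias = mean_confidence - accuracy  # positive values indicate overconfidence

print(f"confidence={mean_confidence:.2f}, accuracy={accuracy:.2f}, bias={bias:+.2f}")
```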
Peer reviewed
Chang, Lei; And Others – Applied Measurement in Education, 1996
The influence of judges' knowledge on standard setting for competency tests was studied with 17 judges who took an economics teacher certification test while setting competency standards using the Angoff procedure. Judges tended to set higher standards for items they answered correctly and lower standards for items they answered incorrectly. (SLD)
Descriptors: Competence, Difficulty Level, Economics, Judges
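In the Angoff procedure named above, each judge estimates, item by item, the probability that a minimally competent examinee answers correctly; summing a judge's estimates gives that judge's cut score, and the recommended cut is the average over judges. A minimal sketch with hypothetical ratings:

```python
# ratings[j][i]: judge j's estimated probability that a minimally competent
# examinee answers item i correctly (three judges, four items, hypothetical).
ratings = [
    [0.60, 0.75, 0.40, 0.80],
    [0.55, 0.70, 0.50, 0.85],
    [0.65, 0.80, 0.45, 0.75],
]

judge_cuts = [sum(r) for r in ratings]         # each judge's expected raw score
cut_score = sum(judge_cuts) / len(judge_cuts)  # Angoff cut: mean across judges

print(f"per-judge cuts: {judge_cuts}, recommended cut score: {cut_score:.2f}")
```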