ERIC - Search Results

Showing 7,216 to 7,230 of 9,530 results

A Psychometric Evaluation of 4-Point and 6-Point Likert-Type Scales in Relation to Reliability and Validity.

Peer reviewed

Chang, Lei – Applied Psychological Measurement, 1994

Reliability and validity of 4-point and 6-point scales were assessed using a new model-based approach to fit empirical data from 165 graduate students completing an attitude measure. Results suggest that the issue of four- versus six-point scales may depend on the empirical setting. (SLD)

Descriptors: Attitude Measures, Goodness of Fit, Graduate Students, Graduate Study

An Investigation of Lord's Procedure for the Detection of Differential Item Functioning.

Peer reviewed

Kim, Seock-Ho; And Others – Applied Psychological Measurement, 1994

Type I error rates of F. M. Lord's chi square test for differential item functioning were investigated using Monte Carlo simulations with marginal maximum likelihood estimation and marginal Bayesian estimation algorithms. Lord's chi square did not provide useful Type I error control for the three-parameter logistic model at these sample sizes.…

Descriptors: Algorithms, Bayesian Statistics, Chi Square, Error of Measurement

The Use of Grid Questions in Chemistry.

Wilson, Audrey – Journal of Science and Mathematics Education in Southeast Asia, 1992

Reports a study to compare the performance of Australian university students on chemistry questions delivered in two different formats: multiple choice and grid. Concluded that students encounter more difficulties with questions presented in the grid format and that grid questions demand a deeper understanding of the topic. (MDH)

Descriptors: Academic Achievement, Chemistry, Cognitive Processes, Foreign Countries

A Multidimensional Scaling Study of College Students' Perceptions of Test Item Formats.

Peer reviewed

Rocklin, Thomas – Applied Measurement in Education, 1992

College students rated dissimilarity of pairs of common test item formats. A multidimensional scaling model with individual differences fit to data from 111 students suggested that they used 2 dimensions to distinguish among the formats, 1 separating supply from selection items and 1 based on the number of options. (SLD)

Descriptors: Academic Ability, Academic Achievement, College Students, Higher Education

Using Global Student Rating Items for Summative Evaluation.

Peer reviewed

Cashin, William E.; Downey, Ronald G. – Journal of Educational Psychology, 1992

The usefulness of global items in predicting weighted-composite evaluations of teaching was evaluated with a sample of 17,183 classes from 105 institutions. Results suggest that, because global items account for a substantial amount of variance, a short evaluation form could capture much of the information needed for summative evaluation. (SLD)

Descriptors: College Students, Evaluation Methods, Higher Education, Predictive Measurement

Effects of Linking Methods on Detection of DIF.

Peer reviewed

Kim, Seock-Ho; Cohen, Allan S. – Journal of Educational Measurement, 1992

Effects of the following methods for linking metrics on detection of differential item functioning (DIF) were compared: (1) test characteristic curve method (TCC); (2) weighted mean and sigma method; and (3) minimum chi-square method. With large samples, results were essentially the same. With small samples, TCC was most accurate. (SLD)

Descriptors: Chi Square, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)

Using Tests to Teach.

Peer reviewed

Cameron, Beverly J. – College Teaching, 1991

When college teachers are explicit about specific methods and strategies involved in effective thinking, students are more likely to learn and use these skills. Labeling test questions with the thinking skills required can help students refocus their study methods, resulting in more effective thinking, problem-solving, or decision-making skills.…

Descriptors: Classroom Techniques, College Instruction, Decision Making, Higher Education

Are Tests Comprising Both Multiple-Choice and Free-Response Items Necessarily Less Unidimensional than Multiple-Choice Tests? An Analysis of Two Tests.

Peer reviewed

Thissen, David; And Others – Journal of Educational Measurement, 1994

Restricted factor analysis shows that the multiple-choice and free-response sections of the Computer Science and Chemistry Advanced Placement examinations (College Board) measure the same proficiencies for the most part. There is a small degree of multidimensionality because of local dependence among free-response items. (SLD)

Descriptors: Advanced Placement, Chemistry, Computer Science, Factor Analysis

A Comparison of Equal Percentile and Partial Credit Equatings for Performance-Based Assessments Composed of Free-Response Items.

Peer reviewed

Huynh, Huynh; Ferrara, Steven – Journal of Educational Measurement, 1994

Equal percentile (EP) and partial credit (PC) equatings for raw scores from performance-based assessments with free-response items are compared through the use of data from the Maryland School Performance Assessment Program. Results suggest that EP and PC methods do not give equivalent results when distributions are markedly skewed. (SLD)

Descriptors: Comparative Analysis, Equated Scores, Mathematics Tests, Performance Based Assessment

Procedures for Designing Course Evaluation Instruments: Masked Personality Format versus Transparent Achievement Format.

Peer reviewed

Carey, Lou M.; And Others – Educational and Psychological Measurement, 1994

Effects of randomly distributing attitude-measurement items throughout a questionnaire (personality format) versus grouping together items from the same dimension (achievement format) on students' end-of-course evaluations were studied for 376 undergraduates. Advantages demonstrated for the achievement format in terms of statistical results,…

Descriptors: Attitude Measures, Course Evaluation, Evaluation Methods, Higher Education

Convergent Examinations and the Divergent Student: Testing EFL at University Level.

Peer reviewed

Statman, Stella – System, 1992

Describes weaknesses of English-as-a-foreign-language (EFL) testing methods and argues that many EFL departments set up their own examinations to assess how effectively they are preparing students for those examinations. It is suggested that this leads to production of test items that are biased against divergent students and that an interview…

Descriptors: Departments, English (Second Language), English for Special Purposes, Higher Education

Bayesian Estimation of Normal Ogive Item Response Curves Using Gibbs Sampling.

Peer reviewed

Albert, James H. – Journal of Educational Statistics, 1992

Estimating item parameters from a two-parameter normal ogive model is considered using Gibbs sampling to simulate draws from the joint posterior distribution of ability and item parameters. The method gives marginal posterior density estimates for any parameter of interest, as illustrated using data from a 33-item mathematics placement…

Descriptors: Algorithms, Bayesian Statistics, Equations (Mathematics), Estimation (Mathematics)

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Use of an Inclusive Option and the Optimal Number of Options for Multiple-Choice Items.

Peer reviewed

Crehan, Kevin D.; And Others – Educational and Psychological Measurement, 1993

Studies with 220 college students found that multiple-choice test items with 3 options are more difficult than those with 4 options, and items with the none-of-these option are more difficult than those without this option. Neither format manipulation affected item discrimination. Implications for test construction are discussed. (SLD)

Descriptors: College Students, Comparative Testing, Difficulty Level, Distractors (Tests)

The Partial Credit Model and Null Categories.

Peer reviewed

Wilson, Mark; Masters, Geoffrey N. – Psychometrika, 1993

A strategy is described for dealing with measurement situations in which certain categories of responses are null, that is, persons do not respond in certain categories to certain items. The method is described for the partial credit model while maintaining the integrity of the original response framework. (SLD)

Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Response Theory, Mathematical Models
