ERIC - Search Results

Publication Date

In 2026	0
Since 2025	52
Since 2022 (last 5 years)	194
Since 2017 (last 10 years)	494
Since 2007 (last 20 years)	742

Descriptor

Test Items	1186
Test Reliability	1186
Test Validity	684
Test Construction	565
Foreign Countries	348
Difficulty Level	279
Item Analysis	252
Psychometrics	233
Item Response Theory	219
Factor Analysis	183
Multiple Choice Tests	172
Scores	171
Higher Education	126
Correlation	121
Test Format	119
Scoring	114
Statistical Analysis	108
Measures (Individuals)	101
Goodness of Fit	94
College Students	92
Undergraduate Students	91
Achievement Tests	90
Factor Structure	90
Comparative Analysis	89
Test Bias	86
More ▼

Education Level

Higher Education	247
Postsecondary Education	202
Secondary Education	149
Elementary Education	122
High Schools	68
Middle Schools	68
Junior High Schools	51
Early Childhood Education	38
Elementary Secondary Education	36
Intermediate Grades	30
Primary Education	29
Grade 8	22
Grade 7	20
Grade 6	17
Kindergarten	17
Grade 5	16
Grade 2	15
Grade 1	13
Grade 3	13
Grade 4	13
Grade 9	9
Adult Education	7
Preschool Education	6
Grade 12	4
Grade 10	3
More ▼

Audience

Practitioners	39
Researchers	30
Teachers	24
Administrators	13
Support Staff	3
Counselors	2
Students	2
Community	1
Parents	1
Policymakers	1

Location

Turkey	68
Indonesia	37
Germany	20
Canada	17
Florida	17
China	16
Australia	15
California	12
Iran	11
India	10
New York	9
United States	9
Malaysia	8
Nigeria	8
Taiwan	8
Nebraska	7
Netherlands	7
Georgia	6
Illinois	6
Mexico	6
Saudi Arabia	6
South Korea	6
Thailand	6
Turkey (Ankara)	6
Turkey (Istanbul)	6
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	4
Every Student Succeeds Act…	3
No Child Left Behind Act 2001	3
Rehabilitation Act 1973…	3
Elementary and Secondary…	1
Head Start	1
Job Training Partnership Act…	1
United Nations Convention on…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Test Items X

Showing 976 to 990 of 1,186 results Save | Export

OCOD-CTTP Test Evaluation Report.

Download full text

Shorey, Leonard – 1991

Tests in social studies and integrated science given in Saint Vincent, Saint Lucia, Grenada, and Dominica were analyzed by the Organization for Co-operation in Overseas Development (OCOD) Comprehensive Teacher Training Program (CTTP) for discrimination, difficulty, and reliability, as well as other characteristics. There were 767 examinees for the…

Descriptors: Difficulty Level, Elementary Secondary Education, Evaluation Methods, Foreign Countries

An Investigation of the Relationship between Reliability, Power, and the Type I Error Rate of the Mantel-Haenszel and Simultaneous Item Bias Detection Procedures.

Download full text

Ackerman, Terry A.; Evans, John A. – 1992

The relationship between levels of reliability and the power of two bias and differential item functioning (DIF) detection methods is examined. Both methods, the Mantel-Haenszel (MH) procedure of P. W. Holland and D. T. Thayer (1988) and the Simultaneous Item Bias (SIB) procedure of R. Shealy and W. Stout (1991), use examinees' raw scores as a…

Descriptors: Comparative Analysis, Equations (Mathematics), Error of Measurement, Item Bias

The Preliminary Chinese Proficiency Test (Pre-CPT): Development, Scaling, and Equating to the Chinese Proficiency Test (CPT). Technical Report 1.

Download full text

Stansfield, Charles W.; And Others – 1992

This report describes the development, construction, and validation of the Preliminary Chinese Proficiency Test (Pre-CPT), a standardized, nationally-normed test of listening and reading comprehension for beginning-level native English-speaking learners of Chinese as a second language. The Pre-CPT was designed as a lower-level version of the…

Descriptors: Chinese, Higher Education, Language Proficiency, Language Tests

Item Calibration Considerations: A Comparison of Item Calibrations on Written and Computerized Adaptive Examinations.

Download full text

Stone, Gregory Ethan; Lunz, Mary E. – 1994

This paper explores the comparability of item calibrations for three types of items: (1) text only; (2) text with photographs; and (3) text plus graphics when items are presented on written tests and computerized adaptive tests. Data are from five different medical technology certification examinations administered nationwide in 1993. The Rasch…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Diagrams

The Development of an Emotions Scale for Writers.

Download full text

Powell, Jack L.; Brand, Alice G. – 1986

The Brand Emotions Scale for Writers (BSEW) is a 20-item scale designed to measure the emotions of writers; (1) immediately before writing (state-before), (2) immediately after writing (state-after), and (3) when writing in general (trait). This paper describes the development of BSEW and the factor structure of these three different forms. Common…

Descriptors: Affective Measures, Authors, Behavior Rating Scales, Factor Structure

The Development of the Economics Values Inventory. Report to the Foundation for Teaching Economics.

Download full text

O'Brien, Mary Utne; Ingels, Steven J. – 1984

Intended to provide information about the development of the Economics Values Inventory (EVI), the report describes considerations that directed development of test items and provides indicators of the reliability and validity of the proposed test instrument. The EVI is recommended as an effective measuring instrument in experimental evaluations…

Descriptors: Affective Measures, Attitude Change, Attitude Measures, Economics

Practical Procedures for Constructing Mastery Tests to Minimize Errors of Classification and to Maximize or Optimize Decision Reliability.

Byars, Alvin Gregg – 1980

The objectives of this investigation are to develop, describe, assess, and demonstrate procedures for constructing mastery tests to minimize errors of classification and to maximize decision reliability. The guidelines are based on conditions where item exchangeability is a reasonable assumption and the test constructor can control the number of…

Descriptors: Cutting Scores, Difficulty Level, Grade 4, Intermediate Grades

Construction and Use of Criterion-Referenced Tests in Program Evaluation Studies. Laboratory of Psychometric and Evaluation Research Report No. 102.

Download full text

Gifford, Janice A.; Hambleton, Ronald K. – 1980

Technical considerations associated with item selection and reliability assessment are considered in relation to criterion-referenced tests constructed to provide group information. The purpose is to emphasize test building and the evaluation of test scores in program evaluation studies. It is stressed that an evaluator employ a performance or…

Descriptors: Criterion Referenced Tests, Group Testing, Item Sampling, Models

Applications of Adaptive Testing in Measuring Achievement and Performance.

Download full text

Bejar, Issac I. – 1976

The concept of testing for partial knowledge is considered with the concept of tailored testing. Following the special usage of latent trait theory, the word valdity is used to mean the correlation of a test with the construct the test measures. The concept of a method factor in the test is also considered as a part of the validity. The possible…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Confidence Testing

Q. How Many Options Should a Multiple-Choice Question Have? (a) 2. (b) 3. (c) 4. At-a-glance Research Report.

Catts, Ralph – 1978

The reliability of multiple choice tests--containing different numbers of response options--was investigated for 260 students enrolled in technical college economics courses. Four test forms, constructed from previously used four-option items, were administered, consisting of (1) 60 two-option items--two distractors randomly discarded; (2) 40…

Descriptors: Answer Sheets, Difficulty Level, Foreign Countries, Higher Education

Algorithms for Developing Test Questions from Sentences in Instructional Materials. Interim Report, January-September 1977.

Download full text

Roid, Gale; Finn, Patrick – 1978

The feasibility of generating multiple-choice test questions by transforming sentences from prose instructional materials was examined. A computer-based algorithm was used to analyze prose subject matter and to identify high-information words. Sentences containing selected words were then transformed into multiple-choice items by four writers who…

Descriptors: Algorithms, Criterion Referenced Tests, Difficulty Level, Form Classes (Languages)

A Comparison of Three Types of Item Analysis in Test Development Using Classical and Latent Trait Methods.

Benson, Jeri; And Others – 1978

The precision and efficiency of a cognitive test constructed by three different methods of item analysis was compared, using the verbal aptitude subtest of the Florida Twelfth Grade Test. Classical item analysis, factor analysis and the Rasch logistic model were used in the construction of 15 and 30 item subtests and replicated for samples of 250,…

Descriptors: Cognitive Tests, Comparative Analysis, Efficiency, Factor Analysis

Setting, Evaluating, and Maintaining Certification Standards with the Rasch Model.

Peer reviewed

Grosse, Martin E.; Wright, Benjamin D. – Evaluation and the Health Professions, 1986

Based on the standard setting procedures or the American Board of Preventive Medicine for their Core Test, this article describes how Rasch measurement can facilitate using test content judgments in setting a standard. Rasch measurement can then be used to evaluate and improve the precision of the standard and to hold it constant across time.…

Descriptors: Certification, Criterion Referenced Tests, Difficulty Level, Health Personnel

A Discussion of the Expressive One-Word Picture Vocabulary Test.

Peer reviewed

Altepeter, Tom – School Psychology Review, 1983

A critical review of the Expressive One-Word Picture Vocabulary Test (Gardner) is offered. The reviewer feels that the instrument cannot be recommended in its present form. Further research concerning the manual, and theoretical issues, (particularly test-retest stability) is strongly recommended. (Author/PN)

Descriptors: Error of Measurement, Intelligence Tests, Item Analysis, Pictorial Stimuli

Assessment and Feedback in Science Education.

Peer reviewed

Black, Paul – Studies in Educational Evaluation, 1995

The role of assessment in science education is explored, focusing on summative assessment in British public certificate examinations. Examples of test items are presented to illustrate difficulties in making valid and reliable assessments, and issues with implications for formative assessment are discussed. (SLD)

Descriptors: Educational Assessment, Feedback, Foreign Countries, Formative Evaluation

« Previous Page | Next Page »

Pages: 1 | ... | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | ... | 80

Educational and Psychological…	54
Journal of Psychoeducational…	33
Online Submission	31
Journal of Educational…	26
ProQuest LLC	24
Grantee Submission	18
ETS Research Report Series	17
International Journal of…	17
SAGE Open	14
Applied Psychological…	13
Applied Measurement in…	11
International Journal of…	10
Language Testing	10
Physical Review Physics…	10
Education and Information…	9
International Journal of…	9
Journal of Experimental…	9
Chemistry Education Research…	8
Language Assessment Quarterly	8
Educational Sciences: Theory…	7
Journal of Baltic Science…	7
Measurement and Evaluation in…	7
Practical Assessment,…	7
Psychometrika	7
Eurasian Journal of…	6
More ▼

Schoen, Robert C.	12
LaVenia, Mark	5
Liu, Ou Lydia	5
Anderson, Daniel	4
Bauduin, Charity	4
DiLuzio, Geneva J.	4
Farina, Kristy	4
Haladyna, Thomas M.	4
Huck, Schuyler W.	4
Petscher, Yaacov	4
Stansfield, Charles W.	4
Trevisan, Michael S.	4
Wainer, Howard	4
Yang, Xiaotong	4
Aiken, Lewis R.	3
Alonzo, Julie	3
Baghaei, Purya	3
Benson, Jeri	3
Boone, William J.	3
Brennan, Robert L.	3
Burton, Richard F.	3
Dogan, Nuri	3
Downing, Steven M.	3
Edwards, Michael C.	3
More ▼

Reports - Research	856
Journal Articles	812
Reports - Evaluative	144
Speeches/Meeting Papers	126
Tests/Questionnaires	106
Reports - Descriptive	62
Guides - Non-Classroom	29
Dissertations/Theses -…	24
Numerical/Quantitative Data	24
Information Analyses	21
Opinion Papers	19
Guides - Classroom - Teacher	8
Books	5
Guides - General	5
Book/Product Reviews	2
Collected Works - General	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - Classroom - Learner	2
Reference Materials -…	2
Collected Works - Serials	1
Computer Programs	1
Multilingual/Bilingual…	1
Non-Print Media	1
Reference Materials -…	1
More ▼

SAT (College Admission Test)	10
Test of English as a Foreign…	10
ACT Assessment	6
Graduate Record Examinations	5
Trends in International…	5
Wechsler Intelligence Scale…	5
Raven Progressive Matrices	4
Test of English for…	4
Comprehensive Tests of Basic…	3
Dynamic Indicators of Basic…	3
Marlowe Crowne Social…	3
Measures of Academic Progress	3
Peabody Picture Vocabulary…	3
Stanford Achievement Tests	3
Advanced Placement…	2
Armed Services Vocational…	2
Autism Diagnostic Observation…	2
Child Behavior Checklist	2
Flesch Kincaid Grade Level…	2
Graduate Management Admission…	2
International English…	2
Iowa Tests of Basic Skills	2
National Assessment of…	2
Program for International…	2
Progress in International…	2
More ▼