ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	4

Descriptor

Scoring	51
Test Reliability	51
Testing Problems	51
Test Validity	22
Test Construction	14
Testing	13
Test Interpretation	12
Higher Education	9
Scores	9
Standardized Tests	9
Multiple Choice Tests	8
Test Bias	8
Achievement Tests	7
Elementary Secondary Education	7
Item Analysis	7
Measurement Techniques	7
Writing Evaluation	7
Equated Scores	6
Evaluation Methods	6
Interrater Reliability	6
Computer Assisted Testing	5
Error of Measurement	5
Student Evaluation	5
Testing Programs	5
Educational Assessment	4
More ▼

Source

Psychology in the Schools	3
College Teaching	2
Canadian Journal of School…	1
Educational Policy Analysis…	1
Educational and Psychological…	1
Evaluation Quarterly	1
Evaluation and the Health…	1
J Educ Meas	1
Journal of College Student…	1
Journal of Computer Assisted…	1
Journal of Experimental…	1
Physical Review Physics…	1
More ▼

Publication Type

Reports - Research	23
Speeches/Meeting Papers	13
Journal Articles	12
Reports - Evaluative	8
Guides - Non-Classroom	7
Opinion Papers	6
Books	2
Tests/Questionnaires	2
Collected Works - General	1
Guides - General	1
Reports - Descriptive	1
More ▼

Education Level

Elementary Secondary Education	1
Higher Education	1

Audience

Practitioners	4
Researchers	3
Teachers	3
Parents	1

Location

Brazil	1
United Kingdom (Scotland)	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	2
National Assessment of…	2
Adaptive Behavior Scale	1
Alabama High School…	1
Armed Services Vocational…	1
McCarthy Scales of Childrens…	1
Michigan Test of English…	1
Slosson Intelligence Test	1
Torrance Tests of Creative…	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 51 results Save | Export

Digital-First Assessments: A Security Framework

Peer reviewed

Direct link

LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022

Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…

Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering

Reconsidering the Assessment Policy: Practical Use of Liberal Multiple-Choice Tests (SAC Method)

Peer reviewed
PDF on ERIC

Download full text

Cesur, Kursat – Educational Policy Analysis and Strategic Research, 2019

Examinees' performances are assessed using a wide variety of different techniques. Multiple-choice (MC) tests are among the most frequently used ones. Nearly, all standardized achievement tests make use of MC test items and there is a variety of ways to score these tests. The study compares number right and liberal scoring (SAC) methods. Mixed…

Descriptors: Multiple Choice Tests, Scoring, Evaluation Methods, Guessing (Tests)

Multilevel Rasch Modeling of Two-Tier Multiple Choice Test: A Case Study Using Lawson's Classroom Test of Scientific Reasoning

Peer reviewed

Direct link

Xiao, Yang; Han, Jing; Koenig, Kathleen; Xiong, Jianwen; Bao, Lei – Physical Review Physics Education Research, 2018

Assessment instruments composed of two-tier multiple choice (TTMC) items are widely used in science education as an effective method to evaluate students' sophisticated understanding. In practice, however, there are often concerns regarding the common scoring methods of TTMC items, which include pair scoring and individual scoring schemes. The…

Descriptors: Hierarchical Linear Modeling, Item Response Theory, Multiple Choice Tests, Case Studies

Administration and Scoring Errors of Graduate Students Learning the WISC-IV: Issues and Controversies

Peer reviewed

Direct link

Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012

A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…

Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring

Essential Dimensions of Performance Tests. Professional Paper 1-77.

Download full text

Osborn, William C. – 1977

Four essential dimensions of a performance test are detailed: directness of test method, type of criterion, standardization of conditions, and objectivity of scoring. For simplicity these factors are described as if each were dichotomous, when in actuality each is a continuum; a test method may be more or less direct, conditions more or less…

Descriptors: Performance Tests, Scoring, Test Reliability, Test Validity

Problems in Measuring Formal Operations.

Alliger, R. J.; Harvey, A. L. – 1984

This article discusses practical and theoretical problems related to the measurement of formal operations. The first section of the article discusses problems in measuring formal operations using the clinical interview method. These problems include the lack of both a standardized interview and a uniform scoring procedure. Section two discusses…

Descriptors: Developmental Stages, Group Testing, Interviews, Objective Tests

Problems in Stabilizing the Judgment Process.

Download full text

Quellmalz, Edys – 1980

Measurement problems which jeopardize the reliability and validity of competency-based writing assessments are analyzed. Methods to stabilize rating criteria and readers' application of them are necessary. Most writing assessment programs use guidelines from norm-referenced test methodology. Use of this method of criteria application based on…

Descriptors: Measurement Techniques, Scoring, Test Reliability, Testing Problems

Test Information. Using the Essay as an Assessment Technique. Set 77. Number One. Item 13.

Cowie, Colin – 1977

Certain testing procedures will overcome some of the problems associated with the use of essay tests. Essay tests may not validly indicate achievement because the questions included in the test may not fairly represent instructional content. Reliability may be a problem because of variations in examinee response in different situations, in test…

Descriptors: Achievement Tests, Essay Tests, Guides, Scoring

Matched-Pair Scoring Technique Used on a First-Grade Yes-No Type Economics Achievement Test.

Download full text

Larkins, A. Guy; Shaver, James P. – 1967

There exist special problems in testing first-grade children. Orally administered yes-no tests reduce the problems found in the other types, but they have their own drawbacks. A solution to some of these drawbacks is the use of the matched-pair scoring technique. For each "yes" item on the test there is included a "reversed" or…

Descriptors: Achievement Tests, Economics, Grade 1, Primary Education

The Effects of Repeaters on Test Equating.

Peer reviewed

Andrulis, Richard S.; And Others – Educational and Psychological Measurement, 1978

The effects of repeaters (testees included in both administrations of two forms of a test) on the test equating process are examined. It is shown that repeaters do effect test equating and tend to lower the cutoff point for passing the test. (JKS)

Descriptors: Cutting Scores, Equated Scores, Item Analysis, Scoring

The Visual Motor Integration Test: High Interjudge Reliability, High Potential For Diagnostic Error.

Peer reviewed

Snyder, Peggy P.; And Others – Psychology in the Schools, 1981

Investigated scoring agreement among three different training levels of Visual Motor Integration Test (VMI) diagnosticians. Correlational data demonstrated high interexaminer reliabilities; however, there were gross errors in precision after raw scores had been converted into VMI age equivalent scores. (Author/RC)

Descriptors: Educational Diagnosis, Evaluation Methods, Grade Equivalent Scores, Motor Development

Problems in Scoring, Agreement among Raters, and Internal Consistency of Selected Marker Tests.

Peer reviewed

Rusch, Reuben; Steiner, Judith – Journal of Experimental Education, 1979

The Selected Marker Tests were examined for scoring problems and internal consistency and were administered orally to sixth and seventh graders. Scoring problems were discovered and changes were suggested. The problem was found to be item reliability rather than interrater reliability. (Author/MH)

Descriptors: Cognitive Tests, Elementary Education, Item Analysis, Problem Solving

A Practitioner's Guide to Functional Level Testing.

Haenn, Joseph F. – 1981

Procedures for conducting functional level testing have been available for use by practitioners for some time. However, the Title I Evaluation and Reporting System (TIERS), developed in response to the educational amendments of 1974 to the Elementary and Secondary Education Act (ESEA), has provided the impetus for widespread adoption of this…

Descriptors: Achievement Tests, Difficulty Level, Scores, Scoring

The Effects of Repeaters on Test Equating.

Download full text

Andrulis, Richard S.; And Others – 1974

The purpose of this investigation was to establish the effects of repeaters on test equating. Since consideration was not given to repeaters in test equating, such as in the derivation of equations by Angoff (1971), the hypothetical effect needed to be established. A case study was examined which showed results on a test as expected; overall mean…

Descriptors: Cutting Scores, Equated Scores, Recall (Psychology), Retention (Psychology)

An Evaluation of the Ethical Judgment Scale.

Peer reviewed

Doromal, Quintin S., Jr.; Creamer, Don G. – Journal of College Student Development, 1988

Investigated certain measurement properties of the Ethical Judgment Scale. Results revealed findings of questionable validity and unacceptably low reliability for the instrument even though three different scoring methods were used in the analysis. (Author)

Descriptors: Counseling, Data Analysis, Decision Making, Ethics

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Andrulis, Richard S.	2
Johnson, Eugene G.	2
Alliger, R. J.	1
Arnold, Voiza	1
Attali, Yigal	1
Baig, Basim	1
Bao, Lei	1
Barford, Sean W.	1
Bessa, Nicia M.	1
Bohning, Gerry	1
Brown, Frederick G.	1
Burns, Marilyn	1
Cesur, Kursat	1
Chase, Clinton I.	1
Cowie, Colin	1
Creamer, Don G.	1
DeGeorge, George P.	1
Dombrowski, Stefan C.	1
Doromal, Quintin S., Jr.	1
Doyle, Vincent	1
Ebel, Robert L.	1
Gilmer, Jerry S.	1
Givens, Thelma	1
Haenn, Joseph F.	1
More ▼