ERIC - Search Results

Showing 1 to 15 of 29 results

The Social Shapes Test as a Self-Administered, Online Measure of Social Intelligence: Two Studies with Typically Developing Adults and Adults with Autism Spectrum Disorder

Peer reviewed

Brown, Matt I.; Heck, Patrick R.; Chabris, Christopher F. – Journal of Autism and Developmental Disorders, 2024

The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…

Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability

Confidence Intervals for Weighted Composite Scores under the Compound Binomial Error Model

Peer reviewed

Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2018

Reporting confidence intervals with test scores helps test users make important decisions about examinees by providing information about the precision of test scores. Although a variety of estimation procedures based on the binomial error model are available for computing intervals for test scores, these procedures assume that items are randomly…

Descriptors: Weighted Scores, Error of Measurement, Test Use, Decision Making

A Modified "a"-Stratified Method for Computerized Adaptive Testing. Research Report. ETS RR-19-10

Peer reviewed

Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019

Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high-"a" items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items

New York State Testing Program 2018: English Language Arts and Mathematics Grades 3-8. Technical Report

New York State Education Department, 2018

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…

Descriptors: English, Language Arts, Language Tests, Mathematics Tests

New York State Testing Program 2017: English Language Arts and Mathematics Grades 3-8. Technical Report

New York State Education Department, 2017

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…

Descriptors: English, Language Arts, Language Tests, Mathematics Tests

Multi-Population Invariance with Dichotomous Measures: Combining Multi-Group and MIMIC Methodologies in Evaluating the General Aptitude Test in the Arabic Language

Peer reviewed

Sideridis, Georgios D.; Tsaousis, Ioannis; Al-harbi, Khaleel A. – Journal of Psychoeducational Assessment, 2015

The purpose of the present study was to extend the model of measurement invariance by simultaneously estimating invariance across multiple populations in the dichotomous instrument case using multi-group confirmatory factor analytic and multiple indicator multiple causes (MIMIC) methodologies. Using the Arabic version of the General Aptitude Test…

Descriptors: Semitic Languages, Aptitude Tests, Error of Measurement, Factor Analysis

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Investigating the Justifiability of an Additional Test Use: An Application of Assessment Use Argument to an English as a Foreign Language Test

Wang, Huan – ProQuest LLC, 2010

Multiple uses of the same assessment may present challenges for both the design and use of an assessment. Little advice, however, has been given to assessment developers as to how to understand the phenomena of multiple assessment use and meet the challenges these present. Particularly problematic is the case in which an assessment is used for…

Descriptors: Test Use, Testing Programs, Program Effectiveness, Test Construction

Tests for the Jackknife Autocorrelation Estimator "r(Q2)."

Peer reviewed

Huitema, Bradley E.; McKean, Joseph W. – Educational and Psychological Measurement, 1996

Two tests for the jackknife autocorrelation estimator r(Q2) are evaluated. It is shown that a test based on the conventional approach for estimating the standard error of a jackknife estimator leads to an unacceptable Type I error rate. An alternative approach is proposed that leads to a more satisfactory test when n > 20. (Author/SLD)

Descriptors: Error of Measurement, Estimation (Mathematics), Test Use

Classical Test Theory in Historical Perspective.

Peer reviewed

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997

Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)

Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics

Estimating the Consistency and Accuracy of Classifications Based on Test Scores.

Livingston, Samuel A.; Lewis, Charles – 1993

This paper presents a method for estimating the accuracy and consistency of classifications based on test scores. The scores can be produced by any scoring method, including the formation of a weighted composite. The estimates use data from a single form. The reliability of the score is used to estimate its effective test length in terms of…

Descriptors: Classification, Error of Measurement, Estimation (Mathematics), Reliability

Five Common Misuses of Tests. ERIC Digest No. 108.

Gardner, Eric – 1989

Five of the common misuses of tests are reviewed: (1) acceptance of the test title as an accurate and complete description of the variable being measured (failure to examine the manual and the items carefully to know the specific aspects to be tested can result in misuse through selection of an inappropriate test for a particular purpose or…

Descriptors: Error of Measurement, Evaluation Problems, Examiners, Scoring

Test-Retest Stability of the Wisconsin Card Sorting Test.

Peer reviewed

Paolo, Anthony M.; Axelrod, Bradley N.; Troster, Alexander I. – Assessment, 1996

Eighty-seven normal older people were administered the Wisconsin Card Sorting Test on two occasions averaging over a year apart. There were average retest gains of 5 to 7 standard score points. The standard error of prediction, standard error of difference, and abnormal test-retest discrepancy scores were calculated for clinical use. (SLD)

Descriptors: Clinical Diagnosis, Diagnostic Tests, Error of Measurement, Older Adults

The Precision of Measurements.

Peer reviewed

Kane, Michael – Applied Measurement in Education, 1996

This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)

Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques

Standard Errors of Measurement at Different Ability Levels.

Peer reviewed

Lord, Frederic M. – Journal of Educational Measurement, 1984

Four methods are outlined for estimating or approximating from a single test administration the standard error of measurement of number-right test score at specified ability levels or cutting scores. The methods are illustrated and compared on one set of real test data. (Author)

Descriptors: Academic Ability, Cutting Scores, Error of Measurement, Scoring Formulas
