ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	6

Descriptor

Computation	10
Test Validity	10
Student Evaluation	6
Test Construction	5
Test Reliability	5
Test Items	4
Evaluation Methods	3
Goodness of Fit	3
Mathematics Achievement	3
Psychometrics	3
Scoring	3
Academic Standards	2
Cutting Scores	2
English	2
Error of Measurement	2
Interrater Reliability	2
Item Response Theory	2
Mathematical Applications	2
Mathematical Concepts	2
Mathematics Tests	2
Measures (Individuals)	2
Models	2
Public Education	2
Raw Scores	2
Reading Achievement	2
More ▼

Source

Measurement and Evaluation in…	2
New Mexico Public Education…	2
Advances in Physiology…	1
Assessment for Effective…	1
Behavioral Research and…	1
Chemistry Education Research…	1
Diagnostique	1
Measurement & Evaluation in…	1

Publication Type

Reports - Descriptive	10
Journal Articles	7
Numerical/Quantitative Data	3

Education Level

Elementary Secondary Education	3
Grade 4	2
Higher Education	2
Elementary Education	1
Grade 3	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Middle Schools	1

Audience

Researchers

Location

New Mexico

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Evidence Regarding the Internal Structure: Confirmatory Factor Analysis

Peer reviewed

Direct link

Lewis, Todd F. – Measurement and Evaluation in Counseling and Development, 2017

American Educational Research Association (AERA) standards stipulate that researchers show evidence of the internal structure of instruments. Confirmatory factor analysis (CFA) is one structural equation modeling procedure designed to assess construct validity of assessments that has broad applicability for counselors interested in instrument…

Descriptors: Educational Research, Factor Analysis, Structural Equation Models, Construct Validity

Recalculation of the Critical Values for Lawshe's Content Validity Ratio

Peer reviewed

Direct link

Wilson, F. Robert; Pan, Wei; Schumsky, Donald A. – Measurement and Evaluation in Counseling and Development, 2012

The content validity ratio (Lawshe) is one of the earliest and most widely used methods for quantifying content validity. To correct and expand the table, critical values in unit steps and at multiple alpha levels were computed. Implications for content validation are discussed. (Contains 2 tables and 1 figure.)

Descriptors: Content Validity, Computation, Test Validity

Making Do with What We Have: Use Your Bootstraps

Peer reviewed

Direct link

Calmettes, Guillaume; Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012

A jack knife is a pocket knife that is put to many tasks, because it's ready to hand. Often there could be a better tool for the job, such as a screwdriver, a scraper, or a can-opener, but these are not usually pocket items. In statistical terms, the expression implies making do with what's available. Another simile, of an extreme situation, is…

Descriptors: Statistical Analysis, Computation, Population Distribution, Evaluation Methods

A Chemistry Concept Reasoning Test

Peer reviewed

Direct link

Cloonan, Carrie A.; Hutchinson, John S. – Chemistry Education Research and Practice, 2011

A Chemistry Concept Reasoning Test was created and validated providing an easy-to-use tool for measuring conceptual understanding and critical scientific thinking of general chemistry models and theories. The test is designed to measure concept understanding comparable to that found in free-response questions requiring explanations over…

Descriptors: Test Validity, Chemistry, Correlation, Multiple Choice Tests

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Download full text

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Estimating the Reliability of a Test Battery Composite or a Test Score Based on Weighted Item Scoring

Peer reviewed

Feldt, Leonard S. – Measurement & Evaluation in Counseling & Development, 2004

In some settings, the validity of a battery composite or a test score is enhanced by weighting some parts or items more heavily than others in the total score. This article describes methods of estimating the total score reliability coefficient when differential weights are used with items or parts.

Descriptors: Test Items, Scoring, Cognitive Processes, Test Validity

Curriculum-Based Measurement of Mathematics Competence: From Computation to Concepts and Applications to Real-Life Problem Solving

Peer reviewed

Direct link

Fuchs, Lynn S.; Fuchs, Douglas; Courey, Susan J. – Assessment for Effective Intervention, 2005

In this article, the authors explain how curriculum-based measurement (CBM) differs from other forms of classroom-based assessment. The development of CBM is traced from computation to concepts and applications to real-life problem solving, with examples of the assessments and illustrations of research to document technical features and utility…

Descriptors: Curriculum Based Assessment, Mathematics Skills, Case Studies, Computation

KeyMath--Revised (KMR).

Bachor, Dan G. – Diagnostique, 1990

KeyMath Revised was devised as a power test for use with students from kindergarten through grade 9. The test is divided into three dimensions: basic concepts, operations, and applications. This paper describes the test's administration, summation of data, standardization, reliability, and validity. (JDD)

Descriptors: Achievement Tests, Computation, Elementary Secondary Education, Mathematical Applications

New Mexico Standards-Based Assessment Technical Report: Spring 2007 Administration

Download full text

New Mexico Public Education Department, 2007

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

New Mexico Standards Based Assessment (NMSBA) Technical Report: 2006 Spring Administration

Download full text

Griph, Gerald W. – New Mexico Public Education Department, 2006

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2006 NMSBA. The 2006 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Calibration, scaling, and equating procedures; (4) Standard setting;…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

Bachor, Dan G.	1
Calmettes, Guillaume	1
Cloonan, Carrie A.	1
Courey, Susan J.	1
Drummond, Gordon B.	1
Feldt, Leonard S.	1
Fuchs, Douglas	1
Fuchs, Lynn S.	1
Griph, Gerald W.	1
Hutchinson, John S.	1
Jung, Eunju	1
Ketterlin-Geller, Leanne R.	1
Lewis, Todd F.	1
Liu, Kimy	1
Pan, Wei	1
Schumsky, Donald A.	1
Tindal, Gerald	1
Vowler, Sarah L.	1
Wilson, F. Robert	1
More ▼