Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Computation | 10 |
| Test Validity | 10 |
| Student Evaluation | 6 |
| Test Construction | 5 |
| Test Reliability | 5 |
| Test Items | 4 |
| Evaluation Methods | 3 |
| Goodness of Fit | 3 |
| Mathematics Achievement | 3 |
| Psychometrics | 3 |
| Scoring | 3 |
| More ▼ | |
Source
Author
| Bachor, Dan G. | 1 |
| Calmettes, Guillaume | 1 |
| Cloonan, Carrie A. | 1 |
| Courey, Susan J. | 1 |
| Drummond, Gordon B. | 1 |
| Feldt, Leonard S. | 1 |
| Fuchs, Douglas | 1 |
| Fuchs, Lynn S. | 1 |
| Griph, Gerald W. | 1 |
| Hutchinson, John S. | 1 |
| Jung, Eunju | 1 |
| More ▼ | |
Publication Type
| Reports - Descriptive | 10 |
| Journal Articles | 7 |
| Numerical/Quantitative Data | 3 |
Education Level
| Elementary Secondary Education | 3 |
| Grade 4 | 2 |
| Higher Education | 2 |
| Elementary Education | 1 |
| Grade 3 | 1 |
| Grade 5 | 1 |
| Grade 6 | 1 |
| Grade 7 | 1 |
| Grade 8 | 1 |
| Middle Schools | 1 |
Audience
| Researchers | 1 |
Location
| New Mexico | 2 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Lewis, Todd F. – Measurement and Evaluation in Counseling and Development, 2017
American Educational Research Association (AERA) standards stipulate that researchers show evidence of the internal structure of instruments. Confirmatory factor analysis (CFA) is one structural equation modeling procedure designed to assess construct validity of assessments that has broad applicability for counselors interested in instrument…
Descriptors: Educational Research, Factor Analysis, Structural Equation Models, Construct Validity
Wilson, F. Robert; Pan, Wei; Schumsky, Donald A. – Measurement and Evaluation in Counseling and Development, 2012
The content validity ratio (Lawshe) is one of the earliest and most widely used methods for quantifying content validity. To correct and expand the table, critical values in unit steps and at multiple alpha levels were computed. Implications for content validation are discussed. (Contains 2 tables and 1 figure.)
Descriptors: Content Validity, Computation, Test Validity
Calmettes, Guillaume; Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012
A jack knife is a pocket knife that is put to many tasks, because it's ready to hand. Often there could be a better tool for the job, such as a screwdriver, a scraper, or a can-opener, but these are not usually pocket items. In statistical terms, the expression implies making do with what's available. Another simile, of an extreme situation, is…
Descriptors: Statistical Analysis, Computation, Population Distribution, Evaluation Methods
Cloonan, Carrie A.; Hutchinson, John S. – Chemistry Education Research and Practice, 2011
A Chemistry Concept Reasoning Test was created and validated providing an easy-to-use tool for measuring conceptual understanding and critical scientific thinking of general chemistry models and theories. The test is designed to measure concept understanding comparable to that found in free-response questions requiring explanations over…
Descriptors: Test Validity, Chemistry, Correlation, Multiple Choice Tests
Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…
Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests
Peer reviewedFeldt, Leonard S. – Measurement & Evaluation in Counseling & Development, 2004
In some settings, the validity of a battery composite or a test score is enhanced by weighting some parts or items more heavily than others in the total score. This article describes methods of estimating the total score reliability coefficient when differential weights are used with items or parts.
Descriptors: Test Items, Scoring, Cognitive Processes, Test Validity
Fuchs, Lynn S.; Fuchs, Douglas; Courey, Susan J. – Assessment for Effective Intervention, 2005
In this article, the authors explain how curriculum-based measurement (CBM) differs from other forms of classroom-based assessment. The development of CBM is traced from computation to concepts and applications to real-life problem solving, with examples of the assessments and illustrations of research to document technical features and utility…
Descriptors: Curriculum Based Assessment, Mathematics Skills, Case Studies, Computation
Bachor, Dan G. – Diagnostique, 1990
KeyMath Revised was devised as a power test for use with students from kindergarten through grade 9. The test is divided into three dimensions: basic concepts, operations, and applications. This paper describes the test's administration, summation of data, standardization, reliability, and validity. (JDD)
Descriptors: Achievement Tests, Computation, Elementary Secondary Education, Mathematical Applications
New Mexico Public Education Department, 2007
The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…
Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring
Griph, Gerald W. – New Mexico Public Education Department, 2006
The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2006 NMSBA. The 2006 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Calibration, scaling, and equating procedures; (4) Standard setting;…
Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

Direct link
