Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 15 |
| Since 2007 (last 20 years) | 36 |
Descriptor
| Construct Validity | 53 |
| Mathematics Tests | 53 |
| Test Items | 18 |
| Foreign Countries | 17 |
| Elementary School Students | 15 |
| Test Validity | 13 |
| Mathematics Achievement | 12 |
| Test Construction | 12 |
| Test Reliability | 11 |
| Achievement Tests | 10 |
| Scores | 10 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 3 |
| Anderson, Daniel | 3 |
| Lai, Cheng-Fei | 3 |
| Tindal, Gerald | 3 |
| Banerji, Madhabi | 2 |
| Chen, Yi-Hsin | 2 |
| Nese, Joseph F. T. | 2 |
| Saez, Leilani | 2 |
| Soine, Karen M. | 2 |
| Abdelfattah, Faisal A. | 1 |
| Abduljabbar, Adel S. | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 3 |
Location
| Washington | 5 |
| Canada | 3 |
| Oregon | 3 |
| United States | 3 |
| Finland | 2 |
| Indonesia | 2 |
| Massachusetts | 2 |
| Taiwan | 2 |
| Turkey | 2 |
| Alaska | 1 |
| Australia | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Yijie Li; Xin Lin; Chuang Wang – Psychology in the Schools, 2025
This study provides validity evidence supporting the interpretation of scores from the Mathematics Vocabulary Self-Efficacy Scale (MVSES), guided by the five sources of validity described in the "Standards for Educational and Psychological Testing" (AERA, APA, & NCME, 2014). A sample of 435 Chinese fourth-grade students participated…
Descriptors: Psychometrics, Mathematics Tests, Scores, Vocabulary
Jahudin, Janet; Siew, Nyet Moi – Problems of Education in the 21st Century, 2023
Diagnostic tests have been developed previously to measure algebraic thinking skills; however, the tests do not specifically address algebraic problem-solving. Thus, an Algebraic Thinking Test (ATT) Instrument was developed to measure algebraic thinking skills in problem-solving involving linear equations. ATT comprises nine open-ended questions…
Descriptors: Algebra, Thinking Skills, Foreign Countries, Problem Solving
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Bal Incebacak, Belgin; Ersoy, Esen – Participatory Educational Research, 2022
This study aims to develop a valid and reliable measuring instrument that can measure the level of mathematical language used by students and their teachers during the teaching of fractions to fourth-grade primary school students. This study is a methodological validity and reliability study. In total, 999 students from fourteen different…
Descriptors: Test Construction, Mathematics Tests, Vocabulary, Mathematics Skills
Chen, Yi-Hsin – Journal of Psychoeducational Assessment, 2022
The quality of diagnostic profiles and probability assignment depends on the validity of the proposed attributes and Q-matrix. The rule-space method (RSM), one of diagnostic classification models, provides the quality indices of diagnostic profiles, such as the classification rate and the squared Mahalanobis distance. The study aims to further…
Descriptors: Profiles, Probability, Classification, Construct Validity
Russell, Michael; Moncaleano, Sebastian – Educational Assessment, 2019
Over the past decade, large-scale testing programs have employed technology-enhanced items (TEI) to improve the fidelity with which an item measures a targeted construct. This paper presents findings from a review of released TEIs employed by large-scale testing programs worldwide. Analyses examine the prevalence with which different types of TEIs…
Descriptors: Computer Assisted Testing, Fidelity, Elementary Secondary Education, Test Items
Cohen, Dale J.; Ballman, Alesha; Rijmen, Frank; Cohen, Jon – Applied Measurement in Education, 2020
Computer-based, pop-up glossaries are perhaps the most promising accommodation aimed at mitigating the influence of linguistic structure and cultural bias on the performance of English Learner (EL) students on statewide assessments. To date, there is no established procedure for identifying the words that require a glossary for EL students that is…
Descriptors: Glossaries, Testing Accommodations, English Language Learners, Computer Assisted Testing
Nazli Uygun Emil – ProQuest LLC, 2020
Validity of a measurement refers to appropriate test score meanings, uses, and interpretations (Messick, 1989; Kane, 1992). There are different approaches to validity: an evidentiary aspect of validity is one requiring gathering statistical evidence to evaluate test score meaning. A common approach to validation is comparisons of test score equity…
Descriptors: Educational Quality, Mathematics Tests, Test Validity, Test Reliability
Suciati; Munadi, Sudji; Sugiman; Febriyanti, Wiwin Dwi Ratna – European Journal of Educational Research, 2020
This study aims to design mathematical literacy instruments that have evidence of content and construct validity and are reliable for use as an assessment for learning. The research involved eight experts as instrument validators and 273 eighth-grade students of junior high school in Yogyakarta Province. The results showed that the ten…
Descriptors: Numeracy, Mathematics Tests, Test Construction, Test Validity
Meredith P. Franco; Jessika H. Bottiani; Katrina J. Debnam; Wes Bonifay; Toshna Pandey; Juliana Karras; Catherine P. Bradshaw – Grantee Submission, 2024
There is growing interest in improving and assessing teachers' use of culturally responsive practices (CRP) in the classroom, yet relatively few research-based approaches exist to address these measurement gaps. This article presents findings on the psychometric properties of a newly developed classroom observation measure of CRP, called the CARES…
Descriptors: Culturally Relevant Education, Classroom Observation Techniques, Construct Validity, Educational Practices
Holmes, Stephen D.; He, Qingping; Meadows, Michelle – Research in Mathematics Education, 2017
The relationship between the characteristics of 33 mathematical problem-solving questions answered by 16-year-old students in England and the quality of problem-solving elicited was investigated in two studies. The first study used comparative judgement (CJ) to estimate the quality of the problem-solving elicited by each question, involving 33…
Descriptors: Foreign Countries, Mathematics Skills, Problem Solving, Mathematical Logic
Michaelides, Michalis P. – Applied Measurement in Education, 2019
The Student Background survey administered along with achievement tests in studies of the International Association for the Evaluation of Educational Achievement includes scales of student motivation, competence, and attitudes toward mathematics and science. The scales consist of positively- and negatively keyed items. The current research…
Descriptors: International Assessment, Achievement Tests, Mathematics Achievement, Mathematics Tests
Toker, Turker; Green, Kathy E. – AERA Online Paper Repository, 2017
This study provides the results of a latent class analysis (LCA) using data from the Trends in International Mathematics and Science Study -- 2011 (TIMSS-2011) with a focus on the 8th grade mathematics section. The study presents the analysis of item data with Mplus 7.31 to determine if results obtained yielded distinct latent subgroups. The data…
Descriptors: Cross Cultural Studies, Comparative Education, Achievement Tests, Elementary Secondary Education
Crabtree, Ashleigh R. – ProQuest LLC, 2016
The purpose of this research is to provide information about the psychometric properties of technology-enhanced (TE) items and the effects these items have on the content validity of an assessment. Specifically, this research investigated the impact that the inclusion of TE items has on the construct of a mathematics test, the technical properties…
Descriptors: Psychometrics, Computer Assisted Testing, Test Items, Test Format
Cheng, Weiyi; Lei, Pui-Wa; DiPerna, James C. – Journal of Experimental Education, 2017
The purpose of the current study was to examine dimensionality and concurrent validity evidence of the EARLI numeracy measures (DiPerna, Morgan, & Lei, 2007), which were developed to assess key skills such as number identification, counting, and basic arithmetic. Two methods (NOHARM with approximate chi-square test and DIMTEST with DETECT…
Descriptors: Construct Validity, Numeracy, Mathematics Tests, Statistical Analysis

Peer reviewed
Direct link
