Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 12 |
| Since 2007 (last 20 years) | 16 |
Descriptor
| Mathematics Tests | 23 |
| Test Length | 23 |
| Test Items | 9 |
| Achievement Tests | 8 |
| Foreign Countries | 7 |
| Item Response Theory | 7 |
| Reading Tests | 6 |
| Computer Assisted Testing | 5 |
| Grade 8 | 5 |
| Test Construction | 5 |
| Grade 9 | 4 |
| More ▼ | |
Source
Author
| Cohen, Allan S. | 2 |
| Battista, Carmela | 1 |
| Baumgarten, Bruce S. | 1 |
| Campana, Kayla V. | 1 |
| Chien, Yuehmei | 1 |
| Clements, Douglas H. | 1 |
| Dawes, Jillian M. | 1 |
| Day-Hess, Crystal A. | 1 |
| Deng, Meng | 1 |
| Dong, Yixiao | 1 |
| Dumas, Denis | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 16 |
| Reports - Research | 13 |
| Reports - Evaluative | 6 |
| Speeches/Meeting Papers | 4 |
| Numerical/Quantitative Data | 3 |
| Reports - Descriptive | 3 |
| Guides - Non-Classroom | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Secondary Education | 9 |
| Elementary Education | 6 |
| Middle Schools | 6 |
| Junior High Schools | 5 |
| Elementary Secondary Education | 4 |
| Grade 8 | 4 |
| High Schools | 4 |
| Early Childhood Education | 2 |
| Grade 3 | 2 |
| Grade 9 | 2 |
| Kindergarten | 2 |
| More ▼ | |
Audience
| Researchers | 2 |
Location
| Asia | 1 |
| Australia | 1 |
| Colombia | 1 |
| Florida | 1 |
| Indonesia | 1 |
| Jordan | 1 |
| Maryland | 1 |
| Michigan | 1 |
| Netherlands | 1 |
| Norway | 1 |
| Pennsylvania | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Program for International… | 3 |
| Trends in International… | 2 |
| COMPASS (Computer Assisted… | 1 |
| Florida Comprehensive… | 1 |
| Measures of Academic Progress | 1 |
| Texas Assessment of Basic… | 1 |
| Texas Educational Assessment… | 1 |
What Works Clearinghouse Rating
Niclas Larson – Journal of the International Society for Teacher Education, 2024
This paper reports on a revision of the assessment model from the first mathematics course for pre-service teachers (PSTs) aiming for grades 5-10, at a Norwegian university. The weight of the final written exam was reduced and a new, mastery-based testing model, with weekly small tests, was introduced. Results from this study show that the PSTs…
Descriptors: High Stakes Tests, Test Length, Mathematics Tests, Preservice Teachers
Pasquale Anselmi; Jürgen Heller; Luca Stefanutti; Egidio Robusto; Giulia Barillari – Education and Information Technologies, 2025
Competence-based test development (CbTD) is a novel method for constructing tests that are as informative as possible about the competence state (the set of skills an individual masters) underlying the item responses. If desired, the tests can also be minimal, meaning that no item can be eliminated without reducing their informativeness. To…
Descriptors: Competency Based Education, Test Construction, Test Length, Usability
Mehmet Fatih Doguyurt; Seref Tan – International Journal of Assessment Tools in Education, 2025
This study investigates the impact of violating the local item independence assumption by loading certain items onto a second dimension on test equating errors in unidimensional and dichotomous tests. The research was designed as a simulation study, using data generated based on the PISA 2018 mathematics exam. Analyses were conducted under 36…
Descriptors: Equated Scores, Test Items, Mathematics Tests, International Assessment
Saskia van Laar; Johan Braeken – International Journal of Testing, 2024
This study examined the impact of two questionnaire characteristics, scale position and questionnaire length, on the prevalence of random responders in the TIMSS 2015 eighth-grade student questionnaire. While there was no support for an absolute effect of questionnaire length, we did find a positive effect for scale position, with an increase of…
Descriptors: Middle School Students, Grade 8, Questionnaires, Test Length
Sari, Halil Ibrahim – International Journal of Psychology and Educational Studies, 2020
Due to low cost monte-carlo (MC) simulations have been extensively conducted in the area of educational measurement. However, the results derived from MC studies may not always be generalizable to operational studies. The purpose of this study was to provide a methodological discussion on the other different types of simulation methods, and run…
Descriptors: Computer Assisted Testing, Adaptive Testing, Simulation, Test Length
Dong, Yixiao; Clements, Douglas H.; Day-Hess, Crystal A.; Sarama, Julie; Dumas, Denis – Journal of Psychoeducational Assessment, 2021
Psychometric work with young children faces the particular challenge that children's attention spans are relatively short, and therefore, shorter assessments are required while retaining comprehensive coverage. This article reports on three empirical studies that encompass the development and validation of the research-based early mathematics…
Descriptors: Young Children, Numeracy, Test Construction, Test Validity
Murphy, Michael – Australian Mathematics Education Journal, 2022
South Australian Secondary Mathematics teachers consider their testing parameters (duration and frequency) to be consistent with teachers elsewhere in the state, however, formal evidence is not available. A review of literature presented similar gaps in this evidence in other jurisdictions in Australia. The logic model that underpins the resource…
Descriptors: Foreign Countries, Secondary School Teachers, Secondary School Mathematics, Mathematics Tests
Precision of Single-Skill Math CBM Time-Series Data: The Effect of Probe Stratification and Set Size
Solomon, Benjamin G.; Payne, Lexy L.; Campana, Kayla V.; Marr, Erin A.; Battista, Carmela; Silva, Alex; Dawes, Jillian M. – Journal of Psychoeducational Assessment, 2020
Comparatively little research exists on single-skill math (SSM) curriculum-based measurements (CBMs) for the purpose of monitoring growth, as may be done in practice or when monitoring intervention effectiveness within group or single-case research. Therefore, we examined a common variant of SSM-CBM: 1 digit × 1 digit multiplication. Reflecting…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Mathematics Skills, Multiplication
Karadavut, Tugba; Cohen, Allan S.; Kim, Seock-Ho – Measurement: Interdisciplinary Research and Perspectives, 2020
Mixture Rasch (MixRasch) models conventionally assume normal distributions for latent ability. Previous research has shown that the assumption of normality is often unmet in educational and psychological measurement. When normality is assumed, asymmetry in the actual latent ability distribution has been shown to result in extraction of spurious…
Descriptors: Item Response Theory, Ability, Statistical Distributions, Sample Size
Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019
In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…
Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores
James, Syretta R.; Liu, Shihching Jessica; Maina, Nyambura; Wade, Julie; Wang, Helen; Wilson, Heather; Wolanin, Natalie – Montgomery County Public Schools, 2021
The impact of the COVID-19 pandemic continues to overwhelm the functioning and outcomes of educational systems throughout the nation. The public education system is under particular scrutiny given that students, families, and educators are under considerable stress to maintain academic progress. Since the beginning of the crisis, school-systems…
Descriptors: Achievement Tests, COVID-19, Pandemics, Public Schools
NWEA, 2018
Thousands of U.S. school districts and many international schools use MAP® Growth™ to monitor the academic growth of their students and to inform instruction. The MAP Growth assessment is untimed, meaning that limits are not placed on how much time a student has to respond to the items. However, to help schools understand the amount of time MAP…
Descriptors: Achievement Tests, Test Length, Achievement Gains, Mathematics Tests
Wyse, Adam E.; Hao, Shiqi – Applied Psychological Measurement, 2012
This article introduces two new classification consistency indices that can be used when item response theory (IRT) models have been applied. The new indices are shown to be related to Rudner's classification accuracy index and Guo's classification accuracy index. The Rudner- and Guo-based classification accuracy and consistency indices are…
Descriptors: Item Response Theory, Classification, Accuracy, Reliability
Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012
Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models
Pennsylvania Department of Education, 2010
This handbook describes the responsibilities of district and school assessment coordinators in the administration of the Pennsylvania System of School Assessment (PSSA). This updated guidebook contains the following sections: (1) General Assessment Guidelines for All Assessments; (2) Writing Specific Guidelines; (3) Reading and Mathematics…
Descriptors: Guidelines, Guides, Educational Assessment, Writing Tests
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
