Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 17 |
| Since 2007 (last 20 years) | 39 |
Descriptor
| Test Items | 42 |
| Grade 8 | 34 |
| Test Validity | 31 |
| Test Construction | 18 |
| Difficulty Level | 17 |
| Item Response Theory | 16 |
| Mathematics Tests | 16 |
| Foreign Countries | 15 |
| Middle School Students | 13 |
| Science Tests | 12 |
| Test Reliability | 12 |
| More ▼ | |
Source
Author
Publication Type
| Reports - Research | 29 |
| Journal Articles | 26 |
| Reports - Evaluative | 6 |
| Reports - Descriptive | 5 |
| Numerical/Quantitative Data | 4 |
| Tests/Questionnaires | 3 |
| Dissertations/Theses -… | 2 |
| Speeches/Meeting Papers | 2 |
Education Level
| Grade 8 | 42 |
| Middle Schools | 32 |
| Secondary Education | 29 |
| Elementary Education | 28 |
| Junior High Schools | 27 |
| Grade 7 | 14 |
| Grade 4 | 12 |
| Grade 6 | 11 |
| Elementary Secondary Education | 10 |
| Intermediate Grades | 9 |
| Grade 5 | 8 |
| More ▼ | |
Audience
| Policymakers | 1 |
| Teachers | 1 |
Location
| Turkey | 5 |
| California | 4 |
| Georgia | 2 |
| Singapore | 2 |
| Alabama | 1 |
| Arizona | 1 |
| Arkansas | 1 |
| Canada | 1 |
| Connecticut | 1 |
| Florida | 1 |
| Germany | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Trends in International… | 7 |
| National Assessment of… | 5 |
| Flesch Kincaid Grade Level… | 1 |
What Works Clearinghouse Rating
Suciati; Munadi, Sudji; Sugiman; Febriyanti, Wiwin Dwi Ratna – European Journal of Educational Research, 2020
This study aims to design mathematical literacy instruments that have evidence of content and construct validity and are reliable for use as an assessment for learning. The research involved eight experts as instrument validators and 273 eighth-grade students of junior high school in Yogyakarta Province. The results showed that the ten…
Descriptors: Numeracy, Mathematics Tests, Test Construction, Test Validity
Arikan, Serkan; Erktin, Emine; Pesen, Melek – International Journal of Science and Mathematics Education, 2022
The aim of this study is to construct a STEM competencies assessment framework and provide validity evidence by empirically testing its structure. Common interdisciplinary assessment frameworks for STEM seem to be scarce in the literature. Many studies use students' mathematics or science scores obtained from large-scale assessments or exams to…
Descriptors: STEM Education, Competence, Interdisciplinary Approach, Test Construction
Stammen, Andria – ProQuest LLC, 2018
The aim of this research is to develop a measurement instrument that is valid and reliable, called the Middle School-Life Science Concept Inventory (MS-LSCI), for the purpose of measuring the life science conceptual understanding of middle school-level students. Although there are several existing concept inventories related to biology concepts…
Descriptors: Science Tests, Biological Sciences, Middle School Students, Scientific Concepts
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Sparks, Jesse R.; van Rijn, Peter W.; Deane, Paul – Educational Assessment, 2021
Effectively evaluating the credibility and accuracy of multiple sources is critical for college readiness. We developed 24 source evaluation tasks spanning four predicted difficulty levels of a hypothesized learning progression (LP) and piloted these tasks to evaluate the utility of an LP-based approach to designing formative literacy assessments.…
Descriptors: Middle School Students, Information Sources, Grade 6, Grade 7
Wang, Yan; Kim, Eun Sook; Dedrick, Robert F.; Ferron, John M.; Tan, Tony – Educational and Psychological Measurement, 2018
Wording effects associated with positively and negatively worded items have been found in many scales. Such effects may threaten construct validity and introduce systematic bias in the interpretation of results. A variety of models have been applied to address wording effects, such as the correlated uniqueness model and the correlated traits and…
Descriptors: Test Items, Test Format, Correlation, Construct Validity
Uzun, Aysenur; Kilickaya, Ferit – Online Submission, 2020
One of the most important dynamics of the educational setting all over the world is examinations, and some of those are English tests. In Turkey, English tests for the students preparing for the high schools were included in the national exams with the Level Determination Exam (SBS) in 2008 for the first time, and Transition Examination from…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Content Validity
Alnasraween, Moen Salman; Almughrabi, Ayat Mohammad; Ammari, Raeda Mofid; Alkaramneh, Mohammad Saleh – Cypriot Journal of Educational Sciences, 2021
The purpose of this study is to construct a digital culture test in light of the Item Response Theory and to investigate its psychometric properties. The study sample consisted of six hundred fifty (650) male and female students in the eighth grade from the Directorate of Education and Teaching of Salt District. To obtain the results, the…
Descriptors: Foreign Countries, Technological Literacy, Tests, Psychometrics
Saatçioglu, Fatima Münevver; Atar, Hakan Yavuz – Participatory Educational Research, 2020
This study examined the existence of latent classes in TIMSS 2015 data from three countries, Singapure, Turkey and South Africa, were analyzed using Mixture Item Response Theory (MixIRT) models (Rasch, 1PL, 2PL and 3PL) on 18 multiple-choice items in the science subtest. Based on the findings, it was concluded that the data obtained from TIMSS…
Descriptors: Foreign Countries, Item Response Theory, Achievement Tests, International Assessment
Lee, Hollylynne; Bradshaw, Laine; Famularo, Lisa; Masters, Jessica; Azevedo, Roger; Johnson, Sheri; Schellman, Madeline; Elrod, Emily; Sanei, Hamid – Grantee Submission, 2019
The research shared in this conference paper report illustrates how an iterative process to item development that involves expert review and cognitive lab interviews with students can be used to collect evidence of validity for assessment items. Analysis of students' reasoning was also used to expand a model for identifying conceptions and…
Descriptors: Middle School Students, Interviews, Misconceptions, Test Items
Durán, Richard P.; Zhang, Ting; Sañosa, David; Stancavage, Fran – American Institutes for Research, 2020
The National Assessment of Educational Progress's (NAEP's) transition to an entirely digitally based assessment (DBA) began in 2017. As part of this transition, new types of NAEP items have begun to be developed that leverage the DBA environment to measure a wider range of knowledge and skills. These new item types include the science…
Descriptors: National Competency Tests, Computer Assisted Testing, Science Tests, Test Items
Koskey, Kristin L. K.; Makki, Nidaa; Ahmed, Wondimu; Garafolo, Nicholas G.; Visco, Donald P., Jr. – School Science and Mathematics, 2020
Integrating engineering into the K-12 science curriculum continues to be a focus in national reform efforts in science education. Although there is an increasing interest in research in and practice of integrating engineering in K-12 science education, to date only a few studies have focused on the development of an assessment tool to measure…
Descriptors: Middle School Students, Engineering, Design, Science Education
O'Malley, Fran; Norton, Scott – American Institutes for Research, 2022
This paper provides the National Center for Education Statistics (NCES), National Assessment Governing Board (NAGB), and the National Assessment of Educational Progress (NAEP) community with information that may help maintain the validity and utility of the NAEP assessments for civics and U.S. history as revisions are planned to the NAEP…
Descriptors: National Competency Tests, United States History, Test Validity, Governing Boards
Webb, Anne Frank – ProQuest LLC, 2018
Although there are several comprehensive measures of children's anxiety symptoms, many have limited utility for screening within school settings. As such, the primary purpose of this dissertation was to develop a brief self-report anxiety measure appropriate for use with students in Grades 3-8. An initial pool of 50 items was developed based on…
Descriptors: Test Construction, Anxiety, Screening Tests, Grade 3
Turkan, Azmi; Cetin, Bayram – Journal of Education and Practice, 2017
Validity and reliability are among the most crucial characteristics of a test. One of the steps to make sure that a test is valid and reliable is to examine the bias in test items. The purpose of this study was to examine the bias in 2012 Placement Test items in terms of gender variable using Rasch Model in Turkey. The sample of this study was…
Descriptors: Item Response Theory, Gender Differences, Test Bias, Test Items

Peer reviewed
Direct link
