Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Kyung-Mi O. – Language Testing in Asia, 2024
This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…
Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items
Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022
Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
Lance M. Kruse – ProQuest LLC, 2019
This study explores six item-reduction methodologies used to shorten an existing complex problem-solving non-objective test by evaluating how each shortened form performs across three sources of validity evidence (i.e., test content, internal structure, and relationships with other variables). Two concerns prompted the development of the present…
Descriptors: Educational Assessment, Comparative Analysis, Test Format, Test Length
Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017
Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. Three scaling procedures are considered: (a) concurrent…
Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items
Zhu, Mengxiao; Shu, Zhan; von Davier, Alina A. – Journal of Educational Measurement, 2016
New technology enables interactive and adaptive scenario-based tasks (SBTs) to be adopted in educational measurement. At the same time, it is a challenging problem to build appropriate psychometric models to analyze data collected from these tasks, due to the complexity of the data. This study focuses on process data collected from SBTs. We…
Descriptors: Measurement, Data Collection, National Competency Tests, Scoring Rubrics
Simsek, Ali – European Journal of Science and Mathematics Education, 2016
Test development has been an important part of measurement and evaluation in any educational setting, whether its purpose is instruction or training. Both teachers and trainers are expected to have a certain level of mastery in developing reliable and valid tests for assessing performance of learners adequately. However, it has often been reported…
Descriptors: Comparative Analysis, Achievement Tests, Test Construction, Teacher Education
Koziol, Natalie A. – Applied Measurement in Education, 2016
Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…
Descriptors: Classification, Accuracy, Comparative Analysis, Models
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Ishii, Takatoshi; Songmuang, Pokpong; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2014
Educational assessments occasionally require uniform test forms for which each test form comprises a different set of items, but the forms meet equivalent test specifications (i.e., qualities indicated by test information functions based on item response theory). We propose two maximum clique algorithms (MCA) for uniform test form assembly. The…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
Singer, Judith D., Ed.; Braun, Henry I., Ed.; Chudowsky, Naomi, Ed. – National Academy of Education, 2018
Results from international large-scale assessments (ILSAs) garner considerable attention in the media, academia, and among policy makers. Although there is widespread recognition that ILSAs can provide useful information, there is debate about what types of comparisons are the most meaningful and what could be done to assure more sound…
Descriptors: International Education, Educational Assessment, Educational Policy, Data Interpretation
Chen, Haiwen H.; von Davier, Matthias; Yamamoto, Kentaro; Kong, Nan – ETS Research Report Series, 2015
One major issue with large-scale assessments is that the respondents might give no responses to many items, resulting in less accurate estimations of both assessed abilities and item parameters. This report studies how the types of items affect the item-level nonresponse rates and how different methods of treating item-level nonresponses have an…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Oliveri, Maria Elena; von Davier, Matthias – International Journal of Testing, 2014
In this article, we investigate the creation of comparable score scales across countries in international assessments. We examine potential improvements to current score scale calibration procedures used in international large-scale assessments. Our approach seeks to improve fairness in scoring international large-scale assessments, which often…
Descriptors: Test Bias, Scores, International Programs, Educational Assessment
McQuillan, Mark; Phelps, Richard P.; Stotsky, Sandra – Pioneer Institute for Public Policy Research, 2015
In July 2010, the Massachusetts Board of Elementary and Secondary Education (BESE) voted to adopt Common Core's standards in English language arts (ELA) and mathematics in place of the state's own standards in these two subjects. The vote was based largely on recommendations by Commissioner of Education Mitchell Chester and then Secretary of…
Descriptors: Reading Tests, Writing Tests, Achievement Tests, Common Core State Standards
McClellan, Catherine; Joe, Jilliam; Bassett, Katherine – National Network of State Teachers of the Year, 2017
This study continues the work that National Network of State Teachers of the Year (NNSTOY) and its partners, Clowder Consulting and Education-Counsel, began with their "Right Trajectory" study (see ED581172), released in 2015, and continued in their "Still the Right Trajectory" study (see ED581173) released in early 2017. In…
Descriptors: Grade 11, Comparative Analysis, Common Core State Standards, Summative Evaluation

