Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 30 |
| Since 2007 (last 20 years) | 39 |
Descriptor
| Data Collection | 14 |
| Data Analysis | 10 |
| Models | 10 |
| Data | 8 |
| Educational Assessment | 8 |
| Scores | 8 |
| Psychometrics | 7 |
| Computer Assisted Testing | 5 |
| Item Response Theory | 5 |
| Measurement | 5 |
| Test Items | 5 |
| More ▼ | |
Source
| Educational Measurement:… | 39 |
Author
| Sinharay, Sandip | 4 |
| Cui, Zhongmin | 3 |
| Clauser, Brian E. | 2 |
| Feinberg, Richard A. | 2 |
| Haberman, Shelby J. | 2 |
| Margolis, Melissa J. | 2 |
| Sireci, Stephen G. | 2 |
| Ackerman, Terry A. | 1 |
| Ames, Allison | 1 |
| An, Lily Shiao | 1 |
| Andrew Hoang | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 39 |
| Reports - Research | 17 |
| Reports - Descriptive | 16 |
| Reports - Evaluative | 5 |
| Information Analyses | 1 |
| Opinion Papers | 1 |
Education Level
| Secondary Education | 4 |
| Elementary Education | 2 |
| Elementary Secondary Education | 2 |
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Grade 4 | 1 |
| High Schools | 1 |
| Intermediate Grades | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
Audience
| Practitioners | 1 |
| Students | 1 |
Location
| California | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| Iowa Tests of Basic Skills | 1 |
| Iowa Tests of Educational… | 1 |
| National Assessment of… | 1 |
| Program for International… | 1 |
| SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Stella Y. Kim; Sungyeun Kim – Educational Measurement: Issues and Practice, 2025
This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A…
Descriptors: Generalizability Theory, Automation, Test Items, Students
Silvia Testa; Renato Miceli; Renato Miceli – Educational Measurement: Issues and Practice, 2025
Random Equating (RE) and Heuristic Approach (HA) are two linking procedures that may be used to compare the scores of individuals in two tests that measure the same latent trait, in conditions where there are no common items or individuals. In this study, RE--that may only be used when the individuals taking the two tests come from the same…
Descriptors: Comparative Testing, Heuristics, Problem Solving, Personality Traits
Shaw, Mairead; Flake, Jessica K. – Educational Measurement: Issues and Practice, 2023
Clustered data structures are common in many areas of educational and psychological research (e.g., students clustered in schools, patients clustered by clinician). In the course of conducting research, questions are often administered to obtain scores reflecting latent constructs. Multilevel measurement models (MLMMs) allow for modeling…
Descriptors: Hierarchical Linear Modeling, Research Methodology, Data Analysis, Structural Equation Models
Jiahui Zhang; William H. Schmidt – Educational Measurement: Issues and Practice, 2024
Measuring opportunities to learn (OTL) is crucial for evaluating education quality and equity, but obtaining accurate and comprehensive OTL data at a large scale remains challenging. We attempt to address this issue by investigating measurement concerns in data collection and sampling. With the primary goal of estimating group-level OTLs for large…
Descriptors: Educational Opportunities, Measurement Techniques, Data Collection, Grade 4
Setzer, J. Carl; Cui, Zhongmin – Educational Measurement: Issues and Practice, 2022
Data visualization is a core tenet of communicating measurement research and outcomes. Measurement professionals utilize data visualization in various phases of research, including exploration and communication. However, data visualization has not received enough attention in the measurement field. While it is true that many measurement graphics…
Descriptors: Measures (Individuals), Outcome Measures, Visual Aids, Data Analysis
Sireci, Stephen G.; Suarez-Alvarez, Javier – Educational Measurement: Issues and Practice, 2022
The COVID-19 pandemic negatively affected the quality of data from educational testing programs. These data were previously used for many important purposes ranging from placing students in instructional programs to school accountability. In this article, we draw from the research design literature to point out the limitations inherent in…
Descriptors: Decision Making, Data Use, COVID-19, Pandemics
Mo Zhang; Paul Deane; Andrew Hoang; Hongwen Guo; Chen Li – Educational Measurement: Issues and Practice, 2025
In this paper, we describe two empirical studies that demonstrate the application and modeling of keystroke logs in writing assessments. We illustrate two different approaches of modeling differences in writing processes: analysis of mean differences in handcrafted theory-driven features and use of large language models to identify stable personal…
Descriptors: Writing Tests, Computer Assisted Testing, Keyboarding (Data Entry), Writing Processes
Woolverton, Genevieve Alice; Pollastri, Alisha R. – Educational Measurement: Issues and Practice, 2021
Within classrooms, psychologists and teachers use direct behavior observation methods, systematic behavior observations (SBOs) and direct behavior ratings (DBRs), to gather information about students' behaviors for the purposes of making decisions related to diagnosis and classroom management or behavioral feedback respectively. Observers use SBOs…
Descriptors: Student Behavior, Classroom Observation Techniques, Behavior Rating Scales, Behavior Patterns
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Voss, Nathaniel M.; Vangsness, Lisa – Educational Measurement: Issues and Practice, 2020
While it is easy to assume that university students who wait until the last minute to complete surveys for their class research requirements provide low-quality data, this issue has not been empirically examined. The goal of the present study was to examine the relation between student research procrastination and two important data quality…
Descriptors: Time Management, College Students, Data Collection, Student Surveys
Ulitzsch, Esther; Lüdtke, Oliver; Robitzsch, Alexander – Educational Measurement: Issues and Practice, 2023
Country differences in response styles (RS) may jeopardize cross-country comparability of Likert-type scales. When adjusting for rather than investigating RS is the primary goal, it seems advantageous to impose minimal assumptions on RS structures and leverage information from multiple scales for RS measurement. Using PISA 2015 background…
Descriptors: Response Style (Tests), Comparative Analysis, Achievement Tests, Foreign Countries
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2021
Technical difficulties occasionally lead to missing item scores and hence to incomplete data on computerized tests. It is not straightforward to report scores to the examinees whose data are incomplete due to technical difficulties. Such reporting essentially involves imputation of missing scores. In this paper, a simulation study based on data…
Descriptors: Data Analysis, Scores, Educational Assessment, Educational Testing
Cui, Zhongmin – Educational Measurement: Issues and Practice, 2020
Thanks to COVID-19, schools were closed and tests were canceled. The result is that we may not see test-taking data typically seen before. For some analyses, sample sizes may not meet the minimum requirement. For others, the sample of test-takers may be different from previous years. In some situation, there may be no data at all. What do we do in…
Descriptors: Testing, Sample Size, Data Collection, COVID-19
Baron, Patricia; Sireci, Stephen G.; Slater, Sharon C. – Educational Measurement: Issues and Practice, 2021
Since the No Child Left Behind Act (No Child Left Behind [NCLB], 2001) was enacted, the Bookmark method has been used in many state standard setting studies (Karantonis and Sireci; Zieky, Perie, and Livingston). The purpose of the current study is to evaluate the criticism that when panelists are presented with data during the Bookmark standard…
Descriptors: State Standards, Standard Setting, Evaluators, Training
Cui, Zhongmin – Educational Measurement: Issues and Practice, 2021
Commonly used machine learning applications seem to relate to big data. This article provides a gentle review of machine learning and shows why machine learning can be applied to small data too. An example of applying machine learning to screen irregularity reports is presented. In the example, the support vector machine and multinomial naïve…
Descriptors: Artificial Intelligence, Man Machine Systems, Data, Bayesian Statistics

Peer reviewed
Direct link
