Publication Date
In 2025 | 9 |
Since 2024 | 23 |
Since 2021 (last 5 years) | 55 |
Since 2016 (last 10 years) | 139 |
Since 2006 (last 20 years) | 495 |
Descriptor
Evaluation Methods | 707 |
Measurement | 707 |
Measurement Techniques | 152 |
Foreign Countries | 125 |
Student Evaluation | 123 |
Educational Assessment | 121 |
Models | 113 |
Program Evaluation | 88 |
Evaluation Criteria | 83 |
Psychometrics | 81 |
Comparative Analysis | 77 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
Canada | 14 |
Arizona | 12 |
Australia | 11 |
United Kingdom | 10 |
California | 9 |
Germany | 8 |
China | 7 |
United States | 7 |
North Carolina | 6 |
Florida | 5 |
Italy | 5 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
David Goretzko; Karik Siemund; Philipp Sterner – Educational and Psychological Measurement, 2024
Confirmatory factor analyses (CFA) are often used in psychological research when developing measurement models for psychological constructs. Evaluating CFA model fit can be quite challenging, as tests for exact model fit may focus on negligible deviances, while fit indices cannot be interpreted absolutely without specifying thresholds or cutoffs.…
Descriptors: Factor Analysis, Goodness of Fit, Psychological Studies, Measurement
Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024
Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
Hongwen Guo; Matthew S. Johnson; Luis Saldivia; Michelle Worthington; Kadriye Ercikan – ETS Research Institute, 2025
ETS scientists developed a human-centered AI (HAI) framework that combines data on how students interact with assessments--such as task navigation and time spent--with their performance, providing deeper insights into student performance in large-scale assessments.
Descriptors: Artificial Intelligence, Student Evaluation, Evaluation Methods, Measurement
Francis L. Huang – Large-scale Assessments in Education, 2024
The use of large-scale assessments (LSAs) in education has grown in the past decade though analysis of LSAs using multilevel models (MLMs) using R has been limited. A reason for its limited use may be due to the complexity of incorporating both plausible values and weighted analyses in the multilevel analyses of LSA data. We provide additional…
Descriptors: Hierarchical Linear Modeling, Evaluation Methods, Educational Assessment, Data Analysis
Jamie M. Holloway; Heewon L. Gray; Acadia W. Buro; Jodi Thomas; Rachel Sauls; Allison M. Howard – Review Journal of Autism and Developmental Disorders, 2024
This review aimed to identify measurement tools to assess dietary intake and physical activity (PA) among individuals with autism spectrum disorder (ASD) and describe the evidence of validity and availability of each tool. Searches were conducted in PubMed, Embase, CINAHL, and PsycINFO using keywords for ASD, PA, diet, and assessment/measurement…
Descriptors: Dietetics, Eating Habits, Nutrition, Physical Activities
Lotfi Simon Kerzabi – ProQuest LLC, 2021
Monte Carlo methods are an accepted methodology in regards to generation critical values for a Maximum test. The same methods are also applicable to the evaluation of the robustness of the new created test. A table of critical values was created, and the robustness of the new maximum test was evaluated for five different distributions. Robustness…
Descriptors: Data, Monte Carlo Methods, Testing, Evaluation Research
Yanhua Xu; Ziqing Ou; Zhiting Wu; Yating Lin; Wei Zeng; Jiayan Yang; Jialu Li; Mengfan Shan; Yunqin Li – Journal of Geography, 2024
Measurement is significant in geospatial thinking research. This study evaluated the geospatial thinking of pre-service geography teachers through an orienteering design tasks and subjective evaluation methods. Moreover, it analyzed their level of geospatial thinking, difficulty of evaluation indicators and rater severity with the Many-Faceted…
Descriptors: Foreign Countries, Undergraduate Students, Preservice Teachers, Geography Instruction
Aneesha Badrinarayan – Learning Policy Institute, 2025
Since the rise of state assessments whose primary function is to yield scores that can be used to compare schools and groups of students, most states have developed their state assessment programs under the assumption that either: (a) state tests are not intended to meaningfully shape instruction, or (b), if they are, the information provided in…
Descriptors: Measurement, Student Evaluation, Evaluation Methods, Relevance (Education)
Gutiérrez, Gabriel; Lupton, Ruth; Carrasco, Alejandro; Rasse, Alejandra – Journal of Education Policy, 2023
The process of privatising services historically provided by the state has blurred the boundaries between what is considered to be 'private' and 'public'. However, few efforts have been made in the educational arena to develop tools to measure this process. Most of the previous research has relied on narrow definitions about what is private and…
Descriptors: Public Education, Private Education, Foreign Countries, Measurement
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Causes of Nonlinear Metrics in Item Response Theory Models and Implications for Educational Research
Xiangyi Liao – ProQuest LLC, 2024
Educational research outcomes frequently rely on an assumption that measurement metrics have interval-level properties. While most investigators know enough to be suspicious of interval-level claims, and in some cases even question their findings given such doubts, there is a lack of understanding regarding the measurement conditions that create…
Descriptors: Item Response Theory, Educational Research, Measurement, Evaluation Methods
Fouché, Ilse – Applied Linguistics, 2023
This article, located in the discipline of academic literacy studies, draws upon the fields of critical realism, design research, and evaluation studies. It reports on the validation of a flexible evaluation design for assessing the impact of academic literacy interventions. The design was validated in two ways. Firstly, through a process of…
Descriptors: Foreign Countries, Intervention, Literacy Education, Feedback (Response)
He, Yinhong – Journal of Educational Measurement, 2023
Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…
Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods
Whitney Sivils-Sawyer – ProQuest LLC, 2022
The purpose of this research was to create a Culture of Assessment Instrument (Appendix F) that will allow education program provider (EPP) assessment leaders to measure the assessment culture within their program. General Systems Theory was the theoretical framework as the foundation of this research. Using a modified Delphi panel of assessment…
Descriptors: Teacher Education, Educational Assessment, Leadership, Culture
Melina Verger; Chunyang Fan; Sébastien Lallé; François Bouchet; Vanda Luengo – Journal of Educational Data Mining, 2024
Predictive student models are increasingly used in learning environments due to their ability to enhance educational outcomes and support stakeholders in making informed decisions. However, predictive models can be biased and produce unfair outcomes, leading to potential discrimination against certain individuals and harmful long-term…
Descriptors: Algorithms, Prediction, Bias, Classification