Publication Date
| In 2026 | 0 |
| Since 2025 | 13 |
| Since 2022 (last 5 years) | 47 |
| Since 2017 (last 10 years) | 121 |
| Since 2007 (last 20 years) | 489 |
Descriptor
| Evaluation Methods | 711 |
| Measurement | 711 |
| Measurement Techniques | 152 |
| Foreign Countries | 128 |
| Student Evaluation | 123 |
| Educational Assessment | 121 |
| Models | 113 |
| Program Evaluation | 88 |
| Evaluation Criteria | 83 |
| Psychometrics | 81 |
| Comparative Analysis | 77 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Location
| Canada | 14 |
| Arizona | 12 |
| Australia | 11 |
| United Kingdom | 10 |
| California | 9 |
| Germany | 8 |
| China | 7 |
| United States | 7 |
| North Carolina | 6 |
| Florida | 5 |
| Italy | 5 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
He, Yinhong – Journal of Educational Measurement, 2023
Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…
Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods
Hongwen Guo; Matthew S. Johnson; Luis Saldivia; Michelle Worthington; Kadriye Ercikan – ETS Research Institute, 2025
ETS scientists developed a human-centered AI (HAI) framework that combines data on how students interact with assessments--such as task navigation and time spent--with their performance, providing deeper insights into student performance in large-scale assessments.
Descriptors: Artificial Intelligence, Student Evaluation, Evaluation Methods, Measurement
Olesya Kisselev; Mihail Kopotev; Anton Vakhranev – Modern Language Journal, 2025
Lexical proficiency in a second language (L2) has long been effectively assessed through the measurement of various lexical indices, or textual characteristics that act as observable indicators of such conceptual categories as lexical richness, diversity, sophistication, and fluency. While many studies have established links between these lexical…
Descriptors: Language Proficiency, Russian, Second Language Learning, Second Language Instruction
Marian Small – National Council of Teachers of Mathematics, 2025
This extensively updated teaching resource provides over 100 engaging, full-color visuals with explanations of how they can be used to stimulate mathematics learning, to explain mathematical concepts, and to assess students' mathematical understanding in grades K-8. Readers are provided with a strong mathematical background, downloadable copies of…
Descriptors: Mathematical Concepts, Teaching Methods, Mathematics Instruction, Kindergarten
Guy B. deBrun – Journal of Outdoor Recreation, Education, and Leadership, 2025
Discussions of what it means to be an effective outdoor leader are common in outdoor education literature (Martin et al., 2025; Smith, 2021). Research has identified core competencies (Martin et al., 2025), conceptual frameworks (Pomfret et al., 2023), and course curricula/qualifications for effective leadership (Baker & O'Brien, 2019; Seaman…
Descriptors: Outdoor Leadership, Leadership Effectiveness, Evaluation Methods, Scoring Rubrics
Francis L. Huang – Large-scale Assessments in Education, 2024
The use of large-scale assessments (LSAs) in education has grown in the past decade though analysis of LSAs using multilevel models (MLMs) using R has been limited. A reason for its limited use may be due to the complexity of incorporating both plausible values and weighted analyses in the multilevel analyses of LSA data. We provide additional…
Descriptors: Hierarchical Linear Modeling, Evaluation Methods, Educational Assessment, Data Analysis
Jamie M. Holloway; Heewon L. Gray; Acadia W. Buro; Jodi Thomas; Rachel Sauls; Allison M. Howard – Review Journal of Autism and Developmental Disorders, 2024
This review aimed to identify measurement tools to assess dietary intake and physical activity (PA) among individuals with autism spectrum disorder (ASD) and describe the evidence of validity and availability of each tool. Searches were conducted in PubMed, Embase, CINAHL, and PsycINFO using keywords for ASD, PA, diet, and assessment/measurement…
Descriptors: Dietetics, Eating Habits, Nutrition, Physical Activities
Richards, Adam S. – Communication Teacher, 2021
Course: Communication Research Methods. Objectives: This activity provides students with an experiential introduction to measurement theory and the methods for assessing measurement reliability. First, multiple measurements of a person's height are interpreted according to classical test theory. Second, the measurement of human height is used as…
Descriptors: Body Height, Measurement, Communication Research, Test Theory
Lotfi Simon Kerzabi – ProQuest LLC, 2021
Monte Carlo methods are an accepted methodology in regards to generation critical values for a Maximum test. The same methods are also applicable to the evaluation of the robustness of the new created test. A table of critical values was created, and the robustness of the new maximum test was evaluated for five different distributions. Robustness…
Descriptors: Data, Monte Carlo Methods, Testing, Evaluation Research
Yanhua Xu; Ziqing Ou; Zhiting Wu; Yating Lin; Wei Zeng; Jiayan Yang; Jialu Li; Mengfan Shan; Yunqin Li – Journal of Geography, 2024
Measurement is significant in geospatial thinking research. This study evaluated the geospatial thinking of pre-service geography teachers through an orienteering design tasks and subjective evaluation methods. Moreover, it analyzed their level of geospatial thinking, difficulty of evaluation indicators and rater severity with the Many-Faceted…
Descriptors: Foreign Countries, Undergraduate Students, Preservice Teachers, Geography Instruction
Aneesha Badrinarayan – Learning Policy Institute, 2025
Since the rise of state assessments whose primary function is to yield scores that can be used to compare schools and groups of students, most states have developed their state assessment programs under the assumption that either: (a) state tests are not intended to meaningfully shape instruction, or (b), if they are, the information provided in…
Descriptors: Measurement, Student Evaluation, Evaluation Methods, Relevance (Education)
Gutiérrez, Gabriel; Lupton, Ruth; Carrasco, Alejandro; Rasse, Alejandra – Journal of Education Policy, 2023
The process of privatising services historically provided by the state has blurred the boundaries between what is considered to be 'private' and 'public'. However, few efforts have been made in the educational arena to develop tools to measure this process. Most of the previous research has relied on narrow definitions about what is private and…
Descriptors: Public Education, Private Education, Foreign Countries, Measurement
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
David Goretzko; Karik Siemund; Philipp Sterner – Educational and Psychological Measurement, 2024
Confirmatory factor analyses (CFA) are often used in psychological research when developing measurement models for psychological constructs. Evaluating CFA model fit can be quite challenging, as tests for exact model fit may focus on negligible deviances, while fit indices cannot be interpreted absolutely without specifying thresholds or cutoffs.…
Descriptors: Factor Analysis, Goodness of Fit, Psychological Studies, Measurement
Causes of Nonlinear Metrics in Item Response Theory Models and Implications for Educational Research
Xiangyi Liao – ProQuest LLC, 2024
Educational research outcomes frequently rely on an assumption that measurement metrics have interval-level properties. While most investigators know enough to be suspicious of interval-level claims, and in some cases even question their findings given such doubts, there is a lack of understanding regarding the measurement conditions that create…
Descriptors: Item Response Theory, Educational Research, Measurement, Evaluation Methods

Peer reviewed
Direct link
