ERIC - Search Results

Publication Date

In 2025	3
Since 2024	9
Since 2021 (last 5 years)	20
Since 2016 (last 10 years)	47
Since 2006 (last 20 years)	119

Descriptor

Evaluation Methods	80
Measurement Techniques	55
Measurement	48
Models	40
Psychometrics	36
Evaluation Problems	35
Item Response Theory	29
Educational Assessment	26
Methods	25
Test Validity	25
Comparative Analysis	23
Classification	22
Test Construction	22
Test Items	22
Evaluation Research	19
Student Evaluation	18
Educational Testing	16
Teaching Methods	16
Testing Problems	16
Validity	16
Evaluation Criteria	15
Monte Carlo Methods	15
Teacher Evaluation	15
Bibliometrics	14
Mathematics Education	14
More ▼

Source

Measurement:…

130

Publication Type

Journal Articles	130
Opinion Papers	72
Reports - Research	36
Reports - Evaluative	32
Reports - Descriptive	6
Information Analyses	2
Book/Product Reviews	1

Education Level

Elementary Secondary Education	27
Higher Education	14
Postsecondary Education	10
Elementary Education	5
Adult Education	2
Grade 11	1
Grade 5	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Middle Schools	1
More ▼

Audience

Practitioners	2
Researchers	1

Location

Germany	3
United Kingdom (England)	3
United States	3
California	2
United Kingdom	2
United Kingdom (Wales)	2
Asia	1
Australia	1
China	1
Hong Kong	1
South Korea	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

Advanced Placement…	2
SAT (College Admission Test)	2
National Assessment of…	1
Program for International…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 130 results Save | Export

Uncertainty in Artificial Neural Network Models: Monte-Carlo Simulations beyond the GUM Boundaries

Peer reviewed

Direct link

A. M. Sadek; Fahad Al-Muhlaki – Measurement: Interdisciplinary Research and Perspectives, 2024

In this study, the accuracy of the artificial neural network (ANN) was assessed considering the uncertainties associated with the randomness of the data and the lack of learning. The Monte-Carlo algorithm was applied to simulate the randomness of the input variables and evaluate the output distribution. It has been shown that under certain…

Descriptors: Monte Carlo Methods, Accuracy, Artificial Intelligence, Guidelines

Improving Cross-Cultural Psychometric Scales: A Focus on Readability

Peer reviewed

Direct link

Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2025

This study argues that readability formulas should be used as one measure of linguistic equivalence when adapting psychometric scales from one language to another. Assuming that the psychological structure being measured was not changed, it was observed that calculated readability levels and interpretations were different for two different…

Descriptors: Psychometrics, Evaluation Methods, Readability, Media Adaptation

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Estimation of Finite Population Variance under Stratified Sampling in the Presence of Measurement Errors

Peer reviewed

Direct link

Abdul Haq; Muhammad Usman; Manzoor Khan – Measurement: Interdisciplinary Research and Perspectives, 2024

Measurement errors may significantly distort the properties of an estimator. In this paper, estimators of the finite population variance using the information on first and second raw moments of the study variable are developed under stratified random sampling that incorporate the variance of a measurement error component. Additionally, combined…

Descriptors: Sampling, Error of Measurement, Evaluation Methods, Statistical Bias

Explainable Machine Learning for Credit Risk Management When Features Are Dependent

Peer reviewed

Direct link

Thanh Thuy Do; Golnoosh Babaei; Paolo Pagnottoni – Measurement: Interdisciplinary Research and Perspectives, 2024

Complex Machine Learning (ML) models used to support decision-making in peer-to-peer (P2P) lending often lack clear, accurate, and interpretable explanations. While the game-theoretic concept of Shapley values and its computationally efficient variant Kernel SHAP may be employed for this aim, similarly to other existing methods, the latter makes…

Descriptors: Artificial Intelligence, Risk Management, Credit (Finance), Prediction

A New Sampling Scheme for an Improved Monitoring of the Process Mean

Peer reviewed

Direct link

Abdul Haq – Measurement: Interdisciplinary Research and Perspectives, 2024

This article introduces an innovative sampling scheme, the median sampling (MS), utilizing individual observations over time to efficiently estimate the mean of a process characterized by a symmetric (non-uniform) probability distribution. The mean estimator based on MS is not only unbiased but also boasts enhanced precision compared to its simple…

Descriptors: Sampling, Innovation, Computation, Probability

Quantifying and Estimating Regression to the Mean Effect for Bivariate Beta-Binomial Distribution

Peer reviewed

Direct link

Aimel Zafar; Manzoor Khan; Muhammad Yousaf – Measurement: Interdisciplinary Research and Perspectives, 2024

Subjects with initially extreme observations upon remeasurement are found closer to the population mean. This tendency of observations toward the mean is called regression to the mean (RTM) and can make natural variation in repeated data look like real change. Studies, where subjects are selected on a baseline criterion, should be guarded against…

Descriptors: Measurement, Regression (Statistics), Statistical Distributions, Intervention

Identifying Response Styles Using Person Fit Analysis and Response-Styles Models

Peer reviewed

Direct link

Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023

In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…

Descriptors: Goodness of Fit, Responses, Likert Scales, Models

A Validation Study of the Extended Relevance Scale Using the D3mirt Package for R

Peer reviewed

Direct link

Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025

This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…

Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism

Evaluating Six Approaches to Handling Zero-Frequency Scores under Equipercentile Equating

Peer reviewed

Direct link

Sun, Ting; Kim, Stella Yun – Measurement: Interdisciplinary Research and Perspectives, 2021

In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in…

Descriptors: Equated Scores, Test Length, Sample Size, Methods

Data Acquiring System for Gas Turbine Engine's Dynamic Performance; Build and Validate

Peer reviewed

Direct link

Mostafa M. Samy; Mohamed A. Metwally; Mahmoud Ashry; Wael M. Elmayyah – Measurement: Interdisciplinary Research and Perspectives, 2025

Gas Turbine Engines (GTE) have the highest power-to-weight ratio among Internal Combustion Engines (ICE). Its modularity and ability to utilize various types of fuel make it highly recommended in power plants, naval transportation, and, of course, the most equipped in aviation. The lack of GTEs' real data is increasing a recognized need for…

Descriptors: Engines, Power Technology, Data Collection, Data Interpretation

Computerized Multistage Testing: Principles, Designs and Practices with R

Peer reviewed

Direct link

Yigiter, Mahmut Sami; Dogan, Nuri – Measurement: Interdisciplinary Research and Perspectives, 2023

In recent years, Computerized Multistage Testing (MST), with their versatile benefits, have found themselves a wide application in large scale assessments and have increased their popularity. The fact that forms can be made ready before the exam application, such as a linear test, and that they can be adapted according to the test taker's ability…

Descriptors: Programming Languages, Monte Carlo Methods, Computer Assisted Testing, Test Format

Validation and Implementation of Customer Classification System Using Machine Learning

Peer reviewed

Direct link

Hyemin Yoon; HyunJin Kim; Sangjin Kim – Measurement: Interdisciplinary Research and Perspectives, 2024

We have maintained the customer grade system that is being implemented to customers with excellent performance through customer segmentation for years. Currently, financial institutions that operate the customer grade system provide similar services based on the score calculation criteria, but the score calculation criteria vary from the financial…

Descriptors: Classification, Artificial Intelligence, Prediction, Decision Making

An Approach to Test Equating under the Latent "D"-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021

This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study under a 3 × 3 design by two conditions: group ability at three levels and test difficulty at three levels. The results for…

Descriptors: Equated Scores, Scoring, Test Items, Accuracy

Misclassification Error, Binary Regression Bias, and Reliability in Multidimensional Poverty Measurement: An Estimation Approach Based on Bayesian Modelling

Peer reviewed

Direct link

Najera, Hector – Measurement: Interdisciplinary Research and Perspectives, 2023

Measurement error affects the quality of population orderings of an index and, hence, increases the misclassification of the poor and the non-poor groups and affects statistical inferences from binary regression models. Hence, the conclusions about the extent, profile, and distribution of poverty are likely to be misleading. However, the size and…

Descriptors: Poverty, Error of Measurement, Classification, Statistical Inference

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Hill, Heather C.	5
Engelhard, George, Jr.	4
Kane, Michael T.	3
Mroch, Andrew A.	3
Ripkey, Douglas R.	3
Suh, Youngsuk	3
Abdul Haq	2
Alonzo, Alicia C.	2
Ames, Allison J.	2
Black, Paul	2
Blunk, Merrie	2
Fisher, William P., Jr.	2
Gan, Zhengdong	2
Goffney, Imani Masters	2
Guyon, Hervé	2
Humphry, Stephen M.	2
Iceland, John	2
Kane, Michael	2
Kyngdon, Andrew	2
Leventhal, Brian C.	2
Luo, Yong	2
Manzoor Khan	2
Schilling, Stephen G.	2
Schumacker, Randall	2
Sullivan, Rubye K.	2
More ▼