Publication Date
In 2025 | 3 |
Since 2024 | 9 |
Since 2021 (last 5 years) | 20 |
Since 2016 (last 10 years) | 47 |
Since 2006 (last 20 years) | 119 |
Descriptor
Evaluation Methods | 80 |
Measurement Techniques | 55 |
Measurement | 48 |
Models | 40 |
Psychometrics | 36 |
Evaluation Problems | 35 |
Item Response Theory | 29 |
Educational Assessment | 26 |
Methods | 25 |
Test Validity | 25 |
Comparative Analysis | 23 |
More ▼ |
Source
Measurement:… | 130 |
Author
Hill, Heather C. | 5 |
Engelhard, George, Jr. | 4 |
Kane, Michael T. | 3 |
Mroch, Andrew A. | 3 |
Ripkey, Douglas R. | 3 |
Suh, Youngsuk | 3 |
Abdul Haq | 2 |
Alonzo, Alicia C. | 2 |
Ames, Allison J. | 2 |
Black, Paul | 2 |
Blunk, Merrie | 2 |
More ▼ |
Publication Type
Journal Articles | 130 |
Opinion Papers | 72 |
Reports - Research | 36 |
Reports - Evaluative | 32 |
Reports - Descriptive | 6 |
Information Analyses | 2 |
Book/Product Reviews | 1 |
Education Level
Elementary Secondary Education | 27 |
Higher Education | 14 |
Postsecondary Education | 10 |
Elementary Education | 5 |
Adult Education | 2 |
Grade 11 | 1 |
Grade 5 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Intermediate Grades | 1 |
Middle Schools | 1 |
More ▼ |
Audience
Practitioners | 2 |
Researchers | 1 |
Location
Germany | 3 |
United Kingdom (England) | 3 |
United States | 3 |
California | 2 |
United Kingdom | 2 |
United Kingdom (Wales) | 2 |
Asia | 1 |
Australia | 1 |
China | 1 |
Hong Kong | 1 |
South Korea | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Race to the Top | 1 |
Assessments and Surveys
Advanced Placement… | 2 |
SAT (College Admission Test) | 2 |
National Assessment of… | 1 |
Program for International… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
A. M. Sadek; Fahad Al-Muhlaki – Measurement: Interdisciplinary Research and Perspectives, 2024
In this study, the accuracy of the artificial neural network (ANN) was assessed considering the uncertainties associated with the randomness of the data and the lack of learning. The Monte-Carlo algorithm was applied to simulate the randomness of the input variables and evaluate the output distribution. It has been shown that under certain…
Descriptors: Monte Carlo Methods, Accuracy, Artificial Intelligence, Guidelines
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2025
This study argues that readability formulas should be used as one measure of linguistic equivalence when adapting psychometric scales from one language to another. Assuming that the psychological structure being measured was not changed, it was observed that calculated readability levels and interpretations were different for two different…
Descriptors: Psychometrics, Evaluation Methods, Readability, Media Adaptation
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
Abdul Haq; Muhammad Usman; Manzoor Khan – Measurement: Interdisciplinary Research and Perspectives, 2024
Measurement errors may significantly distort the properties of an estimator. In this paper, estimators of the finite population variance using the information on first and second raw moments of the study variable are developed under stratified random sampling that incorporate the variance of a measurement error component. Additionally, combined…
Descriptors: Sampling, Error of Measurement, Evaluation Methods, Statistical Bias
Thanh Thuy Do; Golnoosh Babaei; Paolo Pagnottoni – Measurement: Interdisciplinary Research and Perspectives, 2024
Complex Machine Learning (ML) models used to support decision-making in peer-to-peer (P2P) lending often lack clear, accurate, and interpretable explanations. While the game-theoretic concept of Shapley values and its computationally efficient variant Kernel SHAP may be employed for this aim, similarly to other existing methods, the latter makes…
Descriptors: Artificial Intelligence, Risk Management, Credit (Finance), Prediction
Abdul Haq – Measurement: Interdisciplinary Research and Perspectives, 2024
This article introduces an innovative sampling scheme, the median sampling (MS), utilizing individual observations over time to efficiently estimate the mean of a process characterized by a symmetric (non-uniform) probability distribution. The mean estimator based on MS is not only unbiased but also boasts enhanced precision compared to its simple…
Descriptors: Sampling, Innovation, Computation, Probability
Aimel Zafar; Manzoor Khan; Muhammad Yousaf – Measurement: Interdisciplinary Research and Perspectives, 2024
Subjects with initially extreme observations upon remeasurement are found closer to the population mean. This tendency of observations toward the mean is called regression to the mean (RTM) and can make natural variation in repeated data look like real change. Studies, where subjects are selected on a baseline criterion, should be guarded against…
Descriptors: Measurement, Regression (Statistics), Statistical Distributions, Intervention
Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023
In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…
Descriptors: Goodness of Fit, Responses, Likert Scales, Models
Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025
This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…
Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism
Sun, Ting; Kim, Stella Yun – Measurement: Interdisciplinary Research and Perspectives, 2021
In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in…
Descriptors: Equated Scores, Test Length, Sample Size, Methods
Mostafa M. Samy; Mohamed A. Metwally; Mahmoud Ashry; Wael M. Elmayyah – Measurement: Interdisciplinary Research and Perspectives, 2025
Gas Turbine Engines (GTE) have the highest power-to-weight ratio among Internal Combustion Engines (ICE). Its modularity and ability to utilize various types of fuel make it highly recommended in power plants, naval transportation, and, of course, the most equipped in aviation. The lack of GTEs' real data is increasing a recognized need for…
Descriptors: Engines, Power Technology, Data Collection, Data Interpretation
Yigiter, Mahmut Sami; Dogan, Nuri – Measurement: Interdisciplinary Research and Perspectives, 2023
In recent years, Computerized Multistage Testing (MST), with their versatile benefits, have found themselves a wide application in large scale assessments and have increased their popularity. The fact that forms can be made ready before the exam application, such as a linear test, and that they can be adapted according to the test taker's ability…
Descriptors: Programming Languages, Monte Carlo Methods, Computer Assisted Testing, Test Format
Hyemin Yoon; HyunJin Kim; Sangjin Kim – Measurement: Interdisciplinary Research and Perspectives, 2024
We have maintained the customer grade system that is being implemented to customers with excellent performance through customer segmentation for years. Currently, financial institutions that operate the customer grade system provide similar services based on the score calculation criteria, but the score calculation criteria vary from the financial…
Descriptors: Classification, Artificial Intelligence, Prediction, Decision Making
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021
This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study under a 3 × 3 design by two conditions: group ability at three levels and test difficulty at three levels. The results for…
Descriptors: Equated Scores, Scoring, Test Items, Accuracy
Najera, Hector – Measurement: Interdisciplinary Research and Perspectives, 2023
Measurement error affects the quality of population orderings of an index and, hence, increases the misclassification of the poor and the non-poor groups and affects statistical inferences from binary regression models. Hence, the conclusions about the extent, profile, and distribution of poverty are likely to be misleading. However, the size and…
Descriptors: Poverty, Error of Measurement, Classification, Statistical Inference