Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 38 |
| Since 2007 (last 20 years) | 154 |
Descriptor
| Evaluation Methods | 182 |
| Statistical Analysis | 182 |
| Feedback (Response) | 80 |
| Item Response Theory | 61 |
| Foreign Countries | 51 |
| Student Evaluation | 34 |
| Models | 33 |
| Responses | 27 |
| Comparative Analysis | 26 |
| Qualitative Research | 24 |
| Questionnaires | 22 |
Author
| Cai, Li | 2 |
| DeMars, Christine E. | 2 |
| Dorans, Neil J. | 2 |
| Finch, Holmes | 2 |
| Fujimoto, Ken A. | 2 |
| Gordon, Rachel A. | 2 |
| Haberman, Shelby J. | 2 |
| Hofer, Kerry G. | 2 |
| Huxham, Mark | 2 |
| Kaestner, Robert | 2 |
| Kelecioglu, Hülya | 2 |
Audience
| Researchers | 4 |
| Students | 2 |
| Administrators | 1 |
| Teachers | 1 |
Location
| Florida | 4 |
| New Zealand | 4 |
| United Kingdom | 4 |
| Germany | 3 |
| Hong Kong | 3 |
| Iowa | 3 |
| Maryland | 3 |
| Pennsylvania | 3 |
| Australia | 2 |
| Israel | 2 |
| Italy | 2 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
| Race to the Top | 1 |
Smith, Ben O.; White, Dustin R.; Wagner, Jamie; Kuzyk, Patricia; Prera, Alex – Studies in Higher Education, 2023
Student Evaluations of Teaching (SETs) are an integral part of evaluating course outcomes. They are routinely used to evaluate teaching quality for the purposes of reappointment, promotion, and tenure (RPT), annual review, and the rehiring of adjunct faculty and lecturers. These evaluations are often based almost entirely on the mean or proportion…
Descriptors: Student Evaluation of Teacher Performance, Statistical Analysis, Response Rates (Questionnaires), Evaluation Methods
Olanipekun, Oluwaseun L.; Zhao, JuLong; Wang, Rongdong; Sedory, Stephen A.; Singh, Sarjinder – Sociological Methods & Research, 2023
In carrying out surveys involving sensitive characteristics, randomized response models have been considered among the best techniques since they provide the maximum privacy protection to the respondents and procure honest responses. Over the years, researchers have carried out studies on the estimation of proportions of the population possessing…
Descriptors: Correlation, Smoking, Thinking Skills, Health Behavior
Haberman, Shelby J. – Journal of Educational Measurement, 2020
Examples of the impact of statistical theory on assessment practice are provided from the perspective of a statistician trained in theoretical statistics who began to work on assessments. Goodness of fit of item-response models is examined in terms of restricted likelihood-ratio tests and generalized residuals. Minimum discriminant information…
Descriptors: Statistics, Goodness of Fit, Item Response Theory, Statistical Analysis
Raykov, Tenko; Pusic, Martin – Educational and Psychological Measurement, 2023
This note is concerned with evaluation of location parameters for polytomous items in multiple-component measuring instruments. A point and interval estimation procedure for these parameters is outlined that is developed within the framework of latent variable modeling. The method permits educational, behavioral, biomedical, and marketing…
Descriptors: Item Analysis, Measurement Techniques, Computer Software, Intervals
Fu, Qiang; Guo, Xin; Land, Kenneth C. – Sociological Methods & Research, 2020
Count responses with grouping and right censoring have long been used in surveys to study a variety of behaviors, status, and attitudes. Yet grouping or right-censoring decisions of count responses still rely on arbitrary choices made by researchers. We develop a new method for evaluating grouping and right-censoring decisions of count responses…
Descriptors: Surveys, Artificial Intelligence, Evaluation Methods, Probability
Mousavi, Amin; Schmidt, Matthew; Squires, Vicki; Wilson, Ken – International Journal of Artificial Intelligence in Education, 2021
Greer and Mark's (2016) paper suggested and reviewed different methods, such as propensity score matching, for evaluating the effectiveness of intelligent tutoring systems. The current study aimed to assess the effectiveness of an automated personalized feedback intervention implemented via the Student Advice Recommender Agent (SARA) in a first-year…
Descriptors: Automation, Feedback (Response), Intervention, College Freshmen
Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020
The residual gain score has been of historical interest, and its percentile rank has been of interest more recently given its close correspondence to the popular Student Growth Percentile. However, these estimators suffer from low accuracy and systematic bias (bias conditional on prior latent achievement). This article explores three…
Descriptors: Accuracy, Student Evaluation, Measurement Techniques, Evaluation Methods
Vriens, Ingrid; Moors, Guy; Gelissen, John; Vermunt, Jeroen K. – Sociological Methods & Research, 2017
Measuring values in sociological research sometimes involves the use of ranking data. A disadvantage of a ranking assignment is that the order in which the items are presented might influence the choice preferences of respondents regardless of the content being measured. The standard procedure to rule out such effects is to randomize the order of…
Descriptors: Evaluation Methods, Social Science Research, Sociology, Structural Equation Models
To, Jessica; Panadero, Ernesto; Carless, David – Assessment & Evaluation in Higher Education, 2022
The analysis of exemplars of different quality is a potentially powerful tool in enabling students to understand assessment expectations and appreciate academic standards. Through a systematic review methodology, this paper synthesises exemplar-based research designs, exemplar implementation and the educational effects of exemplars. The review of…
Descriptors: Research Design, Scoring Rubrics, Peer Evaluation, Self Evaluation (Individuals)
Hyunsuk Han – ProQuest LLC, 2018
In Huggins-Manley & Han (2017), it was shown that WLSMV global model fit indices used in structural equation modeling practice are sensitive to person parameter estimate RMSE and item difficulty parameter estimate RMSE that result from local dependence in 2-PL IRT models, particularly when conditioning on number of test items and sample size.…
Descriptors: Models, Statistical Analysis, Item Response Theory, Evaluation Methods
Craig, Brandon – ProQuest LLC, 2017
The purpose of this study was to determine if using a multistage approach for the empirical selection of anchor items would lead to more accurate DIF detection rates than the anchor selection methods proposed by Kopf, Zeileis, & Strobl (2015b). A simulation study was conducted in which the sample size, percentage of DIF, and balance of DIF…
Descriptors: Simulation, Sample Size, Item Response Theory, Item Analysis
Tarray, Tanveer A.; Singh, Housila P.; Yan, Zaizai – Sociological Methods & Research, 2017
This article addresses the problem of estimating the proportion π_S of the population belonging to a sensitive group using an optional randomized response technique in stratified sampling based on the Mangat model that has proportional and Neyman allocation and larger gain in efficiency. Numerically, it is found that the suggested model is…
Descriptors: Models, Efficiency, Sampling, Research Problems
Berg, Craig; Boote, Stacy – International Journal of Science and Mathematics Education, 2017
Prior graphing research has demonstrated that clinical interviews and free-response instruments produce very different results than multiple-choice instruments, indicating potential validity problems when using multiple-choice instruments to assess graphing skills (Berg & Smith in "Science Education," 78(6), 527-554, 1994). Extending…
Descriptors: Graphs, Multiple Choice Tests, Statistical Analysis, Secondary School Students
Ostrow, Korinn; Donnelly, Christopher; Heffernan, Neil – International Educational Data Mining Society, 2015
As adaptive tutoring systems grow increasingly popular for the completion of classwork and homework, it is crucial to assess the manner in which students are scored within these platforms. The majority of systems, including ASSISTments, return the binary correctness of a student's first attempt at solving each problem. Yet for many teachers,…
Descriptors: Intelligent Tutoring Systems, Scoring, Testing, Credits
Dimitrov, Dimiter M. – Measurement and Evaluation in Counseling and Development, 2017
This article offers an approach to examining differential item functioning (DIF) under its item response theory (IRT) treatment in the framework of confirmatory factor analysis (CFA). The approach is based on integrating IRT- and CFA-based testing of DIF and using bias-corrected bootstrap confidence intervals with a syntax code in Mplus.
Descriptors: Test Bias, Item Response Theory, Factor Analysis, Evaluation Methods

