NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1,486 to 1,500 of 3,316 results Save | Export
Andrews, Benjamin James – ProQuest LLC, 2011
The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…
Descriptors: Test Format, Advanced Placement, Simulation, True Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Gardner, John – Oxford Review of Education, 2013
Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…
Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries
Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam – Educational Testing Service, 2010
This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…
Descriptors: Equated Scores, Scoring, Responses, Test Items
Lengh, Carolyn J. – ProQuest LLC, 2010
This study compares the dependability of four classroom assessment scoring methods. Generalizability theory (G) and alternative decision (D) are used to measure the results of students' classroom assessment scores and compare the results of the four scoring methods on variability of rater by person variance and the level of G and D coefficients…
Descriptors: Generalizability Theory, Scoring, Social Studies, Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Yang, Manshu; Chow, Sy-Miin – Psychometrika, 2010
Facial electromyography (EMG) is a useful physiological measure for detecting subtle affective changes in real time. A time series of EMG data contains bursts of electrical activity that increase in magnitude when the pertinent facial muscles are activated. Whereas previous methods for detecting EMG activation are often based on deterministic or…
Descriptors: Test Bias, Error of Measurement, Human Body, Diagnostic Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Monahan, Patrick O.; Ankenmann, Robert D. – Applied Psychological Measurement, 2010
When the matching score is either less than perfectly reliable or not a sufficient statistic for determining latent proficiency in data conforming to item response theory (IRT) models, Type I error (TIE) inflation may occur for the Mantel-Haenszel (MH) procedure or any differential item functioning (DIF) procedure that matches on summed-item…
Descriptors: Error of Measurement, Item Response Theory, Test Bias, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Turner, A. Allan; Lozano-Nieto, Albert; Bouffard, Marcel – Measurement in Physical Education and Exercise Science, 2010
The purpose of this study was to examine the effect of three ventilation conditions (i.e., normal, regimented, and no-ventilation) on the reproducibility of bioimpedance scores in humans for the forearm and trunk segments. One hundred able-bodied North American men and women, from 18 to 71 years of age, volunteered as participants. The…
Descriptors: Ventilation, Generalizability Theory, Spectroscopy, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Burns, Matthew K.; Scholin, Sarah E.; Kosciolek, Stacey; Livingston, Judy – Journal of Psychoeducational Assessment, 2010
The current study examines the consistency of two response-to-intervention (RTI) decision-making models. Weekly progress monitoring data for 30 students participating in a Tier II intervention were collected for 30 weeks. The data were examined by comparing them to an aimline with a yearly goal and by computing a dual discrepancy (DD) using…
Descriptors: Reading Achievement, Reading Tests, Data Collection, Responses
Peer reviewed Peer reviewed
Direct linkDirect link
Wen, Zhonglin; Marsh, Herbert W.; Hau, Kit-Tai – Structural Equation Modeling: A Multidisciplinary Journal, 2010
Standardized parameter estimates are routinely used to summarize the results of multiple regression models of manifest variables and structural equation models of latent variables, because they facilitate interpretation. Although the typical standardization of interaction terms is not appropriate for multiple regression models, straightforward…
Descriptors: Structural Equation Models, Multiple Regression Analysis, Interaction, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Lihui; Lawson, Michael J.; Curtis, David D. – Language Teaching Research, 2015
Imagery training has been shown to improve reading comprehension. Recent research has also shown that the quality of visual mental imagery used is important for reading comprehension. A review of literature shows that there has been relatively little detailed research on the quality of imagery used by learners, especially in the case of students…
Descriptors: Educational Quality, Teaching Methods, English (Second Language), Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Hung, Su-Pin; Chen, Po-Hsi; Chen, Hsueh-Chih – Creativity Research Journal, 2012
Product assessment is widely applied in creative studies, typically as an important dependent measure. Within this context, this study had 2 purposes. First, the focus of this research was on methods for investigating possible rater effects, an issue that has not received a great deal of attention in past creativity studies. Second, the…
Descriptors: Item Response Theory, Creativity, Interrater Reliability, Undergraduate Students
Liu, Qin – Association for Institutional Research, 2012
This discussion constructs a survey data quality strategy for institutional researchers in higher education in light of total survey error theory. It starts with describing the characteristics of institutional research and identifying the gaps in literature regarding survey data quality issues in institutional research and then introduces the…
Descriptors: Institutional Research, Higher Education, Quality Control, Researchers
Peer reviewed Peer reviewed
Direct linkDirect link
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2009
We derive an estimator of the standardized value which, under the standard assumptions of normality and homoscedasticity, is more efficient than the established (asymptotically efficient) estimator and discuss its gains for small samples. (Contains 1 table and 3 figures.)
Descriptors: Efficiency, Computation, Statistics, Sample Size
Peer reviewed Peer reviewed
Direct linkDirect link
Sijtsma, Klaas – Psychometrika, 2009
This discussion paper argues that both the use of Cronbach's alpha as a reliability estimate and as a measure of internal consistency suffer from major problems. First, alpha always has a value, which cannot be equal to the test score's reliability given the inter-item covariance matrix and the usual assumptions about measurement error. Second, in…
Descriptors: Measurement, Error of Measurement, Scores, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Pages: 1  |  ...  |  96  |  97  |  98  |  99  |  100  |  101  |  102  |  103  |  104  |  ...  |  222