NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1,801 to 1,815 of 3,311 results Save | Export
Harris, Douglas N. – Policy Analysis for California Education, PACE (NJ3), 2010
In this policy brief, the author explores the problems with attainment measures when it comes to evaluating performance at the school level, and explores the best uses of value-added measures. These value-added measures, the author writes, are useful for sorting out-of-school influences from school influences or from teacher performance, giving…
Descriptors: Principals, Observation, Teacher Evaluation, Measurement Techniques
Lipscomb, Stephen; Teh, Bing-ru; Gill, Brian; Chiang, Hanley; Owens, Antoniya – Mathematica Policy Research, Inc., 2010
This report summarizes research findings and implementation practices for teacher and principal value-added models (VAMs), as a first step in the Team Pennsylvania Foundation's (Team PA) pilot project to inform the development of a full, statewide model evaluation system. We have selected 21 studies that represent key issues and findings in the…
Descriptors: Pilot Projects, Outcomes of Education, Principals, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Rosch, David M.; Schwartz, Leslie M. – Journal of Leadership Education, 2009
As more institutions of higher education engage in the practice of leadership education, the effective assessment of these efforts lags behind due to a variety of factors. Without an intentional assessment plan, leadership educators are liable to make one or more of several common errors in assessing their programs and activities. This article…
Descriptors: Leadership Training, Administrator Education, College Outcomes Assessment, Program Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Kherif, Ferath; Josse, Goulven; Seghier, Mohamed L.; Price, Cathy J. – Journal of Cognitive Neuroscience, 2009
The aim of this study was to find the most prominent source of intersubject variability in neuronal activation for reading familiar words aloud. To this end, we collected functional imaging data from a large sample of subjects (n = 76) with different demographic characteristics such as handedness, sex, and age, while reading. The…
Descriptors: Handedness, Semantics, Reading Strategies, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Huitema, Bradley E.; McKean, Joseph W. – Educational and Psychological Measurement, 2007
Regression models used in the analysis of interrupted time-series designs assume statistically independent errors. Four methods of evaluating this assumption are the Durbin-Watson (D-W), Huitema-McKean (H-M), Box-Pierce (B-P), and Ljung-Box (L-B) tests. These tests were compared with respect to Type I error and power under a wide variety of error…
Descriptors: Regression (Statistics), Evaluation Methods, Error of Measurement, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Jamshidian, M.; Khatoonabadi, M. – International Journal of Mathematical Education in Science and Technology, 2007
Almost all introductory and intermediate level statistics textbooks include the topic of confidence interval for the population mean. Almost all these texts introduce the median as a robust measure of central tendency. Only a few of these books, however, cover inference on the population median and in particular confidence interval for the median.…
Descriptors: Intervals, Simulation, Computation, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Bradley, Kelly D.; Royal, Kenneth D.; Bradley, James W. – Journal of College Teaching & Learning, 2008
The reliability and validity of course evaluations in higher education is often assumed. The typical Likert-type surveys utilized when students' evaluate the course and instructor often overlook measurement issues, or deal with them in an ineffective manner. Given the importance that is placed on higher education course evaluations, with results…
Descriptors: Higher Education, Course Evaluation, Reliability, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Zhang, Bo; Stone, Clement A. – Educational and Psychological Measurement, 2008
This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…
Descriptors: Monte Carlo Methods, Sampling, Goodness of Fit, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
MacCann, Robert G. – Educational and Psychological Measurement, 2008
It is shown that the Angoff and bookmarking cut scores are examples of true score equating that in the real world must be applied to observed scores. In the context of defining minimal competency, the percentage "failed" by such methods is a function of the length of the measuring instrument. It is argued that this length is largely…
Descriptors: True Scores, Cutting Scores, Minimum Competencies, Scores
Rice, Jennifer King – National Education Policy Center, 2012
Schools and school systems throughout the nation are increasingly experimenting with using various instructional technologies to improve productivity and decrease costs, but evidence on both the effectiveness and the costs of education technology is limited. A recent report published by the Thomas B. Fordham Institute sets out to describe "the…
Descriptors: Evidence, Electronic Learning, Distance Education, Online Courses
Peer reviewed Peer reviewed
Silverstein, A. B. – Journal of Clinical Psychology, 1985
A formula is presented for the standard error of estimate of Deviation Quotients (DQs). The formula is shown to perform well when used with data on short forms of two of Wechsler's scales. (Author/JAC)
Descriptors: Error of Measurement
Briggs, Derek C. – Partnership for Assessment of Readiness for College and Careers, 2011
There is often confusion about distinctions between growth models and value-added models. The first half of this paper attempts to dispel some of these confusions by clarifying terminology and illustrating by example how the results from a large-scale assessment can and will be used to make inferences about student growth and the value-added…
Descriptors: Value Added Models, Language Usage, Measurement, Inferences
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Oh, Hyeonjoo J.; Guo, Hongwen; Walker, Michael E. – ETS Research Report Series, 2009
Issues of equity and fairness across subgroups of the population (e.g., gender or ethnicity) must be seriously considered in any standardized testing program. For this reason, many testing programs require some means for assessing test characteristics, such as reliability, for subgroups of the population. However, often only small sample sizes are…
Descriptors: Standardized Tests, Test Reliability, Sample Size, Bayesian Statistics
Rios-Uribe, Carlos Andres – ProQuest LLC, 2009
Measurements of social constructs that evaluate natural hazard preparedness are important to decrease natural hazard vulnerability. Preparedness reduces natural hazard impacts and human vulnerability. Investment in education and education research contribute to human sustainable development and natural hazard preparedness. Faced with other needs,…
Descriptors: Learning Theories, Structural Equation Models, Validity, Physical Geography
Magno, Carlo – Online Submission, 2009
The present report demonstrates the difference between classical test theory (CTT) and item response theory (IRT) approach using an actual test data for chemistry junior high school students. The CTT and IRT were compared across two samples and two forms of test on their item difficulty, internal consistency, and measurement errors. The specific…
Descriptors: Private Schools, Measurement, Error of Measurement, Foreign Countries
Pages: 1  |  ...  |  117  |  118  |  119  |  120  |  121  |  122  |  123  |  124  |  125  |  ...  |  221