Showing 1,111 to 1,125 of 3,311 results
Peer reviewed
Haltigan, John D.; Leerkes, Esther M.; Wong, Maria S.; Fortuna, Keren; Roisman, Glenn I.; Supple, Andrew J.; O'Brien, Marion; Calkins, Susan D.; Plamondon, André – Child Development, 2014
This study examined the developmental significance of mothers' adult attachment representations assessed prenatally with the Adult Attachment Interview in relation to observed maternal sensitivity at 6 months postpartum in an ethnically diverse sample (N = 131 African American; N = 128 European American). Multiple-group confirmatory factor…
Descriptors: Attachment Behavior, Mothers, Ethnicity, Parent Caregiver Relationship
Peer reviewed
Rutkowski, Leslie – Applied Measurement in Education, 2014
Large-scale assessment programs such as the National Assessment of Educational Progress (NAEP), Trends in International Mathematics and Science Study (TIMSS), and Programme for International Student Assessment (PISA) use a sophisticated assessment administration design called matrix sampling that minimizes the testing burden on individual…
Descriptors: Measurement, Testing, Item Sampling, Computation
Peer reviewed
Lin, Chih-Kai; Zhang, Jinming – Language Testing, 2014
Research on the relationship between English language proficiency standards and academic content standards serves to provide information about the extent to which English language learners (ELLs) are expected to encounter academic language use that facilitates their content learning, such as in mathematics and science. Standards-to-standards…
Descriptors: Language Proficiency, Academic Standards, Generalizability Theory, English Language Learners
Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Robinson-Cimpian, Joseph P. – MDRC, 2014
A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…
Descriptors: Regression (Statistics), Research Design, Quasiexperimental Design, Research Methodology
Peer reviewed
Gao, Xingyuan; Xia, Jiangang; Shen, Jianping; Ma, Xin – Chinese Education & Society, 2018
Successful school leadership is highly context dependent. However, few studies have focused on comparisons of school leadership across different countries. Even among the existing studies, comparisons tend to be conducted with the assumption that the underlying factorial structure of the construct is the same. In this study, school principal's…
Descriptors: Comparative Analysis, Comparative Education, Principals, Decision Making
Peer reviewed
Zimmerman, Donald W. – Psicologica: International Journal of Methodology and Experimental Psychology, 2012
In order to circumvent the influence of correlation in paired-samples and repeated measures experimental designs, researchers typically perform a one-sample Student t test on difference scores. That procedure entails some loss of power, because it employs N - 1 degrees of freedom instead of the 2N - 2 degrees of freedom of the…
Descriptors: Correlation, Statistical Analysis, Statistical Significance, Scores
Peer reviewed
Rhoads, Christopher – Journal of Research on Educational Effectiveness, 2016
Experimental evaluations that involve the educational system usually involve a hierarchical structure (students are nested within classrooms that are nested within schools, etc.). Concerns about contamination, where research subjects receive certain features of an intervention intended for subjects in a different experimental group, have often led…
Descriptors: Educational Experiments, Error of Measurement, Research Design, Statistical Analysis
Peer reviewed
Mousavi, Amin; Krishnan, Vijaya – Alberta Journal of Educational Research, 2016
The Early Development Instrument (EDI) is a widely used teacher rating tool to assess kindergartners' developmental outcomes in Canada and a number of other countries. This paper examines the measurement invariance of EDI domains across ESL status and gender by means of multi-group confirmatory factor analysis. The results suggest evidence of…
Descriptors: Foreign Countries, Measures (Individuals), Child Development, Rating Scales
Peer reviewed
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Peer reviewed
Skinner, Ellen; Saxton, Emily; Currie, Cailin; Shusterman, Gwen – International Journal of Science Education, 2017
As part of long-standing efforts to promote undergraduates' success in science, researchers have investigated the instructional strategies and motivational factors that promote student learning and persistence in science coursework and majors. This study aimed to create a set of brief measures that educators and researchers can use as tools to…
Descriptors: Undergraduate Students, Science Instruction, Majors (Students), Biology
Peer reviewed
Woods, Carol M.; Cai, Li; Wang, Mian – Educational and Psychological Measurement, 2013
Differential item functioning (DIF) occurs when the probability of responding in a particular category to an item differs for members of different groups who are matched on the construct being measured. The identification of DIF is important for valid measurement. This research evaluates an improved version of Lord's χ² Wald test for…
Descriptors: Test Bias, Item Response Theory, Computation, Comparative Analysis
Peer reviewed
Liu, Yang; Maydeu-Olivares, Alberto – Educational and Psychological Measurement, 2013
Local dependence (LD) for binary IRT models can be diagnosed using Chen and Thissen's bivariate X² statistic and the score test statistics proposed by Glas and Suarez-Falcon, and Liu and Thissen. Alternatively, LD can be assessed using general purpose statistics such as bivariate residuals or Maydeu-Olivares and Joe's Mᵣ…
Descriptors: Item Response Theory, Statistical Analysis, Models, Goodness of Fit
Peer reviewed
Rindskopf, David – Society for Research on Educational Effectiveness, 2013
Single case designs (SCDs) generally consist of a small number of short time series in two or more phases. The analysis of SCDs statistically fits in the framework of a multilevel model, or hierarchical model. The usual analysis does not take into account the uncertainty in the estimation of the random effects. This not only has an effect on the…
Descriptors: Research Design, Bayesian Statistics, Computation, Data
Peer reviewed
Petscher, Yaacov; Cummings, Kelli Dawn; Biancarosa, Gina; Fien, Hank – Assessment for Effective Intervention, 2013
The purpose of this article is to provide a commentary on the current state of several measurement issues pertaining to curriculum-based measures of reading (R-CBM). We begin by providing an overview of the utility of R-CBM, followed by a presentation of five specific measurement considerations: (a) the reliability of R-CBM oral reading fluency…
Descriptors: Measurement, Reading Fluency, Curriculum Based Assessment, Error of Measurement
Peer reviewed
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability