Showing 1,186 to 1,200 of 3,316 results
Peer reviewed
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Peer reviewed
Long, Mark C. – Journal of Research on Educational Effectiveness, 2016
Using a "naïve" specification, this paper estimates the relationship between 36 high school characteristics and 24 student outcomes controlling for students' pre-high school characteristics. The goal of this exploration is not to generate causal estimates, but rather to: (a) compare the size of the relationships to determine which inputs…
Descriptors: Hypothesis Testing, Effect Size, High School Students, Student Characteristics
Peer reviewed
Aryadoust, Vahid – Educational Psychology, 2016
This study sought to examine the development of paragraph writing skills of 116 English as a second language university students over the course of 12 weeks and the relationship between the linguistic features of students' written texts as measured by Coh-Metrix--a computational system for estimating textual features such as cohesion and…
Descriptors: English (Second Language), Second Language Learning, Writing Skills, College Students
Peer reviewed
Raymond, Mark R.; Swygert, Kimberly A.; Kahraman, Nilufer – Journal of Educational Measurement, 2012
Although a few studies report sizable score gains for examinees who repeat performance-based assessments, research has not yet addressed the reliability and validity of inferences based on ratings of repeat examinees on such tests. This study analyzed scores for 8,457 single-take examinees and 4,030 repeat examinees who completed a 6-hour clinical…
Descriptors: Physicians, Licensing Examinations (Professions), Performance Based Assessment, Repetition
Peer reviewed
Zopluoglu, Cengiz; Davenport, Ernest C., Jr. – Educational and Psychological Measurement, 2012
The generalized binomial test (GBT) and ω indices are the most recent methods suggested in the literature to detect answer copying behavior on multiple-choice tests. The ω index is one of the most studied indices, but there has not yet been a systematic simulation study for the GBT index. In addition, the effect of the ability levels…
Descriptors: Statistical Analysis, Error of Measurement, Simulation, Multiple Choice Tests
Peer reviewed
Zhang, Guangjian; Preacher, Kristopher J.; Jennrich, Robert I. – Psychometrika, 2012
The infinitesimal jackknife, a nonparametric method for estimating standard errors, has been used to obtain standard error estimates in covariance structure analysis. In this article, we adapt it for obtaining standard errors for rotated factor loadings and factor correlations in exploratory factor analysis with sample correlation matrices. Both…
Descriptors: Factor Analysis, Maximum Likelihood Statistics, Error of Measurement, Nonparametric Statistics
Peer reviewed
Taylor, Matthew A.; Skourides, Andreas; Alvero, Alicia M. – Journal of Organizational Behavior Management, 2012
Interval recording procedures are used by persons who collect data through observation to estimate the cumulative occurrence and nonoccurrence of behavior/events. Although interval recording procedures can increase the efficiency of observational data collection, they can also induce error from the observer. In the present study, 50 observers were…
Descriptors: Safety, Behavior, Error of Measurement, Observation
Peer reviewed
Raykov, Tenko; Marcoulides, George A. – Structural Equation Modeling: A Multidisciplinary Journal, 2012
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
Descriptors: Predictive Validity, Reliability, Structural Equation Models, Measures (Individuals)
Peer reviewed
Ip, Edward Hak-Sing; Chen, Shyh-Huei – Applied Psychological Measurement, 2012
The problem of fitting unidimensional item-response models to potentially multidimensional data has been extensively studied. The focus of this article is on response data that contains a major dimension of interest but that may also contain minor nuisance dimensions. Because fitting a unidimensional model to multidimensional data results in…
Descriptors: Measurement, Item Response Theory, Scores, Computation
Peer reviewed
Westfall, Peter H.; Henning, Kevin S. S.; Howell, Roy D. – Structural Equation Modeling: A Multidisciplinary Journal, 2012
This article shows how interfactor correlation is affected by error correlations. Theoretical and practical justifications for error correlations are given, and a new equivalence class of models is presented to explain the relationship between interfactor correlation and error correlations. The class allows simple, parsimonious modeling of error…
Descriptors: Psychometrics, Correlation, Error of Measurement, Structural Equation Models
Peer reviewed
Horakova, Tereza; Houska, Milan – International Education Studies, 2014
The paper shows how the methodology for a pedagogical experiment can be improved through including the pre-research stage. If the experiment has the form of a test procedure, an improvement of methodology can be achieved using for example the methods of statistical and didactic analysis of tests which are traditionally used in other areas, i.e.…
Descriptors: Educational Research, Educational Experiments, Research Methodology, Statistical Analysis
Cheema, Jehanzeb R. – Review of Educational Research, 2014
Missing data are a common occurrence in survey-based research studies in education, and the way missing values are handled can significantly affect the results of analyses based on such data. Despite known problems with performance of some missing data handling methods, such as mean imputation, many researchers in education continue to use those…
Descriptors: Educational Research, Data, Data Collection, Data Processing
Peer reviewed
Ing, Marsha; Shih, Jeffrey C. – Middle Grades Research Journal, 2013
There are situations within middle school settings where measurements of students and teachers are used for high-stakes decisions. For example, student performance is used as an indicator of teacher quality or determines student eligibility for particular types of support services. Given the high-stakes nature of these types of assessments,…
Descriptors: Generalizability Theory, Middle School Teachers, Teacher Behavior, Research Design
Peer reviewed
Severo, Milton; Silva-Pereira, Fernanda; Ferreira, Maria Amelia – Anatomical Sciences Education, 2013
Several studies have shown that the standard error of measurement (SEM) can be used as an additional “safety net” to reduce the frequency of false-positive or false-negative student grading classifications. Practical examinations in clinical anatomy are often used as diagnostic tests to admit students to course final examinations. The aim of this…
Descriptors: Anatomy, Medical Education, Decision Making, Pass Fail Grading
Peer reviewed
Yao, Lihua – Applied Psychological Measurement, 2013
Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection