NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20241
Since 2021 (last 5 years)2
Since 2016 (last 10 years)9
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Beauducel, André; Hilger, Norbert – Educational and Psychological Measurement, 2022
In the context of Bayesian factor analysis, it is possible to compute plausible values, which might be used as covariates or predictors or to provide individual scores for the Bayesian latent variables. Previous simulation studies ascertained the validity of mean plausible values by the mean squared difference of the mean plausible values and the…
Descriptors: Bayesian Statistics, Factor Analysis, Prediction, Simulation
Peer reviewed Peer reviewed
Direct linkDirect link
Julie Cohen; Luke C. Miller; Rosalie Chung; Emily Wiseman; Erik Ruzek – Journal of Education, 2024
Helping students engage with complex texts has been a longstanding challenge, though teachers have received little guidance about practices that help students in engaging with texts. This paper provides a range of empirical evidence about a tool designed to provide formative insight into text-focused teaching, which we used to reliably score more…
Descriptors: Reading Instruction, Teaching Methods, Reader Text Relationship, Reading Research
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Liu, Yuming; Robin, Frédéric; Yoo, Hanwook; Manna, Venessa – ETS Research Report Series, 2018
The "GRE"® Psychology test is an achievement test that measures core knowledge in 12 content domains that represent the courses commonly offered at the undergraduate level. Currently, a total score and 2 subscores, experimental and social, are reported to test takers as well as graduate institutions. However, the American Psychological…
Descriptors: College Entrance Examinations, Graduate Study, Psychological Testing, Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sloat, Edward; Amrein-Beardsley, Audrey; Sabo, Kent E. – AERA Open, 2017
In this study, we investigated the factor structure underlying the TAP System for Teacher and Student Advancement using confirmatory and exploratory factor-analytic methods and under conditions of multilevel (nested) data structures and ordinal measurement scales. We found evidence of generally poor fit with the system's posited first-order,…
Descriptors: Factor Structure, Teacher Evaluation, Value Added Models, Elementary Secondary Education
Peer reviewed Peer reviewed
Direct linkDirect link
Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018
The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Wallace, Tanner LeBaron; Kelcey, Benjamin; Ruzek, Erik – American Educational Research Journal, 2016
We conducted a theory-based analysis of the underlying structure of the Tripod student perception survey instrument using the Measures of Effective Teaching (MET) database (N = 1,049 middle school math class sections; N = 25,423 students). Multilevel item factor analyses suggested that an alternative bifactor structure best fit the Tripod items,…
Descriptors: Student Surveys, Attitude Measures, Teacher Effectiveness, Theories
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Arseven, Zeynep; Kiliç, Abdurrahman; Sahin, Seyma – Universal Journal of Educational Research, 2016
In the present study, it is aimed to develop a valid and reliable scale for determining value-eroding behaviors of teachers, hence their values of judgment. The items of the "Value-eroding Teacher Behaviors Scale" were designed in the form of 5-point likert type rating scale. The exploratory factor analysis (EFA) was conducted to…
Descriptors: Teacher Behavior, Behavior Rating Scales, Test Reliability, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Kuhfeld, Megan – Educational Assessment, 2017
This article develops a validity argument for the use of the Tripod student survey of instructional practices to assess teacher effectiveness in summative teacher evaluations and professional development decisions. This paper expands upon previous research in three ways: (a) it draws from current validity thinking to examine the evidence for…
Descriptors: Student Evaluation of Teacher Performance, Teacher Evaluation, Classroom Techniques, Classroom Observation Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Blazar, David; Litke, Erica; Barmore, Johanna – American Educational Research Journal, 2016
Education agencies are evaluating teachers using student achievement data. However, very little is known about the comparability of test-based or "value-added" metrics across districts and the extent to which they capture variability in classroom practices. Drawing on data from four urban districts, we found that teachers were…
Descriptors: Educational Quality, Teacher Evaluation, Academic Achievement, School Districts