NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)3
Since 2007 (last 20 years)21
Audience
Researchers2
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 56 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Atteberry, Allison; Mangan, Daniel – Educational Researcher, 2020
Papay (2011) noticed that teacher value-added measures (VAMs) from a statistical model using the most common pre/post testing timeframe--current-year spring relative to previous spring (SS)--are essentially unrelated to those same teachers' VAMs when instead using next-fall relative to current-fall (FF). This is concerning since this choice--made…
Descriptors: Correlation, Value Added Models, Pretests Posttests, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Tindal, Gerald; Nese, Joseph F. T.; Stevens, Joseph J. – Educational Assessment, 2017
For the past decade, the accountability model associated with No Child Left Behind (NCLB) emphasized proficiency on end of year tests; with Every Student Succeeds Act (ESSA) the emphasis on proficiency within statewide testing programs, though now integrated with other measures of student learning, nevertheless remains a primary metric for…
Descriptors: Testing Programs, Middle School Students, Models, State Standards
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Kelley, Ronald Scott – ProQuest LLC, 2012
Scope and Method of Study: This study focused on the development and use of the AT-SAT test battery and the Initial En Route Qualification training course for the selection, training, and evaluation of air traffic controller candidates. The Pearson product moment correlation coefficient was used to measure the linear relationship between the…
Descriptors: Traffic Safety, Scores, Equated Scores, Multiple Regression Analysis
Northwest Evaluation Association, 2014
Recently, the Northwest Evaluation Association (NWEA) completed a study to connect the scale of the North Carolina State End of Grade (EOG) Testing Program used for North Carolina's mathematics and reading assessments with NWEA's Rausch Interval Unit (RIT) scale. Information from the state assessments was used in a study to establish…
Descriptors: Alignment (Education), Testing Programs, Equated Scores, Standard Setting
Benson, Lauren – ProQuest LLC, 2017
This executive position paper identifies preferred modes of communication for parents and guardians in a small New Jersey Public School District. Research was conducted because there has been an unprecedented test refusal initiative by parents and guardians of New Jersey Public School Students who are mandated to sit for Partnership for Assessment…
Descriptors: Parent Attitudes, Resistance (Psychology), School Districts, Compliance (Legal)
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Qian – International Journal of Science and Mathematics Education, 2014
In this study, the Trends in International Mathematics and Science Study 2007 data were used to build mathematics achievement models of fourth graders in two East Asian school systems: Hong Kong and Singapore. In each school system, eight variables at student level and nine variables at school/class level were incorporated to build an achievement…
Descriptors: Foreign Countries, Mathematics Achievement, Grade 4, Mathematics Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Debeer, Dries; Buchholz, Janine; Hartig, Johannes; Janssen, Rianne – Journal of Educational and Behavioral Statistics, 2014
In this article, the change in examinee effort during an assessment, which we will refer to as persistence, is modeled as an effect of item position. A multilevel extension is proposed to analyze hierarchically structured data and decompose the individual differences in persistence. Data from the 2009 Program of International Student Achievement…
Descriptors: Reading Tests, International Programs, Testing Programs, Individual Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
Wise, Steven L.; Ma, Lingling; Kingsbury, G. Gage; Hauser, Carl – Northwest Evaluation Association, 2010
This study investigated the relationships between when a test is administered and the amount of test-taking effort exhibited by examinees. Three time-related variables were investigated: the time of year the test was administered, the day of the week the test event occurred, and the time of day that the test event occurred. Mean effort did not…
Descriptors: Academic Achievement, Test Wiseness, Investigations, Schematic Studies
Wang, Huan – ProQuest LLC, 2010
Multiple uses of the same assessment may present challenges for both the design and use of an assessment. Little advice, however, has been given to assessment developers as to how to understand the phenomena of multiple assessment use and meet the challenges these present. Particularly problematic is the case in which an assessment is used for…
Descriptors: Test Use, Testing Programs, Program Effectiveness, Test Construction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010
Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…
Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Peer reviewed Peer reviewed
Direct linkDirect link
Jorgensen, Robyn; Lowrie, Tom – International Journal for Mathematics Teaching and Learning, 2015
This paper explores the relationship between social backgrounds and geographical locations with mathematical achievement. Using the national testing system in Australia, correlations between the variables were explored and it was found that students from rural and low SES backgrounds are still being marginalised in school mathematics--in terms of…
Descriptors: Equal Education, Mathematics Education, Mathematics Achievement, Foreign Countries
Daniel, Tracy Demetrie – ProQuest LLC, 2012
Determining if the investment in educational technology will improve student achievement is complicated and multifarious. The purpose of this study was to evaluate the influence of teacher technology integration on student achievement as measured by the Mississippi Subject Area Testing Program (SATP) and to explore the relationship between…
Descriptors: Academic Achievement, High Stakes Tests, Educational Technology, Self Efficacy
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4