Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 8 |
| Since 2017 (last 10 years) | 8 |
| Since 2007 (last 20 years) | 210 |
Descriptor
| Educational Testing | 610 |
| Evaluation Methods | 610 |
| Student Evaluation | 272 |
| Educational Assessment | 231 |
| Elementary Secondary Education | 155 |
| Academic Achievement | 133 |
| Program Evaluation | 131 |
| Achievement Tests | 113 |
| Accountability | 108 |
| Educational Policy | 104 |
| Disabilities | 98 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Elementary Secondary Education | 150 |
| Elementary Education | 75 |
| Secondary Education | 67 |
| High Schools | 61 |
| Grade 4 | 57 |
| Grade 8 | 55 |
| Higher Education | 37 |
| Grade 10 | 29 |
| Postsecondary Education | 28 |
| Grade 11 | 21 |
| Adult Education | 8 |
| More ▼ | |
Audience
| Practitioners | 40 |
| Teachers | 21 |
| Administrators | 8 |
| Researchers | 8 |
| Policymakers | 7 |
| Students | 3 |
| Counselors | 1 |
| Media Staff | 1 |
Location
| United Kingdom | 18 |
| Canada | 13 |
| Florida | 10 |
| California | 9 |
| United Kingdom (England) | 9 |
| Kentucky | 8 |
| Australia | 7 |
| United States | 7 |
| United Kingdom (Wales) | 6 |
| New York | 5 |
| Virginia | 5 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
O'Neil, Timothy P. – ProQuest LLC, 2010
With scant research to draw upon with respect to the maintenance of vertical scales over time, decisions around the creation and performance of vertical scales over time necessarily suffers due to the lack of information. Undetected item parameter drift (IPD) presents one of the greatest threats to scale maintenance within an item response theory…
Descriptors: Scaling, Measures (Individuals), Item Response Theory, Educational Assessment
Livingston, Samuel A.; Antal, Judit – Applied Measurement in Education, 2010
A simultaneous equating of four new test forms to each other and to one previous form was accomplished through a complex design incorporating seven separate equating links. Each new form was linked to the reference form by four different paths, and each path produced a different score conversion. The procedure used to resolve these inconsistencies…
Descriptors: Measurement Techniques, Measurement, Educational Assessment, Educational Testing
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States are increasingly interested in including measures of student achievement growth, or "value-
added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student
growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot
…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of course tests. The evidence suggests that these methods can reliably distinguish…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Brookhart, Susan M. – ASCD, 2010
Don't settle for assessing recall and comprehension only when you can use this guide to create assessments for higher-order thinking skills. Assessment expert Susan M. Brookhart brings you up to speed on how to develop and use test questions and other assessments that reveal how well your students can analyze, reason, solve problems, and think…
Descriptors: Test Items, Performance Based Assessment, Thinking Skills, Cognitive Processes
Breton, Theodore R. – Economics of Education Review, 2011
This paper challenges Hanushek and Woessmann's (2008) contention that the quality and not the quantity of schooling determines a nation's rate of economic growth. I first show that their statistical analysis is flawed. I then show that when a nation's average test scores and average schooling attainment are included in a national income model,…
Descriptors: Economic Progress, Income, Statistical Significance, Educational Quality
Powers, Sonya Jean – ProQuest LLC, 2010
When test forms are administered to examinee groups that differ in proficiency, equating procedures are used to disentangle group differences from form differences. This dissertation investigates the extent to which equating results are population invariant, the impact of group differences on equating results, the impact of group differences on…
Descriptors: Evidence, Advanced Placement, Effect Size, True Scores
Koon, Sharon – ProQuest LLC, 2010
This study examined the effectiveness of the odds-ratio method (Penfield, 2008) and the multinomial logistic regression method (Kato, Moen, & Thurlow, 2009) for measuring differential distractor functioning (DDF) effects in comparison to the standardized distractor analysis approach (Schmitt & Bleistein, 1987). Students classified as participating…
Descriptors: Test Bias, Test Items, Reference Groups, Lunch Programs
Assiri, Mohammed S. – ProQuest LLC, 2011
With the focus on how a sample of 25 Arab ESL learners respond to the TOEFL-iBT reading tasks, this study aimed to find out what strategies respondents tend to use, investigate if there are differences between high- and low-scorers in strategy use, and determine aspects of effective strategy use among respondents. Data were collected using a…
Descriptors: Arabs, Program Effectiveness, Scoring, Data Analysis
Kim, Jiseon – ProQuest LLC, 2010
Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…
Descriptors: Test Length, Computer Assisted Testing, Classification, Probability
Helgesen, Angela – ProQuest LLC, 2010
This research project focuses on a freshman class in a suburban high school where incoming ninth graders were identified as being at-risk or not at-risk; students' scores on eighth grade standardized tests were critical in determining a student's placement in one of the two categories. The researcher established the predictive value of this…
Descriptors: Standardized Tests, At Risk Students, Grade 9, Grade 8
PEPNet-West, 2010
This paper presents the highlights of the 2008 Test Equity Summit held in Bloomfield, Colorado last August 6-8, 2008. The 2008 Test Equity Summit convened by the Postsecondary Education Programs Network (PEPNet) identified and examined problems, challenges, and issues that academic and psychoeducational tests pose for individuals who are deaf or…
Descriptors: Partial Hearing, Educational Testing, Deafness, Test Construction
American Journal of Distance Education, 2010
David Foster is the founder of Kryterion, an Internet test administration company, and currently serves there as chief scientist and executive vice president. He is the author of numerous articles for industry trade journals and textbooks and sits on the Council for the International Test Commission. In this interview, Foster talks about his…
Descriptors: Test Construction, High Stakes Tests, Educational Technology, Interviews
Raj Chetty; John N. Friedman; Jonah E. Rockoff – National Bureau of Economic Research, 2011
Are teachers' impacts on students' test scores ("value-added") a good measure of their quality? This question has sparked debate largely because of disagreement about (1) whether value-added (VA) provides unbiased estimates of teachers' impacts on student achievement and (2) whether high-VA teachers improve students' long-term outcomes.…
Descriptors: Academic Achievement, Scores, Teacher Effectiveness, Outcomes of Education
Cirillo, Mary Grupe – ProQuest LLC, 2010
The purpose of this study was to determine the impact of Virginia school divisions' policy of paying the fee for students to take Advanced Placement exams on Advanced Placement course enrollment, the number of Advanced Placement exams taken by students, the average scores earned and the percent of students earning qualifying scores of 3, 4, or 5…
Descriptors: Advanced Placement, Lunch Programs, Academic Achievement, School Size

Direct link
Peer reviewed
