ERIC - Search Results

Showing 1 to 15 of 17 results

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristic curves and found the "H^T" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

The Applicability of Multidimensional Computerized Adaptive Testing for Cognitive Ability Measurement in Organizational Assessment

Peer reviewed

Makransky, Guido; Glas, Cees A. W. – International Journal of Testing, 2013

Cognitive ability tests are widely used in organizations around the world because they have high predictive validity in selection contexts. Although these tests typically measure several subdomains, testing is usually carried out for a single subdomain at a time. This can be ineffective when the subdomains assessed are highly correlated. This…

Descriptors: Foreign Countries, Cognitive Ability, Adaptive Testing, Feedback (Response)

Wise and Proper Use of National Assessment of Educational Progress (NAEP) Data

Peer reviewed

Innes, Richard G. – Journal of School Choice, 2012

This article provides examples of how serious misconceptions can result when only "all student" scores from the National Assessment of Educational Progress (NAEP) are used for simplistic state-to-state comparisons. Suggestions for better treatment are presented. The article also compares Kentucky's eighth grade EXPLORE testing to NAEP…

Descriptors: National Competency Tests, Scoring, Misconceptions, Academic Achievement

The Contribution of Constructed Response Items to Large Scale Assessment: Measuring and Understanding Their Impact

Peer reviewed

Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012

This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…

Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics

Playing with the Stakes: A Consideration of an Aspect of the Social Context of a Gatekeeping Writing Assessment

Peer reviewed

Baker, Beverly A. – Assessing Writing, 2010

In high-stakes writing assessments, rater training in the use of a rating scale does not eliminate variability in grade attribution. This realisation has been accompanied by research that explores possible sources of rater variability, such as rater background or rating scale type. However, there has been little consideration thus far of…

Descriptors: Foreign Countries, Writing Evaluation, Writing Tests, Testing

Person Response Functions and the Definition of Units in the Social Sciences

Peer reviewed

Engelhard, George, Jr.; Perkins, Aminah F. – Measurement: Interdisciplinary Research and Perspectives, 2011

Humphry (this issue) has written a thought-provoking piece on the interpretation of item discrimination parameters as scale units in item response theory. One of the key features of his work is the description of an item response theory (IRT) model that he calls the logistic measurement function that combines aspects of two traditions in IRT that…

Descriptors: Foreign Countries, Social Sciences, Item Response Theory, Testing

Computer-Based Signing Accommodations: Comparing a Recorded Human with an Avatar

Peer reviewed

Russell, Michael; Kavanaugh, Maureen; Masters, Jessica; Higgins, Jennifer; Hoffmann, Thomas – Journal of Applied Testing Technology, 2009

Many students who are deaf or hard-of-hearing are eligible for a signing accommodation for state and other standardized tests. The signing accommodation, however, presents several challenges for testing programs that attempt to administer tests under standardized conditions. One potential solution for many of these challenges is the use of…

Descriptors: Testing Programs, Student Attitudes, Standardized Tests, Academic Achievement

Re-Examining Test Item Issues in the TIMSS Mathematics and Science Assessments

Peer reviewed

Wang, Jianjun – School Science and Mathematics, 2011

As the largest international study ever undertaken, the Trends in International Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…

Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking

Challenges and Problems for Research in the Field of Statewide Exams. A Stock Taking of Differing Procedures and Standardization Levels

Peer reviewed

Klein, Esther Dominique; van Ackeren, Isabell – Studies in Educational Evaluation, 2011

Statewide exit examinations play an important role in discussions on school effectiveness. Referring to educational governance concepts, this paper presumes a relation between varying organizational structures of statewide examinations across states, and heterogeneous effects on school actors. It is assumed that their ability to affect work in…

Descriptors: Exit Examinations, Governance, School Effectiveness, Foreign Countries

Review of "Multiple Choice: Charter School Performance in 16 States"

Miron, Gary; Applegate, Brooks – Education and the Public Interest Center, 2009

The Center for Research on Education Outcomes (CREDO) at Stanford University conducted a large-scale analysis of the impact of charter schools on student performance. The center's data covered 65-70% of the nation's charter schools. Although results varied by state, 17% of the charter school students have significantly higher math results than…

Descriptors: Evidence, Traditional Schools, Charter Schools, Program Effectiveness

Identification of Student- and Teacher-Level Variables in Modeling Variation of Mathematics Achievement Data

Tarr, James E.; Ross, Daniel J.; McNaught, Melissa D.; Chavez, Oscar; Grouws, Douglas A.; Reys, Robert E.; Sears, Ruthmae; Taylan, R. Didem – Online Submission, 2010

The Comparing Options in Secondary Mathematics: Investigating Curriculum (COSMIC) project is a longitudinal study of student learning from two types of mathematics curricula: integrated and subject-specific. Previous large-scale research studies such as the National Assessment of Educational Progress (NAEP) indicate that numerous variables are…

Descriptors: Mathematics Education, Teacher Characteristics, Mathematics Achievement, Program Effectiveness

Review of "How New York City's Charter Schools Affect Achievement"

Reardon, Sean F. – Education and the Public Interest Center, 2009

"How New York City's Charter Schools Affect Achievement" estimates the effects on student achievement of attending a New York City charter school rather than a traditional public school and investigates the characteristics of charter schools associated with the most positive effects on achievement. Because the report relies on an…

Descriptors: Charter Schools, Academic Achievement, Achievement Gains, Achievement Rating

Examining the Validity and Fairness of a State Standards-Based Assessment of English-Language Arts for Deaf or Hard of Hearing Students

Peer reviewed

Steinberg, Jonathan; Cline, Frederick; Ling, Guangming; Cook, Linda; Tognatta, Namrata – Journal of Applied Testing Technology, 2009

This study examines the appropriateness of a large-scale state standards-based English-Language Arts (ELA) assessment for students who are deaf or hard of hearing by comparing the internal test structures for these students to students without disabilities. The Grade 4 and 8 ELA assessments were analyzed via a series of parcel-level exploratory…

Descriptors: Test Bias, Language Arts, State Standards, Partial Hearing

What Makes for a Good Teacher and Who Can Tell? Working Paper 30

Harris, Douglas N.; Sass, Tim R. – National Center for Analysis of Longitudinal Data in Education Research, 2009

Mounting pressure in the policy arena to improve teacher productivity--either by improving signals that predict teacher performance or through creating incentive contracts based on performance--has spurred two related questions: Are there important determinants of teacher productivity that are not captured by teacher credentials but that can be…

Descriptors: Credentials, Teacher Effectiveness, Teaching Skills, Principals

The Usefulness of Comparative Product Research for the Use of Evaluation Results in Curriculum Development.

van den Berg, Gerald; And Others – 1986

This paper examines possible explanations for the fact that evaluation results are rarely used. Most explanatory factors for use of results in decision making mentioned in the literature have been clustered in three categories of variables. They concern: (1) the making of decisions about the practical problem; (2) the way in which evaluation is…

Descriptors: Comparative Analysis, Curriculum Development, Curriculum Evaluation, Decision Making
