ERIC - Search Results

Showing 1 to 15 of 24 results

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

Differential Item Functioning Detection with the Mantel-Haenszel Procedure: The Effects of Matching Types and Other Factors

Peer reviewed

Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015

The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…

Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping

Investigating the Relationships between a Reading Test and Can-Do Statements of Performance on Reading Tasks

Liu, Hsin-min – ProQuest LLC, 2014

One of the fundamental problems in language testing is the lack of adequate generalizability between what a test is measuring and what fulfills the learners' real world language use needs. It is important to recognize that no matter how precisely a test measures a construct, if the way that a construct is defined and the way that test tasks are…

Descriptors: Reading Tests, Language Tests, Task Analysis, Generalizability Theory

Gaps in College Readiness: ACT and SAT Differences by Ethnicity across 10 School Years

Harvey, Donzel Wayne – ProQuest LLC, 2013

Purpose: The purpose of this study was to examine the college-readiness rates of Black, Hispanic, White, and Asian graduates of public secondary schools in Texas using archival data from the Texas Education Agency Academic Excellence Indicator System. Data examined were the average ACT and SAT scores for the past 10 school years (i.e., 2001-2002…

Descriptors: College Readiness, Ethnicity, Racial Differences, African American Students

Measuring Test Measurement Error: A General Approach

Peer reviewed

Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013

Test-based accountability as well as value-added assessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…

Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. REL 2013-002

Peer reviewed

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States are increasingly interested in including measures of student achievement growth, or "value-added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot…

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. Summary. REL 2013-002

Peer reviewed

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of-course tests. The evidence suggests that these methods can reliably distinguish…

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

The Quality vs. the Quantity of Schooling: What Drives Economic Growth?

Peer reviewed

Breton, Theodore R. – Economics of Education Review, 2011

This paper challenges Hanushek and Woessmann's (2008) contention that the quality and not the quantity of schooling determines a nation's rate of economic growth. I first show that their statistical analysis is flawed. I then show that when a nation's average test scores and average schooling attainment are included in a national income model,…

Descriptors: Economic Progress, Income, Statistical Significance, Educational Quality

The Impact of Student Ability and Method for Varying the Position of Correct Answers in Classroom Multiple-Choice Tests

Joseph, Dane Christian – ProQuest LLC, 2010

Multiple-choice item-writing guideline research is in its infancy. Haladyna (2004) calls for a science of item-writing guideline research. The purpose of this study is to respond to such a call. The purpose of this study was to examine the impact of student ability and method for varying the location of correct answers in classroom multiple-choice…

Descriptors: Evidence, Test Format, Guessing (Tests), Program Effectiveness

Error Rates in Measuring Teacher and School Performance Based on Student Test Score Gains. NCEE 2010-4004

Peer reviewed

Schochet, Peter Z.; Chiang, Hanley S. – National Center for Education Evaluation and Regional Assistance, 2010

This paper addresses likely error rates for measuring teacher and school performance in the upper elementary grades using value-added models applied to student test score gain data. Using realistic performance measurement system schemes based on hypothesis testing, we develop error rate formulas based on OLS and Empirical Bayes estimators.…

Descriptors: Teacher Effectiveness, Teacher Evaluation, Student Evaluation, Scores

Value Added?

UCLA IDEA, 2012

Value-added measures (VAM) use changes in student test scores to determine how much "value" an individual teacher has "added" to student growth during the school year. Some policymakers, school districts, and educational advocates have applauded VAM as a straightforward measure of teacher effectiveness: the better a teacher,…

Descriptors: Teacher Effectiveness, Teacher Evaluation, Educational Testing, Standardized Tests

Exploring Teacher Effectiveness Using Hierarchical Linear Models: Student- and Classroom-Level Predictors and Cross-Year Stability in Elementary School Reading

Peer reviewed

Munoz, Marco A.; Prather, Joseph R.; Stronge, James H. – Planning and Changing, 2011

Teacher effectiveness and evaluation using student growth measures is a popular reform strategy in education. Teachers can make a difference in student academic growth, but a question that begs an answer is how to go about measuring this impact. This study examines models of teacher effectiveness and the development of hierarchical linear models…

Descriptors: Reading Instruction, Elementary Education, Urban Schools, Teacher Effectiveness

Confidence and Tolerance Intervals for True Scores. ACT Technical Bulletin, Number 42.

Jarjoura, David – 1983

Issues regarding confidence and tolerance intervals are discussed within the context of educational measurement. Conceptual distinctions are drawn between these two types of intervals; and examples, under various error and true score models, are used to compare such intervals. It is shown that there tend to be only small differences in tolerance…

Descriptors: Educational Testing, Measurement Techniques, Models, Scores

A Perspective on the History of Generalizability Theory.

Peer reviewed

Brennan, Robert L. – Educational Measurement: Issues and Practice, 1997

The history of generalizability theory (G theory) is told from the perspective of one researcher's experiences, describing psychometric and scientific perspectives that influenced the development of G theory and its adoption. Work that remains to be done in the field is outlined. (SLD)

Descriptors: Educational Testing, Generalizability Theory, Measurement, Psychometrics

An Investigation of the Performance of the Generalized S-X² Item-Fit Index for Polytomous IRT Models. ACT Research Report Series, 2007-1

Kang, Taehoon; Chen, Troy T. – ACT, Inc., 2007

Orlando and Thissen (2000, 2003) proposed an item-fit index, S-X², for dichotomous item response theory (IRT) models, which has performed better than traditional item-fit statistics such as Yen's (1981) Q₁ and McKinley and Mills' (1985) G². This study extends the utility of S-X² to polytomous…

Descriptors: Item Response Theory, Models, Computer Software, Statistical Analysis
