ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	12
Since 2007 (last 20 years)	39

Descriptor

Test Bias	75
Test Validity	59
Test Reliability	28
Test Construction	27
Student Evaluation	20
Evaluation Methods	15
Testing	15
Scores	13
Validity	13
Achievement Tests	12
Elementary Secondary Education	12
Higher Education	12
Academic Achievement	11
Test Items	11
Psychometrics	10
Accountability	9
Educational Assessment	9
Standardized Tests	9
Educational Policy	8
English (Second Language)	8
Federal Legislation	8
Language Tests	8
Testing Accommodations	8
Intelligence Tests	7
Models	7
More ▼

Publication Type

Reports - Descriptive	75
Journal Articles	46
Numerical/Quantitative Data	5
Opinion Papers	4
Speeches/Meeting Papers	4
Guides - Non-Classroom	3
Information Analyses	2
Reports - Research	2
Reports - Evaluative	1

Education Level

Elementary Secondary Education	13
Secondary Education	7
Elementary Education	6
Primary Education	6
Early Childhood Education	5
Grade 3	5
Grade 4	5
Grade 5	5
Grade 6	5
Grade 7	5
Higher Education	5
Intermediate Grades	5
Junior High Schools	5
Middle Schools	5
High Schools	4
Grade 8	3
Grade 9	2
Postsecondary Education	2
More ▼

Audience

Practitioners	5
Researchers	5
Administrators	3
Teachers	3
Community	1
Parents	1

Location

New York	4
United States	3
California	2
Australia	1
Canada	1
Florida	1
France	1
Germany	1
Hong Kong	1
Israel	1
Malaysia	1
Nebraska	1
North Carolina	1
Tennessee	1
Texas	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	7
Elementary and Secondary…	1
Elementary and Secondary…	1
Every Student Succeeds Act…	1

Assessments and Surveys

International Adult Literacy…	3
SAT (College Admission Test)	3
Alberta Grade Twelve Diploma…	1
College Level Academic Skills…	1
Measures of Academic Progress	1
National Assessment of…	1
System of Multicultural…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 75 results Save | Export

Hold the Bets! Should Quasi-Experiments Be Preferred to True Experiments When Causal Generalization Is the Goal?

Peer reviewed

Direct link

Andrew P. Jaciw – American Journal of Evaluation, 2025

By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…

Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias

A Psychometric View of Technology-Based Assessments

Peer reviewed

Direct link

Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022

With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…

Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data

"Classroometrics": The Validity, Reliability, and Fairness of Classroom Music Assessments

Peer reviewed

Direct link

Wesolowski, Brian C. – Music Educators Journal, 2020

Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…

Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation

MAP Growth Theory of Action

Download full text

Meyer, J. Patrick; Dahlin, Michael – NWEA, 2022

The MAP® Growth™ theory of action describes key features of MAP Growth and its position in a comprehensive assessment system. The basic premise of the theory of action is that all students learn when MAP Growth is situated in a comprehensive assessment system and used for its intended purposes to yield information about student learning and enable…

Descriptors: Achievement Tests, Academic Achievement, Achievement Gains, Student Evaluation

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Direct link

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Digital Module 12: Think-Aloud Interviews and Cognitive Labs https://ncme.elevate.commpartners.com

Peer reviewed

Direct link

Leighton, Jacqueline P.; Lehman, Blair – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jacqueline Leighton and Dr. Blair Lehman review differences between think-aloud interviews to measure problem-solving processes and cognitive labs to measure comprehension processes. Learners are introduced to historical, theoretical, and procedural differences between these methods and how to use and analyze…

Descriptors: Protocol Analysis, Interviews, Problem Solving, Cognitive Processes

What School Psychologists Need to Know about Differential Item Functioning

Direct link

Flanagan, Agnes; Cormier, Damien C. – Communique, 2019

One of the areas subsumed under the data-based decision making and accountability practice identified in the National Association of School Psychologists' (NASP) "Model for Integrated School Psychological Services" is to collect information on psychological and educational variables to make decisions at a number of levels of service…

Descriptors: Test Bias, School Psychologists, Measurement, Data Collection

Valid and Reliable Assessments. CSAI Update

Download full text

Center on Standards and Assessments Implementation, 2018

Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…

Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias

Making the Case for the Quality and Use of a New Language Proficiency Assessment: Validity Argument for the Redesigned "TOEIC Bridge"® Tests. Research Report. ETS RR-21-20

Peer reviewed
PDF on ERIC

Download full text

Schmidgall, Jonathan; Cid, Jaime; Carter Grissom, Elizabeth; Li, Lucy – ETS Research Report Series, 2021

The redesigned "TOEIC Bridge"® tests were designed to evaluate test takers' English listening, reading, speaking, and writing skills in the context of everyday adult life. In this paper, we summarize the initial validity argument that supports the use of test scores for the purpose of selection, placement, and evaluation of a test…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Language Proficiency

Linear Logistic Test Modeling with R

Peer reviewed
PDF on ERIC

Download full text

Baghaei, Purya; Kubinger, Klaus D. – Practical Assessment, Research & Evaluation, 2015

The present paper gives a general introduction to the linear logistic test model (Fischer, 1973), an extension of the Rasch model with linear constraints on item parameters, along with eRm (an R package to estimate different types of Rasch models; Mair, Hatzinger, & Mair, 2014) functions to estimate the model and interpret its parameters. The…

Descriptors: Item Response Theory, Models, Test Validity, Hypothesis Testing

An NCME Instructional Module on Population Invariance in Linking and Equating

Peer reviewed

Direct link

Huggins, Anne C.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2012

A goal for any linking or equating of two or more tests is that the linking function be invariant to the population used in conducting the linking or equating. Violations of population invariance in linking and equating jeopardize the fairness and validity of test scores, and pose particular problems for test-based accountability programs that…

Descriptors: Equated Scores, Tests, Test Bias, Validity

Enhancing Malaysian Teachers' Assessment Literacy

Peer reviewed
PDF on ERIC

Download full text

Lian, Lim Hooi; Yew, Wun Thiam; Meng, Chew Cheng – International Education Studies, 2014

Currently, in order to reform the Malaysian education system, there have been a number of education policy initiatives launched by the Malaysian Ministry of Education (MOE). All these initiatives have encouraged and inculcated teaching and learning for creativity, critical, innovative and higher-order thinking skills rather than conceptual…

Descriptors: Foreign Countries, Educational Policy, Evaluation Methods, Teacher Competencies

Assessing the "Rothstein Falsification Test": Does It Really Show Teacher Value-Added Models Are Biased? CEDR Working Paper No. 2012 1.3

Direct link

Goldhaber, Dan; Chaplin, Duncan – Center for Education Data & Research, 2012

In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…

Descriptors: School Effectiveness, Teacher Effectiveness, Achievement Gains, Statistical Bias

The Quest for Fairness in Language Testing

Peer reviewed

Direct link

Karami, Hossein – Educational Research and Evaluation, 2013

The search for fairness in language testing is distinct from other areas of educational measurement as the object of measurement, that is, language, is part of the identity of the test takers. So, a host of issues enter the scene when one starts to reflect on how to assess people's language abilities. As the quest for fairness in language testing…

Descriptors: Language Skills, Language Tests, Testing, Culture Fair Tests

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Assessment and Accountability…	6
Educational Measurement:…	4
International Review of…	3
Language Testing	3
New York State Education…	3
Music Educators Journal	2
New Meridian Corporation	2
American Behavioral Scientist	1
American Journal of Evaluation	1
Center for Assessment and…	1
Center for Education Data &…	1
Center on Standards and…	1
Chronicle of Higher Education	1
College Board	1
College Board Review	1
Communique	1
Congressional Research Service	1
ETS Research Report Series	1
Educational Assessment	1
Educational Research and…	1
Educational and Psychological…	1
Focus on Exceptional Children	1
Government Accountability…	1
International Education…	1
International Journal of…	1
More ▼

Herman, Joan L.	4
Dietel, Ronald	2
Goldschmidt, Pete	2
Heritage, Margaret	2
Leighton, Jacqueline P.	2
Osmundson, Ellen	2
Abedi, Jamal	1
Andrew P. Jaciw	1
Angela Johnson	1
Ashby, Cornelia M.	1
Baghaei, Purya	1
Banville, Dominique	1
Beller, Michal	1
Birenbaum, Menucha	1
Bonner, Cavan V.	1
Boone, William J.	1
Boyle, J. David	1
Brown, Fred	1
Brown, Robert T.	1
Camara, Wayne	1
Cancelli, Anthony A.	1
Capizzi, Andrea M.	1
Carter Grissom, Elizabeth	1
Castenell, Louis	1
Chaplin, Duncan	1
More ▼