ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	18

Descriptor

Reliability	21
Test Bias	21
Validity	16
Scores	7
Student Evaluation	7
Construct Validity	5
English (Second Language)	5
Factor Analysis	5
Federal Legislation	5
Second Language Learning	5
Test Construction	5
Test Selection	5
College Students	4
Correlation	4
Educational Assessment	4
Factor Structure	4
Teacher Evaluation	4
Academic Accommodations…	3
Accuracy	3
Achievement Tests	3
English Language Learners	3
Evaluation Methods	3
Guidelines	3
Measurement	3
Psychometrics	3
More ▼

Source

Assessment and Accountability…	7
Educational and Psychological…	2
Measurement and Evaluation in…	2
AASA Journal of Scholarship &…	1
ETS Research Report Series	1
Educational Assessment	1
Educational Measurement:…	1
Educational Testing Service	1
International Journal of…	1
International Journal of…	1
ProQuest LLC	1
More ▼

Publication Type

Journal Articles	10
Reports - Research	8
Reports - Descriptive	7
Guides - Non-Classroom	3
Reports - Evaluative	2
Dissertations/Theses -…	1
Information Analyses	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	8
Higher Education	5
Postsecondary Education	4
Grade 8	3
Secondary Education	2
Elementary Education	1
Grade 4	1
Grade 5	1
Grade 7	1
Grade 9	1
High Schools	1
More ▼

Audience

Administrators	1
Practitioners	1

Location

California	1
Hong Kong	1
Michigan (Detroit)	1
United States	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Beck Depression Inventory	1
Iowa Tests of Basic Skills	1
Motivated Strategies for…	1
Stanford Achievement Tests	1
Students Evaluation of…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Digital Module 12: Think-Aloud Interviews and Cognitive Labs https://ncme.elevate.commpartners.com

Peer reviewed

Direct link

Leighton, Jacqueline P.; Lehman, Blair – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jacqueline Leighton and Dr. Blair Lehman review differences between think-aloud interviews to measure problem-solving processes and cognitive labs to measure comprehension processes. Learners are introduced to historical, theoretical, and procedural differences between these methods and how to use and analyze…

Descriptors: Protocol Analysis, Interviews, Problem Solving, Cognitive Processes

Mountain or Molehill? A Simulation Study on the Impact of Response Styles

Peer reviewed

Direct link

Plieninger, Hansjörg – Educational and Psychological Measurement, 2017

Even though there is an increasing interest in response styles, the field lacks a systematic investigation of the bias that response styles potentially cause. Therefore, a simulation was carried out to study this phenomenon with a focus on applied settings (reliability, validity, scale scores). The influence of acquiescence and extreme response…

Descriptors: Response Style (Tests), Test Bias, Item Response Theory, Correlation

Pilot Evaluation of the Computer-Based Assessment of Non-Cognitive Attributes of Health Professionals (CANA-HP)

Direct link

Sara Faye Maher – ProQuest LLC, 2020

To meet the needs of complex and/or underserved patient populations, health care professionals must possess diverse backgrounds, qualities, and skill sets. Holistic review has been used to diversify student admissions through examination of non-cognitive attributes of health care applicants. The objective of this study was to develop a novel…

Descriptors: Computer Assisted Testing, Pilot Projects, Measures (Individuals), Reliability

Administrators Gaming Test- and Observation-Based Teacher Evaluation Methods: To Conform To or Confront the System

Peer reviewed

Direct link

Geiger, Tray J.; Amrein-Beardsley, Audrey – AASA Journal of Scholarship & Practice, 2017

In this commentary, we discuss three types of data manipulations that can occur within teacher evaluation methods: artificial inflation, artificial deflation, and artificial conflation. These types of manipulation are more popularly known in the education profession as instances of Campbell's Law (1976), which states that the higher the…

Descriptors: Teacher Evaluation, Evaluation Methods, Data Analysis, Personnel Policy

Applying Longitudinal Mean and Covariance Structures (LMACS) Analysis to Assess Construct Stability Over Two Time Points: An Example Using Psychological Entitlement

Peer reviewed

Direct link

Bashkov, Bozhidar M.; Finney, Sara J. – Measurement and Evaluation in Counseling and Development, 2013

Traditional methods of assessing construct stability are reviewed and longitudinal mean and covariance structures (LMACS) analysis, a modern approach, is didactically illustrated using psychological entitlement data. Measurement invariance and latent variable stability results are interpreted, emphasizing substantive implications for educators and…

Descriptors: Statistical Analysis, Longitudinal Studies, Reliability, Psychological Patterns

Investigating ESL Students' Performance on Outcomes Assessments in Higher Education

Peer reviewed

Direct link

Lakin, Joni M.; Elliott, Diane Cardenas; Liu, Ou Lydia – Educational and Psychological Measurement, 2012

Outcomes assessments are gaining great attention in higher education because of increased demand for accountability. These assessments are widely used by U.S. higher education institutions to measure students' college-level knowledge and skills, including students who speak English as a second language (ESL). For the past decade, the increasing…

Descriptors: College Outcomes Assessment, Achievement Tests, English Language Learners, College Students

Person Heterogeneity of the BDI-II-C and Its Effects on Dimensionality and Construct Validity: Using Mixture Item Response Models

Peer reviewed

Direct link

Wu, Pei-Chen; Huang, Tsai-Wei – Measurement and Evaluation in Counseling and Development, 2010

This study was to apply the mixed Rasch model to investigate person heterogeneity of Beck Depression Inventory-II-Chinese version (BDI-II-C) and its effects on dimensionality and construct validity. Person heterogeneity was reflected by two latent classes that differ qualitatively. Additionally, person heterogeneity adversely affected the…

Descriptors: Construct Validity, Validity, Depression (Psychology), Item Response Theory

Adaptation and Analysis of Motivated Strategies for Learning Questionnaire in the Chinese Setting

Peer reviewed

Direct link

Lee, John Chi-kin; Yin, Hongbiao; Zhang, Zhonghua – International Journal of Testing, 2010

This article reports the adaptation and analysis of Pintrich's Motivated Strategies for Learning Questionnaire (MSLQ) in Hong Kong. First, this study examined the psychometric qualities of the existing Chinese version of MSLQ (MSLQ-CV). Based on this examination, this study developed a revised Chinese version of MSLQ (MSLQ-RCV) for junior…

Descriptors: Foreign Countries, Questionnaires, Psychometrics, Secondary School Students

Guidance for Developing and Selecting Assessments of Student Growth for Use in Teacher Evaluation Systems (Extended Version)

Download full text

Herman, Joan L.; Heritage, Margaret; Goldschmidt, Pete – Assessment and Accountability Comprehensive Center, 2011

States and districts across the country are grappling with how to incorporate assessments of student learning into their teacher evaluation systems. Sophisticated statistical models have been proposed to estimate the relative value individual teachers add to their students' assessment performance (hence the term teacher "value-added" measures).…

Descriptors: Teacher Evaluation, Testing, Test Selection, Test Construction

Developing and Selecting Assessments of Student Growth for Use in Teacher Evaluation Systems

Download full text

Herman, Joan L.; Heritage, Margaret; Goldschmidt, Pete – Assessment and Accountability Comprehensive Center, 2011

Descriptors: Teacher Evaluation, Testing, Test Selection, Test Construction

Evaluation of the Technical Adequacy of Evidence of Assessments of English Language Proficiency: Body of Evidence Summary

Download full text

Assessment and Accountability Comprehensive Center, 2007

This body of evidence summary reports the results of the evaluation of technical evidence in support of the California English Language Development Test (CELDT), as analyzed against a validated list of technical adequacy criteria. The table presented in this paper outlines the types of validity, reliability, and bias and sensitivity evidence…

Descriptors: Evidence, Validity, Language Acquisition, Language Proficiency

Validity and Fairness of State Standards-Based Assessments for English Language Learners

Peer reviewed

Direct link

Young, John W.; Cho, Yeonsuk; Ling, Guangming; Cline, Fred; Steinberg, Jonathan; Stone, Elizabeth – Educational Assessment, 2008

English language learners (ELLs) constitute one of the fastest growing subpopulations of students in the United States. It is important to determine whether the assessments used by states in determining students' proficiencies are valid and fair for ELLs. This study focused on several standards-based assessments in mathematics and science…

Descriptors: Testing Accommodations, State Standards, Word Lists, Construct Validity

Benchmark Assessment for Improved Learning. An AACC Policy Brief

Download full text

Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010

The No Child Left Behind Act of 2001 (NCLB, 2002) has produced an explosion of interest in the use of assessment to measure and improve student learning. Initially focused on annual state tests, educators quickly learned that results came too little and too late to identify students who were falling behind. At the same time, evidence from the…

Descriptors: Federal Legislation, Formative Evaluation, Benchmarking, Educational Assessment

Score Comparability for Language Minority Students on the Content Assessments Used by Two States. Research Report. ETS RR-11-27

Download full text

Young, John W.; Holtzman, Steven; Steinberg, Jonathan – Educational Testing Service, 2011

In this research investigation of score comparability for language minority students (English language learners [ELLs] and former English language learners), we examined 3 indicators of score comparability (reliability, internal test structure, and differential item functioning) for 4th and 8th grade students who took the NCLB-mandated content…

Descriptors: Language Minorities, Second Language Learning, Grade 8, Minority Group Students

Benchmark Assessment for Improved Learning. AACC Report

Download full text

Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010

This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…

Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment

Previous Page | Next Page »

Pages: 1 | 2

Herman, Joan L.	4
Dietel, Ronald	2
Gallagher, Carole	2
Goldschmidt, Pete	2
Heritage, Margaret	2
Lagunoff, Rachel	2
Osmundson, Ellen	2
Sato, Edynn	2
Steinberg, Jonathan	2
Worth, Peter	2
Young, John W.	2
Amrein-Beardsley, Audrey	1
Bashkov, Bozhidar M.	1
Cho, Yeonsuk	1
Cline, Fred	1
Crane, Eric	1
Ebel, Robert L.	1
Elliott, Diane Cardenas	1
Finney, Sara J.	1
Geiger, Tray J.	1
Holtzman, Steven	1
Huang, Tsai-Wei	1
Lakin, Joni M.	1
Lee, Jihyun	1
Lee, John Chi-kin	1
More ▼