Showing 1 to 15 of 57 results
Peer reviewed
PDF on ERIC Download full text
Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study reviewed the use of tests. For this purpose, 45 articles published between 2000 and 2020 that employed the Turkish form of the "Test Anxiety Inventory (TAI)," one of the tests frequently used in the field of education, were examined in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Peer reviewed
Direct link
Pamela R. Buckley; Katie Massey Combs; Karen M. Drewelow; Brittany L. Hubler; Marion Amanda Lain – Evaluation Review, 2025
As evidence-based interventions are scaled, fidelity of implementation, and thus effectiveness, often wanes. Validated fidelity measures can improve researchers' ability to attribute outcomes to the intervention and help practitioners feel more confident in implementing the intervention as intended. We aim to provide a model for the validation of…
Descriptors: Middle School Students, Middle School Teachers, Evidence Based Practice, Program Development
Peer reviewed
Direct link
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Peer reviewed
Direct link
Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – International Journal of Science Education, 2019
This article provides a validation framework for research on the development and use of science Learning Progressions (LPs). The framework describes how evidence from various sources can be used to establish an interpretive argument and a validity argument at five stages of LP research--development, scoring, generalisation, extrapolation, and use.…
Descriptors: Sequential Approach, Educational Research, Science Education, Validity
Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – Grantee Submission, 2019
This article provides a validation framework for research on the development and use of science Learning Progressions (LPs). The framework describes how evidence from various sources can be used to establish an interpretive argument and a validity argument at five stages of LP research--development, scoring, generalisation, extrapolation, and use.…
Descriptors: Sequential Approach, Educational Research, Science Education, Validity
Peer reviewed
PDF on ERIC Download full text
Ketterlin-Geller, Leanne R.; Perry, Lindsey; Platas, Linda M.; Sitbakhan, Yasmin – Global Education Review, 2018
Test scoring procedures should align with the intended uses and interpretations of test results. In this paper, we examine three test scoring procedures for an operational assessment of early numeracy, the Early Grade Mathematics Assessment (EGMA). The EGMA is an assessment that tests young children's foundational mathematics knowledge and has…
Descriptors: Alignment (Education), Scoring, Test Use, Mathematics Tests
Peer reviewed
Direct link
Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018
In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…
Descriptors: Test Validity, Test Use, Test Construction, Language Tests
Peer reviewed
PDF on ERIC Download full text
Carlson, Sarah E.; Seipel, Ben; Biancarosa, Gina; Davison, Mark L.; Clinton, Virginia – Grantee Submission, 2019
This demonstration introduces and presents an innovative online cognitive diagnostic assessment, developed to identify the types of cognitive processes that readers use during comprehension; specifically, processes that distinguish between subtypes of struggling comprehenders. Cognitive diagnostic assessments are designed to provide valuable…
Descriptors: Reading Comprehension, Standardized Tests, Diagnostic Tests, Computer Assisted Testing
Peer reviewed
Direct link
Wilcox, Bethany R.; Caballero, Marcos D.; Baily, Charles; Sadaghiani, Homeyra; Chasteen, Stephanie V.; Ryan, Qing X.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
The use of validated conceptual assessments alongside conventional course exams to measure student learning in introductory courses has become standard practice in many physics departments. These assessments provide a more standard measure of certain learning goals, allowing for comparisons of student learning across instructors, semesters,…
Descriptors: Student Evaluation, Physics, Tests, Advanced Courses
Peer reviewed
Direct link
Hatala, Rose; Cook, David A.; Brydges, Ryan; Hawkins, Richard – Advances in Health Sciences Education, 2015
In order to construct and evaluate the validity argument for the Objective Structured Assessment of Technical Skills (OSATS), based on Kane's framework, we conducted a systematic review. We searched MEDLINE, EMBASE, CINAHL, PsycINFO, ERIC, Web of Science, Scopus, and selected reference lists through February 2013. Working in duplicate, we selected…
Descriptors: Measures (Individuals), Test Validity, Surgery, Skills
Peer reviewed
Direct link
Jin, Tan; Mak, Barley; Zhou, Pei – Language Testing, 2012
The fuzziness of assessing second language speaking performance raises two difficulties in scoring speaking performance: "indistinction between adjacent levels" and "overlap between scales". To address these two problems, this article proposes a new approach, "confidence scoring", to deal with such fuzziness, leading to "confidence" scores between…
Descriptors: Speech Communication, Scoring, Test Interpretation, Second Language Learning
Peer reviewed
Direct link
Cheng, Liying; DeLuca, Christopher – Educational Assessment, 2011
Test-takers' interpretations of validity as related to test constructs and test use have been widely debated in large-scale language assessment. This study contributes further evidence to this debate by examining 59 test-takers' perspectives in writing large-scale English language tests. Participants wrote about their test-taking experiences in…
Descriptors: Language Tests, Test Validity, Test Use, English
Peer reviewed
Dorn, Fred J.; Jereb, Ron – Measurement and Evaluation in Counseling and Development, 1985
Counseling psychologists (N=8) recorded the time needed to score the Counselor Rating Form (CRF) and the CRF-Quick Score (CRF-QS). Then 120 students viewed a counseling videotape and completed both measures. Results showed the two are comparable but the CRF-QS is significantly less time-consuming to score. (JAC)
Descriptors: Counselor Evaluation, Evaluation Methods, Scoring, Test Use
Smith, Douglas K.; And Others – 1986
The study examined the relationship between performance on the K-ABC (Kaufman Assessment Battery for Children) and the WISC-R (Wechsler Intelligence Scale for Children--Revised) for 67 students being considered for placement in a private school in a midwestern metropolitan area that serves students with severe learning disabilities. All were…
Descriptors: Elementary Education, Intelligence Quotient, Learning Disabilities, Scoring
Peer reviewed
Gelin, Michaela N.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2003
Investigated potentially biased scale items on the Center for Epidemiological Studies Depression scale (CES-D; Radloff, 1977) in a sample of 600 adults. Overall, results indicate that the scoring method has an effect on differential item functioning (DIF), and that DIF is a property of the item, scoring method, and purpose of the assessment. (SLD)
Descriptors: Depression (Psychology), Item Bias, Scoring, Test Items