ERIC - Search Results

Publication Date

In 2026	0
Since 2025	10
Since 2022 (last 5 years)	44
Since 2017 (last 10 years)	136
Since 2007 (last 20 years)	253

Descriptor

Scoring	681
Test Reliability	681
Test Validity	409
Test Construction	214
Testing	138
Test Items	114
Test Interpretation	100
Scores	77
Higher Education	75
Psychometrics	72
Item Analysis	71
Language Tests	70
Multiple Choice Tests	69
Foreign Countries	68
Measurement Techniques	62
Writing Evaluation	62
Item Response Theory	61
Interrater Reliability	60
Computer Assisted Testing	54
Correlation	54
Elementary Secondary Education	54
Standardized Tests	52
Test Bias	51
Test Format	51
Testing Problems	51
More ▼

Education Level

Elementary Education	56
Secondary Education	48
Higher Education	41
Postsecondary Education	34
Middle Schools	30
Early Childhood Education	29
Junior High Schools	27
Elementary Secondary Education	25
Primary Education	23
Grade 3	20
High Schools	20
Grade 4	19
Grade 5	19
Grade 8	19
Intermediate Grades	19
Grade 7	18
Grade 6	17
Kindergarten	10
Grade 1	7
Grade 2	6
Preschool Education	6
Grade 9	5
Grade 11	4
Grade 10	3
Grade 12	2
More ▼

Audience

Practitioners	32
Researchers	19
Teachers	14
Administrators	9
Policymakers	6
Students	4
Counselors	1
Parents	1

Location

New York	16
California	11
Canada	9
Nebraska	8
Turkey	8
Florida	6
Pennsylvania	5
Vermont	5
Australia	4
Netherlands	4
United States	4
Germany	3
New Mexico	3
Texas	3
United Kingdom	3
United Kingdom (England)	3
Europe	2
Japan	2
New Jersey	2
Switzerland	2
Taiwan	2
Africa	1
Alabama	1
Algeria	1
Arizona	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	5
Elementary and Secondary…	2
No Child Left Behind Act 2001	2
Education Consolidation…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 681 results Save | Export

Investigation of Response Aggregation Methods in Divergent Thinking Assessments

Peer reviewed

Direct link

Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025

Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…

Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition

A Study on Psychometric Properties of Creativity Indices

Peer reviewed

Direct link

M. Arda Atakaya; Ugur Sak; M. Bahadir Ayas – Creativity Research Journal, 2024

Scoring in creativity research has been a central problem since creativity became an important issue in psychology and education in the 1950s. The current study examined the psychometric properties of 27 creativity indices derived from summed and averaged scores using 15 scoring methods. Participants included 2802 middle-school students. Data…

Descriptors: Psychometrics, Creativity, Creativity Tests, Scoring

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

Linking Errors Introduced by Rapid Guessing Responses When Employing Multigroup Concurrent IRT Scaling

Direct link

Jiayi Deng – ProQuest LLC, 2024

Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…

Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Do Scoring Techniques and Number of Choices Affect the Reliability of Multiple-Choice Tests in Elementary Schools?

Peer reviewed
PDF on ERIC

Download full text

Herwin, Herwin; Pristiwaluyo, Triyanto; Ruslan, Ruslan; Dahalan, Shakila Che – Cypriot Journal of Educational Sciences, 2022

The application of multiple-choice tests often does not consider the scoring technique and the number of choices. The study aims at describing the effect of the scoring technique and numerous options towards the reliability of multiple-choice objective tests on social subjects in elementary school. The study is quantitative research with…

Descriptors: Scoring, Multiple Choice Tests, Test Reliability, Elementary School Students

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Direct link

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Computational Concepts and Their Assessment in Preschool Students: An Empirical Study

Peer reviewed

Direct link

Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024

Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…

Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children

A Systematic Review of Early Writing Assessment Tools

Peer reviewed

Direct link

Katherine L. Buchanan; Milena Keller-Margulis; Amanda Hut; Weihua Fan; Sarah S. Mire; G. Thomas Schanding Jr. – Early Childhood Education Journal, 2025

There is considerable research regarding measures of early reading but much less in early writing. Nevertheless, writing is a critical skill for success in school and early difficulties in writing are likely to persist without intervention. A necessary step toward identifying those students who need additional support is the use of screening…

Descriptors: Writing Evaluation, Evaluation Methods, Emergent Literacy, Beginning Writing

Selecting Technically Adequate Tests

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

Preservice Teachers' Knowledge of Math Modeling: Initial Scale Development and Validation

Peer reviewed

Direct link

Reuben S. Asempapa; Doris Lee – Discover Education, 2025

Across the world, standards and practices for preparing teachers of mathematics emphasize the importance of math modeling (MM) in developing students' mathematical thinking. The aim of this research study was to develop the Mathematical Modeling Knowledge Scale (MAMKS), capable of determining preservice teachers' (PSTs') knowledge of MM. The study…

Descriptors: Preservice Teachers, Preservice Teacher Education, Mathematics Education, Mathematics Curriculum

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

A Review of Test Use: The Test Anxiety Inventory

Peer reviewed
PDF on ERIC

Download full text

Alatli, Betül – International Journal of Curriculum and Instruction, 2022

This study was conducted to review the use of tests. For this purpose, 45 articles in which the Turkish form of the "Test Anxiety Inventory (TAI)," which is one of the tests frequently used in the field of education, was employed and that were published between 2000 and 2020 were examined in terms of factors that should be considered in…

Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability

Assessing Handwriting in Preschool-Aged Children: Reliability and Internal Consistency of the "Just Write!" Tool

Peer reviewed

Direct link

Bolton, Tiffany; Stevenson, Brittney; Janes, William – Journal of Occupational Therapy, Schools & Early Intervention, 2023

Researchers utilized a cross-sectional secondary analysis of data within an ongoing non-randomized controlled trial study design to establish the reliability and internal consistency of a novel handwriting assessment for preschoolers, the Just Write! (JW), written by the authors. Seventy-eight children from an area preschool participated in the…

Descriptors: Handwriting, Writing Skills, Writing Evaluation, Preschool Children

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 46

Journal of Psychoeducational…	41
Educational and Psychological…	23
Journal of Educational…	17
ETS Research Report Series	13
New York State Education…	12
Psychology in the Schools	10
Canadian Journal of School…	8
Grantee Submission	8
Applied Measurement in…	7
Educational Measurement:…	7
Language Testing	7
Nebraska Department of…	7
Applied Psychological…	6
Online Submission	6
ProQuest LLC	5
Psychometrika	5
J Educ Meas	4
Measurement and Evaluation in…	4
Partnership for Assessment of…	4
Assessing Writing	3
Assessment in Education:…	3
European Journal of…	3
Evaluation and the Health…	3
Journal of Autism and…	3
Journal of Consulting and…	3
More ▼

Schoen, Robert C.	7
McCrimmon, Adam W.	6
White, Edward M.	6
Livingston, Samuel A.	5
Breland, Hunter M.	4
Frary, Robert B.	4
Koretz, Daniel	4
Stansfield, Charles W.	4
Yang, Xiaotong	4
Anderson, Daniel	3
Attali, Yigal	3
Bauduin, Charity	3
Crocker, Linda	3
Echternacht, Gary	3
Guthrie, P. D.	3
Hambleton, Ronald K.	3
Paek, Insu	3
Reilly, Richard R.	3
Rippey, Robert M.	3
Sabers, Darrell L.	3
Shavelson, Richard J.	3
Thompson, Bruce	3
Allen, Abigail A.	2
Anderson, Paul S.	2
More ▼

Reports - Research	305
Journal Articles	294
Reports - Evaluative	146
Speeches/Meeting Papers	91
Reports - Descriptive	55
Tests/Questionnaires	53
Guides - Non-Classroom	38
Numerical/Quantitative Data	34
Information Analyses	20
Opinion Papers	17
Books	10
Guides - General	10
Book/Product Reviews	9
Guides - Classroom - Teacher	7
Reports - General	6
Dissertations/Theses -…	5
Reference Materials -…	5
Collected Works - General	3
ERIC Publications	2
Guides - Classroom - Learner	2
Collected Works - Proceedings	1
ERIC Digests in Full Text	1
Historical Materials	1
Reference Materials -…	1
More ▼

Wechsler Intelligence Scale…	12
SAT (College Admission Test)	10
Test of English as a Foreign…	9
Graduate Record Examinations	8
ACT Assessment	6
National Assessment of…	5
Torrance Tests of Creative…	5
Advanced Placement…	4
General Educational…	4
Goodenough Harris Drawing Test	4
Wechsler Individual…	4
Beery Developmental Test of…	3
Bender Gestalt Test	3
Kaufman Test of Educational…	3
McCarthy Scales of Childrens…	3
Medical College Admission Test	3
Wechsler Adult Intelligence…	3
Wechsler Preschool and…	3
Woodcock Johnson Tests of…	3
ACT Interest Inventory	2
Adaptive Behavior Scale	2
Clinical Evaluation of…	2
Developmental Indicators for…	2
Developmental Test of Visual…	2
Graduate Management Admission…	2
More ▼