ERIC - Search Results

Showing all 8 results

The Riddle Knowledge Inference Test (R-Kit)

Peer reviewed

Rochat, Nicolas; Lima, Laurent; Bressoux, Pascal – Journal of Psychoeducational Assessment, 2025

Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…

Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

"TechCheck": Development and Validation of an Unplugged Assessment of Computational Thinking in Early Childhood Education

Peer reviewed

Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020

There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…

Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Measuring Teaching Best Practice in the Induction Years: Development and Validation of an Item-Level Assessment

Peer reviewed

Kingsley, Laurie; Romine, William – European Journal of Educational Research, 2014

Schools and teacher induction programs around the world routinely assess teaching best practice to inform accreditation, tenure/promotion, and professional development decisions. Routine assessment is also necessary to ensure that teachers entering the profession get the assistance they need to develop and succeed. We introduce the Item-Level…

Descriptors: Test Construction, Test Validity, Beginning Teacher Induction, Best Practices

Classification Accuracy in Key Stage 2 National Curriculum Tests in England

Peer reviewed

He, Qingping; Hayes, Malcolm; Wiliam, Dylan – Research Papers in Education, 2013

The accuracy of the results of the national tests in English, mathematics and science taken by 11-year-olds in England has been a matter of much debate since their introduction in 1994, with estimates of the proportion of students incorrectly classified varying from 10 to 30%. Using live data from the 2009 and 2010 administration of the national…

Descriptors: Foreign Countries, National Curriculum, Accuracy, Classification

Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards

Item-Level and Construct Evaluation of Early Numeracy Curriculum-Based Measures

Peer reviewed

Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012

The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…

Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten
