ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	10

Descriptor

Student Evaluation	19
Test Reliability	19
Test Theory	19
Test Validity	18
Evaluation Methods	10
Psychometrics	6
Test Construction	6
Test Bias	5
Measures (Individuals)	4
Scientific Concepts	4
Standardized Tests	4
Test Interpretation	4
Test Items	4
Test Use	4
Achievement Tests	3
College Science	3
Educational Assessment	3
Educational Research	3
Measurement Techniques	3
Multiple Choice Tests	3
Science Instruction	3
Ability	2
Classroom Observation…	2
Computer Assisted Testing	2
Difficulty Level	2
More ▼

Source

Alberta Journal of…	2
Annual Review of Applied…	1
Asian Journal of Education…	1
Educational Research and…	1
Grantee Submission	1
International Journal of…	1
Journal of Chemical Education	1
Journal of Science Education…	1
Marketing Education Review	1
Peabody Journal of Education	1
Physical Review Physics…	1
ProQuest LLC	1
Review of Educational Research	1
More ▼

Publication Type

Journal Articles	12
Reports - Research	9
Information Analyses	4
Books	2
Reports - Descriptive	2
Collected Works - General	1
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Opinion Papers	1
Reports - Evaluative	1
Speeches/Meeting Papers	1
More ▼

Education Level

Higher Education	6
Postsecondary Education	5
High Schools	2
Adult Education	1
Early Childhood Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 1	1
Grade 2	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Practitioners	2
Students	1
Teachers	1

Location

Australia	1
Turkey (Ankara)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Evidence for Validity and Reliability of a Research-Based Assessment Instrument on Measurement Uncertainty

Peer reviewed

Direct link

Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H.? J. Lewandowski – Physical Review Physics Education Research, 2024

The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…

Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics

The Development and Validation of the Planet Formation Concept Inventory

Peer reviewed

Direct link

Simon, Molly N.; Prather, Edward E.; Buxner, Sanlyn R.; Impey, Chris D. – International Journal of Science Education, 2019

The discovery and characterisation of planets orbiting distant stars has shed light on the origin of our own Solar System. It is important that college-level introductory astronomy students have a general understanding of the planet formation process before they are able to draw parallels between extrasolar systems and our own Solar System. In…

Descriptors: Measures (Individuals), Test Validity, Test Reliability, Student Evaluation

Using Generalizability Theory to Assess the Score Reliability of Communication Skills of Dentistry Students

Peer reviewed
PDF on ERIC

Download full text

Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018

The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…

Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability

"TechCheck": Development and Validation of an Unplugged Assessment of Computational Thinking in Early Childhood Education

Peer reviewed

Direct link

Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020

There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…

Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education

Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities

Peer reviewed
PDF on ERIC

Download full text

Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015

For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…

Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests

A Psychometric Analysis of the Chemical Concepts Inventory

Peer reviewed

Direct link

Barbera, Jack – Journal of Chemical Education, 2013

The Chemical Concepts Inventory (CCI) is a multiple-choice instrument designed to assess the alternate conceptions of students in high school or first-semester college chemistry. The instrument was published in 2002 along with an analysis of its data from a test population. This study supports the initial analysis and expands on the psychometric…

Descriptors: Science Instruction, Secondary School Science, High Schools, College Science

A "Conditional" Sense of Fairness in Assessment

Peer reviewed

Direct link

Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013

Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…

Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics

An Innovative Excel Application to Improve Exam Reliability in Marketing Courses

Peer reviewed

Direct link

Keller, Christopher M.; Kros, John F. – Marketing Education Review, 2011

Measures of survey reliability are commonly addressed in marketing courses. One statistic of reliability is "Cronbach's alpha." This paper presents an application of survey reliability as a reflexive application of multiple-choice exam validation. The application provides an interactive decision support system that incorporates survey item…

Descriptors: Test Validity, Marketing, Test Reliability, Multiple Choice Tests

Evaluating Alignment between Curriculum, Assessment, and Instruction

Peer reviewed

Direct link

Martone, Andrea; Sireci, Stephen G. – Review of Educational Research, 2009

The authors (a) discuss the importance of alignment for facilitating proper assessment and instruction, (b) describe the three most common methods for evaluating the alignment between state content standards and assessments, (c) discuss the relative strengths and limitations of these methods, and (d) discuss examples of applications of each…

Descriptors: Teaching Methods, Alignment (Education), Student Evaluation, Curriculum Development

The Development of a Digital Logic Concept Inventory

Direct link

Herman, Geoffrey Lindsay – ProQuest LLC, 2011

Instructors in electrical and computer engineering and in computer science have developed innovative methods to teach digital logic circuits. These methods attempt to increase student learning, satisfaction, and retention. Although there are readily accessible and accepted means for measuring satisfaction and retention, there are no widely…

Descriptors: Grounded Theory, Delphi Technique, Concept Formation, Misconceptions

Testing and Teaching: Partners in Learning.

Peer reviewed

Ward, James Gordon – Peabody Journal of Education, 1981

Teachers need valid information to judge the types of programs, instruction, and colleges best suited to students. Teachers appear to support the use of standardized tests to provides some of that information. Abolishing such tests may lead to dependence on more subjective measures, resulting in inequities in placement and selection. (FG)

Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, Standardized Tests

Developments in Language Testing.

Peer reviewed

Douglas, Dan – Annual Review of Applied Linguistics, 1995

Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…

Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests

A Re-Examination of the Behavioral Categories of Seven Behavior Rating Instruments: A Conceptual Analysis. A Final Research Report.

Download full text

Bullock, Lyndal M.; And Others – 1988

Prompted by the increased use of behavior rating instruments in educational environments and evidence of confusion over the interpretation of labels designating behavior clusters, the present two-phase study analyzed 410 specific items contained in seven behavior rating instruments and investigated whether these items could be intuitively sorted…

Descriptors: Behavior Disorders, Behavior Rating Scales, Classroom Observation Techniques, Elementary Secondary Education

Toward Improving Assessment of Students with Special Needs: Expanding the Data Base to Include Classroom Performance.

Peer reviewed

Bachor, Dan G. – Alberta Journal of Educational Research, 1990

Reviews medical and psychoeducational assessment models, illustrating need to expand assessment procedure database resulting from the measurement errors in traditional assessment practices. Introduces assessment model emphasizing collection of classroom-based information over time and evaluating data more critically. Suggests model facilitates…

Descriptors: Classroom Observation Techniques, Evaluation Methods, Evaluation Problems, Instructional Development

Controlling Rater Stringency Error in Clinical Performance Rating: Further Validation of a Performance Rating Theory.

Cason, Gerald J.; And Others – 1983

Prior research in a single clinical training setting has shown Cason and Cason's (1981) simplified model of their performance rating theory can improve rating reliability and validity through statistical control of rater stringency error. Here, the model was applied to clinical performance ratings of 14 cohorts (about 250 students and 200 raters)…

Descriptors: Clinical Experience, Error of Measurement, Evaluation Methods, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Aktas, Mehtap	1
Asiret, Semih	1
Bachor, Dan G.	1
Barbera, Jack	1
Bers, Marina Umaschi	1
Bullock, Lyndal M.	1
Buxner, Sanlyn R.	1
Cason, Gerald J.	1
Cheng, Britte H.	1
Colker, Alexis M.	1
DeBarger, Angela	1
Douglas, Dan	1
Gayle Geschwind	1
Gravel, Jenna	1
H.? J. Lewandowski	1
Haertel, Geneva	1
Herman, Geoffrey Lindsay	1
Impey, Chris D.	1
Janda, Louis H.	1
Keller, Christopher M.	1
Kros, John F.	1
Linn, Robert L., Ed.	1
Marcos D. Caballero	1
Martone, Andrea	1
More ▼