ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	21
Since 2007 (last 20 years)	62

Descriptor

Statistical Analysis	82
Test Validity	82
Test Reliability	45
Item Response Theory	41
Foreign Countries	26
Test Construction	25
Test Items	25
Correlation	19
Factor Analysis	19
Psychometrics	18
Multiple Choice Tests	14
Questionnaires	14
Difficulty Level	13
Feedback (Response)	13
Response Style (Tests)	13
Item Analysis	11
College Students	10
Comparative Analysis	10
Rating Scales	10
Responses	10
Scores	10
Goodness of Fit	9
Models	9
English (Second Language)	8
Measurement Techniques	8

Publication Type

Reports - Research	62
Journal Articles	59
Tests/Questionnaires	9
Reports - Evaluative	5
Reports - Descriptive	4
Dissertations/Theses -…	3
Books	1
Collected Works - General	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Higher Education	24
Postsecondary Education	17
Elementary Education	14
Secondary Education	14
Middle Schools	5
Grade 8	4
Junior High Schools	4
Early Childhood Education	3
High Schools	3
Elementary Secondary Education	2
Grade 1	2
Grade 7	2
Grade 9	2
Intermediate Grades	2
Kindergarten	2
Preschool Education	2
Primary Education	2
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Two Year Colleges	1

Location

Australia	4
California	4
Germany	4
Turkey	3
Colorado	2
Iran	2
United Kingdom	2
Africa	1
Asia	1
Brazil	1
Canada	1
Colombia	1
Europe	1
Illinois (Chicago)	1
Iowa	1
Japan (Tokyo)	1
Jordan	1
New Zealand	1
Nigeria	1
North America	1
North Carolina	1
Norway	1
Pennsylvania	1
Saudi Arabia	1
Slovenia	1

Assessments and Surveys

Graduate Record Examinations	2
Trends in International…	2
Defining Issues Test	1
Family Assessment Device	1
Metropolitan Readiness Tests	1
Test of English as a Foreign…	1
Woodcock Johnson Tests of…	1

Showing 1 to 15 of 82 results

Detecting Aberrant Response Behavior with Nonparametric Method: Mokken and PerFit Packages in RStudio

Peer reviewed

Sengül Avsar, Asiye – Measurement: Interdisciplinary Research and Perspectives, 2020

In order to reach valid and reliable test scores, various test theories have been developed, and one of them is nonparametric item response theory (NIRT). Mokken Models are the most widely known NIRT models which are useful for small samples and short tests. Mokken Package is useful for Mokken Scale Analysis. An important issue about validity is…

Descriptors: Response Style (Tests), Nonparametric Statistics, Item Response Theory, Test Validity

Characterizing the Latent Classes in a Mixture IRT Model Using DIF

Peer reviewed

Karadavut, Tugba – Applied Measurement in Education, 2021

Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…

Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics

Attending to General and Mathematics-Specific Dimensions of Teaching: Exploring Factors across Two Observation Instruments

Peer reviewed

Blazar, David; Braslow, David; Charalambous, Charalambos Y.; Hill, Heather C. – Educational Assessment, 2017

New systems that seek to evaluate teachers with regard to their classroom quality often rely on observation instruments that capture general instructional pedagogies. However, decades of research suggest that content-specific dimensions of instruction also are important to differentiate teachers and improve student outcomes. We explore the degree…

Descriptors: Mathematics Instruction, Teaching Methods, Observation, Factor Analysis

Reliability and Validity of the Research Methods Skills Assessment

Peer reviewed
PDF on ERIC

Smith, Tamarah; Smith, Samantha – International Journal of Teaching and Learning in Higher Education, 2018

The Research Methods Skills Assessment (RMSA) was created to measure psychology majors' statistics knowledge and skills. The American Psychological Association's Guidelines for the Undergraduate Major in Psychology (APA, 2007, 2013) served as a framework for development. Results from a Rasch analysis with data from n = 330 undergraduates showed…

Descriptors: Psychology, Statistics, Undergraduate Students, Item Response Theory

Assessing Academic Language in an Elementary Mathematics Teacher Licensure Exam

Peer reviewed
PDF on ERIC

Castellano, Katherine E.; Duckor, Brent; Wihardini, Diah; Téllez, Kip; Wilson, Mark – Teacher Education Quarterly, 2016

With the adoption by most states of the Common Core State Standards (CCSS) for English language arts and literacy and for mathematics (CCSS Initiative, 2010a, 2010b) come major changes in public education that will affect instructional practice, curriculum, and assessment across the nation. Heritage, Walqui, and Linquanti (2015) argued that the…

Descriptors: Elementary School Mathematics, Mathematics Teachers, Teacher Certification, Language Usage

Concurrent Validity and Sensitivity to Change of Direct Behavior Rating Single-Item Scales (DBR-SIS) within an Elementary Sample

Peer reviewed

Smith, Rhonda L.; Eklund, Katie; Kilgus, Stephen P. – School Psychology Quarterly, 2018

The purpose of this study was to evaluate the concurrent validity, sensitivity to change, and teacher acceptability of Direct Behavior Rating single-item scales (DBR-SIS), a brief progress monitoring measure designed to assess student behavioral change in response to intervention. Twenty-four elementary teacher-student dyads implemented a daily…

Descriptors: Behavior Rating Scales, Test Validity, Progress Monitoring, Student Behavior

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Measuring Multidimensional Subjective Well-Being with the I COPPE Scale in a Hispanic Sample

Peer reviewed

Myers, Nicholas D.; Park, Sung Eun; Lefevor, G. Tyler; Dietz, Samantha; Prilleltensky, Isaac; Prado, Guillermo J. – Measurement in Physical Education and Exercise Science, 2016

The purpose of this study was to provide initial validity evidence for measuring multidimensional subjective well-being in a Hispanic sample with the Interpersonal, Community, Occupational, Physical, Psychological, Economic (I COPPE) Scale. Participants were 641 English-speaking adults who self-identified as Hispanic. Bi-factor analyses were used…

Descriptors: Well Being, Comparative Analysis, Correlation, Hispanic Americans

Making Better Tests with the Rasch Measurement Model

Peer reviewed
PDF on ERIC

Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018

This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…

Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy

Study of Bias in 2012-Placement Test through Rasch Model in Terms of Gender Variable

Peer reviewed
PDF on ERIC

Turkan, Azmi; Cetin, Bayram – Journal of Education and Practice, 2017

Validity and reliability are among the most crucial characteristics of a test. One of the steps to make sure that a test is valid and reliable is to examine the bias in test items. The purpose of this study was to examine the bias in 2012 Placement Test items in terms of gender variable using Rasch Model in Turkey. The sample of this study was…

Descriptors: Item Response Theory, Gender Differences, Test Bias, Test Items

The "Test of Financial Literacy": Development and Measurement Characteristics

Peer reviewed

Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017

The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…

Descriptors: Tests, Money Management, Literacy, High School Students

The Development and Validation of the Student Response System Benefit Scale

Peer reviewed

Hooker, J. F.; Denker, K. J.; Summers, M. E.; Parker, M. – Journal of Computer Assisted Learning, 2016

Previous research into the benefits that student response systems (SRS) have brought into the classroom revealed that SRS can contribute positively to student experiences. However, while the benefits of SRS have been conceptualized and operationalized into a widely cited scale, the validity of this scale had not been tested. Furthermore,…

Descriptors: Technology Uses in Education, Factor Analysis, Audience Response Systems, Handheld Devices

Assessing Perceived Emotional Intelligence in Adolescents: New Validity Evidence of Trait Meta-Mood Scale-24

Peer reviewed

Pedrosa, Ignacio; Suárez-Álvarez, Javier; Lozano, Luis M.; Muñiz, José; García-Cueto, Eduardo – Journal of Psychoeducational Assessment, 2014

Adolescence is a critical period of life during which significant psychosocial adjustment occurs and in which emotional intelligence plays an essential role. This article provides validity evidence for the Trait Meta-Mood Scale-24 (TMMS-24) scores based on an item response theory (IRT) approach. A sample of 2,693 Spanish adolescents (M = 16.52…

Descriptors: Foreign Countries, Adolescents, Secondary School Students, Emotional Intelligence

Stepping Outside the Normed Sample: Implications for Validity

Peer reviewed

Hays, Danica G.; Wood, Chris – Measurement and Evaluation in Counseling and Development, 2017

We present considerations for validity when a population outside of a normed sample is assessed and those data are interpreted. Using a career group counseling example exploring life satisfaction changes as evidenced by the Quality of Life Inventory (Frisch, 1994), we showcase qualitative and quantitative approaches to explore how normative data…

Descriptors: Data Interpretation, Scores, Quality of Life, Life Satisfaction

University Students' Conceptual Knowledge of Randomness and Probability in the Contexts of Evolution and Mathematics

Peer reviewed

Fiedler, Daniela; Tröbst, Steffen; Harms, Ute – CBE - Life Sciences Education, 2017

Students of all ages face severe conceptual difficulties regarding key aspects of evolution (the central, unifying, and overarching theme in biology). Aspects strongly related to abstract "threshold" concepts like randomness and probability appear to pose particular difficulties. A further problem is the lack of an appropriate instrument…

Descriptors: College Students, Concept Formation, Probability, Evolution

Source

ETS Research Report Series	3
Journal of Psychoeducational…	3
ProQuest LLC	3
CBE - Life Sciences Education	2
Cogent Education	2
Educational Assessment	2
Educational and Psychological…	2
Eurasian Journal of…	2
International Journal of…	2
Journal of Education and…	2
Language Assessment Quarterly	2
Measurement and Evaluation in…	2
Measurement in Physical…	2
Online Submission	2
Psychometrika	2
School Psychology Quarterly	2
Applied Measurement in…	1
Asia-Pacific Journal of…	1
Center for Educational Policy…	1
Chemistry Education Research…	1
Communication Monographs	1
Developmental Psychology	1
Didakometry	1
Educ Psychol Meas	1
Education Sciences	1

Author

Graf, Edith Aurora	2
Liu, Ou Lydia	2
Adkins, Dorothy C.	1
Alavi, Seyed Mohammad	1
Alhaythami, Hassan	1
Allen, Sandra	1
Aron, Arthur	1
Aron, Elaine N.	1
Ashraf, Hamid	1
Assary, Elham	1
Baghaei, Purya	1
Barbera, Jack	1
Bartoletti, Robin	1
Bejar, Isaac I.	1
Bello, Samira Abdullahi	1
Belur, Vinetha	1
Benton, Stephen L.	1
Berman, Ye'Elah	1
Bichi, Ado Abdu	1
Blazar, David	1
Bolden, Edward	1
Boone, William J.	1
Braslow, David	1
Brown, Ron	1