ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	51
Since 2006 (last 20 years)	107

Descriptor

Item Analysis	197
Statistical Analysis	197
Test Items	197
Difficulty Level	60
Test Construction	60
Foreign Countries	52
Comparative Analysis	43
Scores	43
Test Validity	38
Test Bias	32
Item Response Theory	30
Correlation	29
English (Second Language)	28
Mathematical Models	27
Achievement Tests	26
Language Tests	26
Multiple Choice Tests	26
Test Reliability	26
Latent Trait Theory	25
Psychometrics	25
Second Language Learning	25
Goodness of Fit	23
Simulation	21
Factor Analysis	20
Mathematics Tests	16
More ▼

Publication Type

Reports - Research	162
Journal Articles	117
Speeches/Meeting Papers	31
Reports - Evaluative	14
Tests/Questionnaires	12
Reports - Descriptive	8
Dissertations/Theses -…	5
Numerical/Quantitative Data	3
Guides - Non-Classroom	2
Collected Works - Proceedings	1
Guides - General	1
Information Analyses	1
Opinion Papers	1
Reports - General	1
More ▼

Education Level

Higher Education	40
Postsecondary Education	31
Secondary Education	16
Elementary Education	13
High Schools	6
Middle Schools	6
Elementary Secondary Education	5
Grade 8	5
Grade 4	4
Grade 5	4
Grade 3	3
Junior High Schools	3
Preschool Education	3
Grade 9	2
Intermediate Grades	2
Adult Education	1
Early Childhood Education	1
Grade 12	1
Grade 6	1
Grade 7	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Researchers

Location

Turkey	7
Japan	4
India	3
Iran	3
Israel	3
Australia	2
Germany	2
Hong Kong	2
Massachusetts	2
Nigeria	2
Poland	2
Sweden	2
United States	2
Bosnia and Herzegovina	1
Botswana	1
Canada	1
China	1
China (Beijing)	1
Colombia	1
Delaware	1
Florida	1
France	1
Italy	1
Jordan	1
Malaysia	1
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Showing 1 to 15 of 197 results Save | Export

Dimension-Corrected Somers' D for the Item Analysis Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

A new index of item discrimination power (IDP), dimension-corrected Somers' D (D2) is proposed. Somers' D is one of the superior alternatives for item-total- (Rit) and item-rest correlation (Rir) in reflecting the real IDP with items with scales 0/1 and 0/1/2, that is, up to three categories. D also reaches the extreme value +1 and -1 correctly…

Descriptors: Item Analysis, Correlation, Test Items, Simulation

Generalized Discrimination Index

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Kelley's Discrimination Index (DI) is a simple and robust, classical non-parametric short-cut to estimate the item discrimination power (IDP) in the practical educational settings. Unlike item-total correlation, DI can reach the ultimate values of +1 and -1, and it is stable against the outliers. Because of the computational easiness, DI is…

Descriptors: Test Items, Computation, Item Analysis, Nonparametric Statistics

Assess Robustness of the Rasch Mixture Model to Detect Differential Item Functioning -- A Monte Carlo Study

Direct link

Jinjin Huang – ProQuest LLC, 2020

Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the latent trait varies consistently across subgroups; in other words, the mean differences among subgroups are only due to true latent ability differences. Differential item functioning (DIF) occurs when measurement invariance is violated.…

Descriptors: Robustness (Statistics), Item Response Theory, Test Items, Item Analysis

Examining the Impact of Covariates on Anchor Tests to Ascertain Quality over Time in a College Admissions Test

Peer reviewed

Direct link

Wiberg, Marie; von Davier, Alina A. – International Journal of Testing, 2017

We propose a comprehensive procedure for the implementation of a quality control process of anchor tests for a college admissions test with multiple consecutive administrations. We propose to examine the anchor tests and their items in connection with covariates to investigate if there was any unusual behavior in the anchor test results over time…

Descriptors: College Entrance Examinations, Test Items, Equated Scores, Quality Control

Reliability and Validity of the Research Methods Skills Assessment

Peer reviewed
PDF on ERIC

Download full text

Smith, Tamarah; Smith, Samantha – International Journal of Teaching and Learning in Higher Education, 2018

The Research Methods Skills Assessment (RMSA) was created to measure psychology majors' statistics knowledge and skills. The American Psychological Association's Guidelines for the Undergraduate Major in Psychology (APA, 2007, 2013) served as a framework for development. Results from a Rasch analysis with data from n = 330 undergraduates showed…

Descriptors: Psychology, Statistics, Undergraduate Students, Item Response Theory

Easier Said than Done: Rejoinder on Sijtsma and on Green and Yang

Peer reviewed

Direct link

Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016

The main points of Sijtsma and Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions are at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…

Descriptors: Educational Assessment, Reliability, Validity, Test Construction

An Empirical Investigation of the Potential Impact of Item Misfit on Test Scores. Research Report. ETS RR-17-60

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Robin, Frederic – ETS Research Report Series, 2017

In this study, we examined the potential impact of item misfit on the reported scores of an admission test from the subpopulation invariance perspective. The target population of the test consisted of 3 major subgroups with different geographic regions. We used the logistic regression function to estimate item parameters of the operational items…

Descriptors: Scores, Test Items, Test Bias, International Assessment

How Does Polytomous Item Bias Affect Total-Group Survey Score Comparisons?

Peer reviewed

Direct link

Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017

The growing use of scales in survey questionnaires warrants the need to address how does polytomous differential item functioning (DIF) affect observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…

Descriptors: Test Items, Test Bias, Item Response Theory, Surveys

Assessing Adults' Career Exploration: Development and Validation of the Vocational and Maternal Identity Exploration Scales

Peer reviewed

Direct link

Gross-Spector, Michal; Cinamon, Rachel Gali – Journal of Career Development, 2018

To promote our theoretical understanding regarding the exploration process during adulthood, the current study focusses on this process as it relates to work and family life roles and the relations between them, during the transition to motherhood. Two instruments assessing vocational and maternal exploration, relating to self and environment…

Descriptors: Adults, Career Exploration, Career Development, Family Work Relationship

Evaluation of Different Scoring Rules for a Noncognitive Test in Development. Research Report. ETS RR-16-03

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016

In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…

Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics

The Need, Development, and Validation of the Innovation Test Instrument

Peer reviewed
PDF on ERIC

Download full text

Wheadon, Jacob; Wright, Geoff A.; West, Richard E.; Skaggs, Paul – Journal of Technology Education, 2017

This study discusses the need, development, and validation of the Innovation Test Instrument (ITI). This article outlines how the researchers identified the content domain of the assessment and created test items. Then, it describes initial validation testing of the instrument. The findings suggest that the ITI is a good first step in creating an…

Descriptors: Innovation, Program Validation, Evaluation Needs, Test Construction

Developing a Learning Progression for Number Sense Based on the Rule Space Model in China

Peer reviewed

Direct link

Chen, Fu; Yan, Yue; Xin, Tao – Educational Psychology, 2017

The current study focuses on developing the learning progression of number sense for primary school students, and it applies a cognitive diagnostic model, the rule space model, to data analysis. The rule space model analysis firstly extracted nine cognitive attributes and their hierarchy model from the analysis of previous research and the…

Descriptors: Numeracy, Learning Processes, Elementary School Students, Foreign Countries

Sources of Difficulty in Assessment: Example of PISA Science Items

Peer reviewed

Direct link

Le Hebel, Florence; Montpied, Pascale; Tiberghien, Andrée; Fontanieu, Valérie – International Journal of Science Education, 2017

The understanding of what makes a question difficult is a crucial concern in assessment. To study the difficulty of test questions, we focus on the case of PISA, which assesses to what degree 15-year-old students have acquired knowledge and skills essential for full participation in society. Our research question is to identify PISA science item…

Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students

Examining the Reliability and Validity of a Turkish Version of the Community of Inquiry Survey

Peer reviewed
PDF on ERIC

Download full text

Olpak, Yusuf Ziya; Kiliç Çakmak, Ebru – Online Learning, 2018

The aim of this study was to describe the validity and reliability of a Turkish language version of the CoI survey developed by Arbaugh et al. (2008). Data were obtained from 1150 students enrolled in online courses in various departments in three Turkish state universities. The data were randomly divided into two parts: the first part was…

Descriptors: Foreign Countries, Test Reliability, Test Validity, Student Surveys

Reasoning with Pseudowords: How Properties of Novel Verbal Stimuli Influence Item Difficulty and Linguistic-Group Score Differences on Cognitive Ability Assessments

Direct link

Agnello, Paul – ProQuest LLC, 2018

Pseudowords (words that are not real but resemble real words in a language) have been used increasingly as a technique to reduce contamination due to construct-irrelevant variance in assessments of verbal fluid reasoning (Gf). However, despite pseudowords being researched heavily in other psychology sub-disciplines, they have received little…

Descriptors: Scores, Intelligence Tests, Difficulty Level, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 14

ETS Research Report Series	14
Educational and Psychological…	12
Journal of Educational…	7
Language Testing	5
ProQuest LLC	5
Applied Psychological…	3
International Journal of…	3
International Journal of…	3
CBE - Life Sciences Education	2
Educational Research and…	2
European Journal of Physics…	2
International Journal of…	2
International Journal of…	2
Journal of Education and…	2
Journal of Educational…	2
Journal of Educational and…	2
Language Assessment Quarterly	2
Large-scale Assessments in…	2
Online Submission	2
Practical Assessment,…	2
Psychometrika	2
Studies in Second Language…	2
Accounting Education	1
Advances in Language and…	1
African Journal of Research…	1
More ▼

Dorans, Neil J.	4
Reckase, Mark D.	4
Benson, Jeri	2
Brown, James Dean	2
Gómez-Benito, Juana	2
Ironson, Gail H.	2
Kim, Sooyeon	2
Kostin, Irene	2
Lesniewska, Justyna	2
Livingston, Samuel A.	2
McKinley, Robert L.	2
Merz, William R.	2
Metsämuuronen, Jari	2
Moses, Tim	2
Phillips, Gary W.	2
Rudner, Lawrence M.	2
Scheuneman, Janice	2
Yen, Wendy M.	2
von Davier, Matthias	2
Abedi, Jamal	1
Abramzon, Andrea	1
Acar, Tülin	1
Adedoyin, O. O.	1
Adeleke, A. A.	1
More ▼

SAT (College Admission Test)	5
Test of English as a Foreign…	5
Graduate Record Examinations	4
Trends in International…	4
International English…	2
Program for International…	2
ACT Assessment	1
California Achievement Tests	1
Comprehensive Tests of Basic…	1
Florida Comprehensive…	1
Goodenough Harris Drawing Test	1
Iowa Tests of Basic Skills	1
Law School Admission Test	1
Metropolitan Achievement Tests	1
Metropolitan Readiness Tests	1
National Assessment of…	1
Praxis Series	1
Stanford Binet Intelligence…	1
Test of English for…	1
Test of Standard Written…	1
United States Medical…	1
More ▼