ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Error Patterns	7
Statistical Analysis	7
Test Items	7
Evaluation Methods	3
Accuracy	2
Computation	2
Computer Assisted Testing	2
Correlation	2
Foreign Countries	2
Item Response Theory	2
Multiple Choice Tests	2
Scores	2
Simulation	2
Test Format	2
Age Differences	1
Cognitive Processes	1
Communication Disorders	1
Comparative Analysis	1
Difficulty Level	1
English (Second Language)	1
Equations (Mathematics)	1
Evaluation Research	1
Gender Differences	1
Goodness of Fit	1
Intelligent Tutoring Systems	1
More ▼

Source

Educational and Psychological…	2
Education Sciences	1
Grantee Submission	1
International Education…	1
International Journal of…	1
Journal of Experimental…	1

Author

DeMars, Christine E.	1
Ganzfried, Sam	1
Haenschel, Corinna	1
Jackson, Margaret C.	1
Khaksefidi, Saman	1
Kim, Eun Sook	1
Kriegeskorte, Nikolaus	1
Lee, Taehun	1
Linden, David E. J.	1
Roberts, Mark V.	1
Sinharay, Sandip	1
Socha, Alan	1
Valdez, Alfred	1
Yoon, Myeongsun	1
Yusuf, Farzana	1
More ▼

Publication Type

Reports - Research	7
Journal Articles	6
Tests/Questionnaires	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Iran	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Optimal Weighting for Exam Composition

Peer reviewed
PDF on ERIC

Download full text

Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018

A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…

Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests

The Psychological Effect of Errors in Standardized Language Test Items on EFL Students' Responses to the Following Item

Peer reviewed
PDF on ERIC

Download full text

Khaksefidi, Saman – International Education Studies, 2017

This study investigates the psychological effect of a wrong question with wrong items on answering to the next question in a test of structure. Forty students selected through stratified random sampling are given 15 questions of a standardized test namely a TOEFL structure test in which questions number 7 and number 11 are wrong and their answers…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Statistical Analysis

Peer reviewed

Direct link

Jackson, Margaret C.; Linden, David E. J.; Roberts, Mark V.; Kriegeskorte, Nikolaus; Haenschel, Corinna – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2015

A number of studies have shown that visual working memory (WM) is poorer for complex versus simple items, traditionally accounted for by higher information load placing greater demands on encoding and storage capacity limits. Other research suggests that it may not be complexity that determines WM performance per se, but rather increased…

Descriptors: Visual Perception, Short Term Memory, Test Items, Cognitive Processes

An Investigation of Sample Size Splitting on ATFIND and DIMTEST

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013

Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…

Descriptors: Sample Size, Test Length, Correlation, Test Format

Student Metacognitive Monitoring: Predicting Test Achievement from Judgment Accuracy

Peer reviewed
PDF on ERIC

Download full text

Valdez, Alfred – International Journal of Higher Education, 2013

Metacognitive monitoring processes have been shown to be critical determinants of human learning. Metacognitive monitoring consist of various knowledge estimates that enable learners to engage in self-regulatory processes important for both the acquisition of knowledge and the monitoring of one's knowledge when engaged in assessment. This study…

Descriptors: Metacognition, Accuracy, Correlation, Validity

Testing Measurement Invariance Using MIMIC: Likelihood Ratio Test with a Critical Value Adjustment

Peer reviewed

Direct link

Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012

Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…

Descriptors: Test Items, Simulation, Testing, Statistical Analysis