ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	5

Descriptor

Difficulty Level	14
Psychological Testing	14
Test Items	7
Psychometrics	6
Comparative Analysis	5
Item Response Theory	5
Test Construction	5
Educational Testing	4
Computer Assisted Testing	3
Elementary Education	3
Item Analysis	3
Mathematical Models	3
Benchmarking	2
Children	2
Cognitive Processes	2
Construct Validity	2
Correlation	2
Elementary School Students	2
Guessing (Tests)	2
Intelligence Tests	2
Neuropsychology	2
Preschool Education	2
Probability	2
Reaction Time	2
Regression (Statistics)	2
More ▼

Source

Educational and Psychological…	3
Assessment	2
Canadian Journal of Special…	1
College Board	1
Journal of Educational…	1
Psychometrika	1

Publication Type

Journal Articles	8
Reports - Research	6
Reports - Evaluative	4
Speeches/Meeting Papers	2
Information Analyses	1
Non-Print Media	1
Reference Materials - General	1
Reports - Descriptive	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

Canada

Laws, Policies, & Programs

Assessments and Surveys

Bender Visual Motor Gestalt…	1
Goodenough Harris Drawing Test	1
Raven Progressive Matrices	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Why Do Regular and Reversed Items Load on Separate Factors? Response Difficulty vs. Item Extremity

Peer reviewed

Direct link

Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023

When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…

Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

How Much Power and Speed Is Measured in This Test?

Peer reviewed

Direct link

Partchev, Ivailo; De Boeck, Paul; Steyer, Rolf – Assessment, 2013

An old issue in psychological assessment is to what extent power and speed each are measured by a given intelligence test. Starting from accuracy and response time data, an approach based on posterior time limits (cut-offs of recorded response time) leads to three kinds of recoded data: time data (whether or not the response precedes the cut-off),…

Descriptors: Psychological Testing, Intelligence Tests, Time, Item Response Theory

A Generalized Model with Internal Restrictions on Item Difficulty for Polytomous Items

Peer reviewed

Direct link

Wang, Wen-Chung; Jin, Kuan-Yu – Educational and Psychological Measurement, 2010

In this study, the authors extend the standard item response model with internal restrictions on item difficulty (MIRID) to fit polytomous items using cumulative logits and adjacent-category logits. Moreover, the new model incorporates discrimination parameters and is rooted in a multilevel framework. It is a nonlinear mixed model so that existing…

Descriptors: Difficulty Level, Test Items, Item Response Theory, Generalization

College and Career Readiness: An Initial Validation Argument

Download full text

Camara, Wayne – College Board, 2011

This presentation was presented at the 2011 National Conference on Student Assessment (CCSSO). The focus of this presentation is how to validate the common core state standards (CCSS) in math and ELA and the subsequent assessments that will be developed by state consortia. The CCSS specify the skills students need to be ready for post-secondary…

Descriptors: College Readiness, Career Readiness, Benchmarking, Student Evaluation

The Category Test: A Comparison of Computerized and Standard Versions.

Peer reviewed

Berger, Steven G.; And Others – Assessment, 1994

As part of a neuropsychological assessment, 95 adult patients completed either standard or computerized versions of the Category Test. Subjects who completed the computerized version exhibited more errors than those who completed the standard version, suggesting that it may be more difficult. (SLD)

Descriptors: Adults, Comparative Analysis, Computer Assisted Testing, Demography

Measuring the Influence of Complexity on Relational Reasoning

Peer reviewed

Direct link

Birney, Damian P.; Halford, Graeme S.; Andrews, Glenda – Educational and Psychological Measurement, 2006

Relational complexity (RC) theory conceptualizes an individual's processing capacity and a task's complexity along a common ordinal metric. The authors describe the development of the Latin Square Task (LST) that assesses the influence of RC on reasoning. The LST minimizes the role of knowledge and storage capacity and thus refines the…

Descriptors: Memory, Age Differences, Cognitive Processes, Psychometrics

Dunn, Denise A.; And Others – 1990

A study was conducted that attempted to show changes in electroencephalographic (EEG) patterns (identified using topographic EEG mapping) when children were required to perform the relatively simple task of button pressing during an eyes-open baseline session of low cognitive demand and a complex reaction time (RT) task of high cognitive demand.…

Descriptors: Brain Hemisphere Functions, Children, Cognitive Processes, Comparative Analysis

A Theoretical Study of the Measurement Effectiveness of Flexilevel Tests.

Download full text

Lord, Frederic M. – 1971

A flexilevel test is found to be inferior to a peaked conventional test for measuring examinees in the middle of the ability range, superior for examinees at the extremes. Throughout the entire range of ability, a flexilevel test is much superior to any conventional test that attempts to provide accurate measurement at both extremes. See also ED…

Descriptors: Ability, Comparative Analysis, Difficulty Level, Guessing (Tests)

The Equivalence of Scores from Automated and Conventional Educational and Psychological Tests: A Review of the Literature. College Board Report No. 88-8.

Mazzeo, John; Harvey, Anne L. – 1988

A literature review was conducted to determine the current state of knowledge concerning the effects of computer administration of standardized educational and psychological tests on the psychometric properties of these instruments. Students were grouped according to a number of factors relevant to the administration of tests by computer. Based on…

Descriptors: Comparative Analysis, Computer Assisted Testing, Difficulty Level, Educational Testing

Tailored Testing, An Application of Stochastic Approximation.

Download full text

Lord, Frederic M. – 1971

Some stochastic approximation procedures are considered in relation to the problem of choosing a sequence of test questions to accurately estimate a given examinee's standing on a psychological dimension. Illustrations are given evaluating certain procedures in a specific context. (Author/CK)

Descriptors: Academic Ability, Adaptive Testing, Computer Programs, Difficulty Level

Examining Differential Item Functioning Due to Item Difficulty and Alternative Attractiveness.

Peer reviewed

Westers, Paul; Kelderman, Henk – Psychometrika, 1992

A method for analyzing test-item responses is proposed to examine differential item functioning (DIF) in multiple-choice items within the latent class framework. Different models for detection of DIF are formulated, defining the subgroup as a latent variable. An efficient estimation method is described and illustrated. (SLD)

Descriptors: Chi Square, Difficulty Level, Educational Testing, Equations (Mathematics)

Component Identification and Item Difficulty of Raven's Matrices Items.

Download full text

Green, Kathy E.; Kluever, Raymond C. – 1991

Item components that might contribute to the difficulty of items on the Raven Colored Progressive Matrices (CPM) and the Standard Progressive Matrices (SPM) were studied. Subjects providing responses to CPM items were 269 children aged 2 years 9 months to 11 years 8 months, most of whom were referred for testing as potentially gifted. A second…

Descriptors: Academically Gifted, Children, Comparative Testing, Difficulty Level

Northwest Territories Inuit, and Urban and Rural Alberta Normative Data: A Final Note on the Re-Norming/versus Scoring Revision Issue.

Peer reviewed

Wilgosh, L.; And Others – Canadian Journal of Special Education, 1990

Item analysis data were collected for the Bender Visual Motor Gestalt Test and Goodenough-Harris Drawing Test, from urban and rural Alberta (Canada) youngsters and Inuit youngsters from the Northwest Territories (Canada). Both tests were inadequate in individual item difficulty levels, suggesting the necessity of revising scoring systems and…

Descriptors: Cultural Context, Difficulty Level, Elementary Education, Eskimos

Lord, Frederic M.	2
Andrews, Glenda	1
Berger, Steven G.	1
Birney, Damian P.	1
Camara, Wayne	1
De Boeck, Paul	1
Dunn, Denise A.	1
Green, Kathy E.	1
Halford, Graeme S.	1
Harvey, Anne L.	1
Jin, Kuan-Yu	1
Kam, Chester Chun Seng	1
Kelderman, Henk	1
Kluever, Raymond C.	1
Mazzeo, John	1
Partchev, Ivailo	1
Steyer, Rolf	1
Veldkamp, Bernard P.	1
Wang, Wen-Chung	1
Westers, Paul	1
Wilgosh, L.	1
More ▼