NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers1
Location
Canada1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Veldkamp, Bernard P. – Journal of Educational Measurement, 2016
Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…
Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level
Peer reviewed Peer reviewed
Direct linkDirect link
Partchev, Ivailo; De Boeck, Paul; Steyer, Rolf – Assessment, 2013
An old issue in psychological assessment is to what extent power and speed each are measured by a given intelligence test. Starting from accuracy and response time data, an approach based on posterior time limits (cut-offs of recorded response time) leads to three kinds of recoded data: time data (whether or not the response precedes the cut-off),…
Descriptors: Psychological Testing, Intelligence Tests, Time, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung; Jin, Kuan-Yu – Educational and Psychological Measurement, 2010
In this study, the authors extend the standard item response model with internal restrictions on item difficulty (MIRID) to fit polytomous items using cumulative logits and adjacent-category logits. Moreover, the new model incorporates discrimination parameters and is rooted in a multilevel framework. It is a nonlinear mixed model so that existing…
Descriptors: Difficulty Level, Test Items, Item Response Theory, Generalization
Camara, Wayne – College Board, 2011
This presentation was presented at the 2011 National Conference on Student Assessment (CCSSO). The focus of this presentation is how to validate the common core state standards (CCSS) in math and ELA and the subsequent assessments that will be developed by state consortia. The CCSS specify the skills students need to be ready for post-secondary…
Descriptors: College Readiness, Career Readiness, Benchmarking, Student Evaluation
Peer reviewed Peer reviewed
Berger, Steven G.; And Others – Assessment, 1994
As part of a neuropsychological assessment, 95 adult patients completed either standard or computerized versions of the Category Test. Subjects who completed the computerized version exhibited more errors than those who completed the standard version, suggesting that it may be more difficult. (SLD)
Descriptors: Adults, Comparative Analysis, Computer Assisted Testing, Demography
Peer reviewed Peer reviewed
Direct linkDirect link
Birney, Damian P.; Halford, Graeme S.; Andrews, Glenda – Educational and Psychological Measurement, 2006
Relational complexity (RC) theory conceptualizes an individual's processing capacity and a task's complexity along a common ordinal metric. The authors describe the development of the Latin Square Task (LST) that assesses the influence of RC on reasoning. The LST minimizes the role of knowledge and storage capacity and thus refines the…
Descriptors: Memory, Age Differences, Cognitive Processes, Psychometrics
Dunn, Denise A.; And Others – 1990
A study was conducted that attempted to show changes in electroencephalographic (EEG) patterns (identified using topographic EEG mapping) when children were required to perform the relatively simple task of button pressing during an eyes-open baseline session of low cognitive demand and a complex reaction time (RT) task of high cognitive demand.…
Descriptors: Brain Hemisphere Functions, Children, Cognitive Processes, Comparative Analysis
Lord, Frederic M. – 1971
A flexilevel test is found to be inferior to a peaked conventional test for measuring examinees in the middle of the ability range, superior for examinees at the extremes. Throughout the entire range of ability, a flexilevel test is much superior to any conventional test that attempts to provide accurate measurement at both extremes. See also ED…
Descriptors: Ability, Comparative Analysis, Difficulty Level, Guessing (Tests)
Mazzeo, John; Harvey, Anne L. – 1988
A literature review was conducted to determine the current state of knowledge concerning the effects of computer administration of standardized educational and psychological tests on the psychometric properties of these instruments. Students were grouped according to a number of factors relevant to the administration of tests by computer. Based on…
Descriptors: Comparative Analysis, Computer Assisted Testing, Difficulty Level, Educational Testing
Lord, Frederic M. – 1971
Some stochastic approximation procedures are considered in relation to the problem of choosing a sequence of test questions to accurately estimate a given examinee's standing on a psychological dimension. Illustrations are given evaluating certain procedures in a specific context. (Author/CK)
Descriptors: Academic Ability, Adaptive Testing, Computer Programs, Difficulty Level
Peer reviewed Peer reviewed
Westers, Paul; Kelderman, Henk – Psychometrika, 1992
A method for analyzing test-item responses is proposed to examine differential item functioning (DIF) in multiple-choice items within the latent class framework. Different models for detection of DIF are formulated, defining the subgroup as a latent variable. An efficient estimation method is described and illustrated. (SLD)
Descriptors: Chi Square, Difficulty Level, Educational Testing, Equations (Mathematics)
Green, Kathy E.; Kluever, Raymond C. – 1991
Item components that might contribute to the difficulty of items on the Raven Colored Progressive Matrices (CPM) and the Standard Progressive Matrices (SPM) were studied. Subjects providing responses to CPM items were 269 children aged 2 years 9 months to 11 years 8 months, most of whom were referred for testing as potentially gifted. A second…
Descriptors: Academically Gifted, Children, Comparative Testing, Difficulty Level
Peer reviewed Peer reviewed
Wilgosh, L.; And Others – Canadian Journal of Special Education, 1990
Item analysis data were collected for the Bender Visual Motor Gestalt Test and Goodenough-Harris Drawing Test, from urban and rural Alberta (Canada) youngsters and Inuit youngsters from the Northwest Territories (Canada). Both tests were inadequate in individual item difficulty levels, suggesting the necessity of revising scoring systems and…
Descriptors: Cultural Context, Difficulty Level, Elementary Education, Eskimos