ERIC - Search Results

Showing 1 to 15 of 86 results

Deep-IRT with Independent Student and Item Networks

Peer reviewed

Tsutsumi, Emiko; Kinoshita, Ryo; Ueno, Maomi – International Educational Data Mining Society, 2021

Knowledge tracing (KT), the task of tracking the knowledge state of each student over time, has been assessed actively by artificial intelligence researchers. Recent reports have described that Deep-IRT, which combines Item Response Theory (IRT) with a deep learning model, provides superior performance. It can express the abilities of each student…

Descriptors: Item Response Theory, Prediction, Accuracy, Artificial Intelligence

The Pattern of Test-Taking Effort across Items in Cognitive Ability Test: A Latent Class Analysis

Peer reviewed

Akhtar, Hanif – International Association for Development of the Information Society, 2022

When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…

Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes

Developing the Diagnostic Test of Misconceptions of Fractions

Peer reviewed

Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023

This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…

Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions

Lawson Classroom Test of Scientific Reasoning at Entrance University Level

Peer reviewed

Hrouzková, Tereza; Richterek, Lukáš – International Baltic Symposium on Science and Technology Education, 2021

The Lawson classroom test of scientific reasoning is a quite popular and widely used tool that measures the level and development of the student's scientific reasoning skills. In this contribution, the results of this test for the N=446 students of the Faculty of Science Palacký University Olomouc from the years 2018-2020 at the beginning of their…

Descriptors: Science Tests, Thinking Skills, Undergraduate Students, Science Education

Modeling Instructional Sensitivity of Vocational Competences via Differential Item Functioning (DIF) in Competence-Based Assessments

Peer reviewed

Klotz, Viola Katharina; Winther, Esther; Marx, Christian; Goeze, Annika; Fischer, Christoph; Sangmeister, Julia – AERA Online Paper Repository, 2016

Apprentices' performance after vocational educational training (VET) is commonly attributed to more or less effective training. This implies the assumption that learning is significantly affected by vocational instruction (instructional sensitivity). However, the question has not been investigated yet if VETs are effective, i.e., that they foster…

Descriptors: Vocational Education, Apprenticeships, Performance Based Assessment, Item Analysis

Getting out of Bed: Students' Beliefs

Watson, Jane; Callingham, Rosemary – Mathematics Education Research Group of Australasia, 2015

Responses of 223 students in grades 6 to 11 to questions related to beliefs about getting out of bed on the left side are analysed from two perspectives. On one hand the items explore subjective beliefs about chance. On the other hand the different wording and context of the items provide opportunity to show different levels of understanding of…

Descriptors: Beliefs, Student Attitudes, Difficulty Level, Item Response Theory

Population Invariance of Vertical Scaling Results

Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012

The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…

Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests

The Relative Difficulty Ratio--A Test and Item Index.

Frisbie, David A. – 1980

The development of a new technique, the Relative Difficulty Ratio (RDR), is described, as well as how it can be used to determine the difficulty level of a test so that meaningful inter-test difficulty comparisons can be made. Assumptions made in computing RDR include: 1) each item must be scored dichotomously with only one answer choice keyed as…

Descriptors: Difficulty Level, Item Analysis, Measurement Techniques, Scores

An Item-Level Analysis of "None of the Above."

Rich, Charles E.; Johanson, George A. – 1990

Despite the existence of little empirical evidence for their effectiveness, many techniques have been suggested for writing multiple-choice items. The option "none of the above" (NA) has been widely used although a recent review of empirical studies of NA suggests that, while generally decreasing the difficulty index, NA also decreases…

Descriptors: Difficulty Level, Item Analysis, Multiple Choice Tests, Test Construction

An Investigation of Cross-Cultural Stability in Mental Test Items.

Breland, Hunter M. – 1974

Examples of cross-cultural stability or instability of mental test items are illustrated. A statistical procedure involving the cross-plotting of item difficulties for two different groups and generating a line of mutual regression through the resulting scatter of points is described. D-values, representing the perpendicular distance, in delta…

Descriptors: Cross Cultural Studies, Difficulty Level, Item Analysis, Statistical Analysis

Identification of Nonuniform Differential Item Functioning Using a Variation of the Mantel-Haenszel Procedure.

Peer reviewed

Mazor, Kathleen M.; And Others – Educational and Psychological Measurement, 1994

A variation of the Mantel-Haenszel procedure is proposed that improves detection rates of nonuniform differential item functioning (DIF) without increasing the Type I error rate. The procedure, which is illustrated with simulated examinee responses, involves splitting the sample into low- and high-performing groups. (SLD)

Descriptors: Difficulty Level, Identification, Item Analysis, Item Bias
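
The splitting idea in the Mazor abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the record layout (group, total score, correct) and the cut score are assumptions made for illustration. It computes the standard Mantel-Haenszel common odds ratio over total-score strata, then repeats the calculation separately for low and high scorers.

```python
from collections import defaultdict


def mh_odds_ratio(records):
    """Mantel-Haenszel common odds ratio for one item.

    records: iterable of (group, total_score, correct), where group is
    "ref" or "focal" (hypothetical layout, assumed for this sketch).
    Examinees are stratified by total_score.
    """
    # Per stratum: [ref correct, ref incorrect, focal correct, focal incorrect]
    tables = defaultdict(lambda: [0, 0, 0, 0])
    for group, score, correct in records:
        idx = (0 if correct else 1) + (0 if group == "ref" else 2)
        tables[score][idx] += 1

    num = den = 0.0
    for a, b, c, d in tables.values():
        n = a + b + c + d
        num += a * d / n  # ref correct * focal incorrect
        den += b * c / n  # ref incorrect * focal correct
    return num / den if den else float("nan")


def split_mh(records, cut):
    """Run the MH estimate separately for low (<= cut) and high (> cut) scorers.

    The cut score is an assumption; the abstract does not say how the
    sample is split.
    """
    records = list(records)
    low = [r for r in records if r[1] <= cut]
    high = [r for r in records if r[1] > cut]
    return mh_odds_ratio(low), mh_odds_ratio(high)
```

When an item favors the reference group among low scorers and the focal group among high scorers, the two effects can cancel in the overall estimate (odds ratio near 1), while the two split estimates fall on opposite sides of 1.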

A General Model for Item Dependency.

Ackerman, Terry A.; Spray, Judith A. – 1986

A model of test item dependency is presented and used to illustrate the effect that violations of local independence have on the behavior of item characteristic curves. The dependency model is flexible enough to simulate the interaction of a number of factors including item difficulty and item discrimination, varying degrees of item dependence,…

Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models

Item Difficulty Reconsidered: An IRT Perspective.

Reckase, Mark D.; McKinley, Robert L. – 1984

A new indicator of item difficulty, which identifies effectiveness ranges, overcomes the limitations of other item difficulty indexes in describing the difficulty of an item or a test as a whole and in aiding the selection of appropriate ability level items for a test. There are three common uses of the term "item difficulty": (1) the probability…

Descriptors: Difficulty Level, Evaluation Methods, Item Analysis, Latent Trait Theory

A Comparison Study of the Unidimensional IRT Estimation of Compensatory and Noncompensatory Multidimensional Item Response Data.

Ackerman, Terry A. – 1987

Concern has been expressed over the item response theory (IRT) assumption that a person's ability can be estimated in a unidimensional latent space. To examine whether or not the response to an item requires only a single latent ability, unidimensional ability estimates were compared for data generated from the multidimensional item response…

Descriptors: Ability, Computer Simulation, Difficulty Level, Item Analysis

Item Bias Identification: A Comparison of Two Approaches.

Groome, Mary Lynn; Groome, William R. – 1979

Angoff's method for identifying possible biased test items was applied to four computer-generated hypothetical tests, two of which contained no biased items and two of which contained a few biased items. The tests were generated to match specifications of a latent trait model. Angoff's method compared item difficulty estimates for two different…

Descriptors: Difficulty Level, Identification, Item Analysis, Mathematical Models
