NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers27
Laws, Policies, & Programs
Elementary and Secondary…1
What Works Clearinghouse Rating
Showing 1 to 15 of 86 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tsutsumi, Emiko; Kinoshita, Ryo; Ueno, Maomi – International Educational Data Mining Society, 2021
Knowledge tracing (KT), the task of tracking the knowledge state of each student over time, has been assessed actively by artificial intelligence researchers. Recent reports have described that Deep-IRT, which combines Item Response Theory (IRT) with a deep learning model, provides superior performance. It can express the abilities of each student…
Descriptors: Item Response Theory, Prediction, Accuracy, Artificial Intelligence
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Akhtar, Hanif – International Association for Development of the Information Society, 2022
When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…
Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hrouzková, Tereza; Richterek, Lukáš – International Baltic Symposium on Science and Technology Education, 2021
The Lawson classroom test of scientific reasoning is a quite popular and widely used tool that measures the level and development of the student's scientific reasoning skills. In this contribution, the results of this test for the N=446 students of the Faculty of Science Palacký University Olomouc from the years 2018-2020 at the beginning of their…
Descriptors: Science Tests, Thinking Skills, Undergraduate Students, Science Education
Peer reviewed Peer reviewed
Direct linkDirect link
Klotz, Viola Katharina; Winther, Esther; Marx, Christian; Goeze, Annika; Fischer, Christoph; Sangmeister, Julia – AERA Online Paper Repository, 2016
Apprentices' performance after vocational educational training (VET) is commonly attributed to more or less effective training. This implies the assumption that learning is significantly affected by vocational instruction (instructional sensitivity). However, the question has not been investigated yet if VETs are effective, i.e., that they foster…
Descriptors: Vocational Education, Apprenticeships, Performance Based Assessment, Item Analysis
Watson, Jane; Callingham, Rosemary – Mathematics Education Research Group of Australasia, 2015
Responses of 223 students in grades 6 to 11 to questions related to beliefs about getting out of bed on the left side are analysed from two perspectives. On one hand the items explore subjective beliefs about chance. On the other hand the different wording and context of the items provide opportunity to show different levels of understanding of…
Descriptors: Beliefs, Student Attitudes, Difficulty Level, Item Response Theory
Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012
The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…
Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests
PDF pending restoration PDF pending restoration
Frisbie, David A. – 1980
The development of a new technique, the Relative Difficulty Ratio (RDR), is described, as well as how it can be used to determine the difficulty level of a test so that meaningful inter-test difficulty comparisons can be made. Assumptions made in computing RDR include: 1) each item must be scored dichotomously with only one answer choice keyed as…
Descriptors: Difficulty Level, Item Analysis, Measurement Techniques, Scores
Rich, Charles E.; Johanson, George A. – 1990
Despite the existence of little empirical evidence for their effectiveness, many techniques have been suggested for writing multiple-choice items. The option "none of the above" (NA) has been widely used although a recent review of empirical studies of NA suggests that, while generally decreasing the difficulty index, NA also decreases…
Descriptors: Difficulty Level, Item Analysis, Multiple Choice Tests, Test Construction
Breland, Hunter M. – 1974
Examples of cross-cultural stability or instability of mental test items are illustrated. A statistical procedure involving the cross-plotting of item difficulties for two different groups and generating a line of mutual regression through the resulting scatter of points is described. D-values, representing the perpendicular distance, in delta…
Descriptors: Cross Cultural Studies, Difficulty Level, Item Analysis, Statistical Analysis
Peer reviewed Peer reviewed
Mazor, Kathleen M.; And Others – Educational and Psychological Measurement, 1994
A variation of the Mantel Haenszel procedure is proposed that improves detection rates of nonuniform differential item functioning (DIF) without increasing the Type I error rate. The procedure, which is illustrated with simulated examinee responses, involves splitting the sample into low- and high-performing groups. (SLD)
Descriptors: Difficulty Level, Identification, Item Analysis, Item Bias
Ackerman, Terry A.; Spray, Judith A. – 1986
A model of test item dependency is presented and used to illustrate the effect that violations of local independence have on the behavior of item characteristic curves. The dependency model is flexible enough to simulate the interaction of a number of factors including item difficulty and item discrimination, varying degrees of item dependence,…
Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models
PDF pending restoration PDF pending restoration
Reckase, Mark D.; McKinley, Robert L. – 1984
A new indicator of item difficulty, which identifies effectiveness ranges, overcomes the limitations of other item difficulty indexes in describing the difficulty of an item or a test as a whole and in aiding the selection of appropriate ability level items for a test. There are three common uses of the term "item difficulty": (1) the probability…
Descriptors: Difficulty Level, Evaluation Methods, Item Analysis, Latent Trait Theory
Ackerman, Terry A. – 1987
Concern has been expressed over the item response theory (IRT) assumption that a person's ability can be estimated in a unidimensional latent space. To examine whether or not the response to an item requires only a single latent ability, unidimensional ability estimates were compared for data generated from the multidimensional item response…
Descriptors: Ability, Computer Simulation, Difficulty Level, Item Analysis
Groome, Mary Lynn; Groome, William R. – 1979
Angoff's method for identifying possible biased test items was applied to four computer-generated hypothetical tests, two of which contained no biased items and two of which contained a few biased items. The tests were generated to match specifications of a latent trait model. Angoff's method compared item difficulty estimates for two different…
Descriptors: Difficulty Level, Identification, Item Analysis, Mathematical Models
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6