Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 29 |
| Since 2007 (last 20 years) | 60 |
Descriptor
| Statistics | 90 |
| Test Items | 90 |
| Item Response Theory | 27 |
| Foreign Countries | 22 |
| Difficulty Level | 21 |
| Mathematics Tests | 17 |
| Higher Education | 16 |
| Item Analysis | 15 |
| Probability | 15 |
| Scores | 15 |
| Computer Assisted Testing | 14 |
Audience
| Practitioners | 5 |
| Teachers | 3 |
| Researchers | 2 |
Location
| Australia | 5 |
| Turkey | 3 |
| Canada | 2 |
| Indonesia | 2 |
| Netherlands | 2 |
| New York | 2 |
| United States | 2 |
| Asia | 1 |
| California | 1 |
| China | 1 |
| Cyprus | 1 |
Sinharay, Sandip – Grantee Submission, 2021
Drasgow, Levine, and Zickar (1996) suggested a statistic based on the Neyman-Pearson lemma (e.g., Lehmann & Romano, 2005, p. 60) for detecting preknowledge on a known set of items. The statistic is a special case of the optimal appropriateness indices of Levine and Drasgow (1988) and is the most powerful statistic for detecting item…
Descriptors: Robustness (Statistics), Hypothesis Testing, Statistics, Test Items
Sherwin E. Balbuena – Online Submission, 2024
This study introduces a new chi-square test statistic for testing the equality of response frequencies among distracters in multiple-choice tests. The formula uses the information from the number of correct answers and wrong answers, which becomes the basis of calculating the expected values of response frequencies per distracter. The method was…
Descriptors: Multiple Choice Tests, Statistics, Test Validity, Testing
Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023
In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…
Descriptors: Test Items, Scores, Goodness of Fit, Statistics
Ramadhani, Rahmi; Saragih, Sahat; Napitupulu, E. Elvis – Mathematics Teaching Research Journal, 2022
Statistical reasoning ability is one of the essential skills in developing competence, which is one of the Sustainable Development Goals (SDGs). This study aims to explore the statistical reasoning ability of junior high school students in descriptive statistics learning. The investigation directs students to determine their level of statistical…
Descriptors: Statistics, Thinking Skills, Statistics Education, Junior High School Students
Michelle Cheung; Bronwyn Reid O’Connor; Ben Zunica – Mathematics Education Research Group of Australasia, 2024
Progressing from additive to multiplicative thinking is a key outcome of school mathematics, making ratios an essential topic of study in junior secondary. In this study, 15 Australian Year 8 students were administered a ratio test followed by semi-structured interviews to explore their conceptions of ratio prior to formal instruction. In this…
Descriptors: Secondary School Students, Mathematics Instruction, Foreign Countries, Multiplication
Su, Shiyang; Wang, Chun; Weiss, David J. – Educational and Psychological Measurement, 2021
S-X[superscript 2] is a popular item fit index that is available in commercial software packages such as flexMIRT. However, no research has systematically examined the performance of S-X[superscript 2] for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was…
Descriptors: Statistics, Goodness of Fit, Test Items, Models
Köhler, Carmen; Robitzsch, Alexander; Hartig, Johannes – Journal of Educational and Behavioral Statistics, 2020
Testing whether items fit the assumptions of an item response theory model is an important step in evaluating a test. In the literature, numerous item fit statistics exist, many of which show severe limitations. The current study investigates the root mean squared deviation (RMSD) item fit statistic, which is used for evaluating item fit in…
Descriptors: Test Items, Goodness of Fit, Statistics, Bias
Olsho, Alexis; Smith, Trevor I.; Eaton, Philip; Zimmerman, Charlotte; Boudreaux, Andrew; White Brahmia, Suzanne – Physical Review Physics Education Research, 2023
We developed the Physics Inventory of Quantitative Literacy (PIQL) to assess students' quantitative reasoning in introductory physics contexts. The PIQL includes several "multiple-choice-multiple-response" (MCMR) items (i.e., multiple-choice questions for which more than one response may be selected) as well as traditional single-response…
Descriptors: Multiple Choice Tests, Science Tests, Physics, Measures (Individuals)
Fox, Jean-Paul; Marianti, Sukaesi – Journal of Educational Measurement, 2017
Response accuracy and response time data can be analyzed with a joint model to measure ability and speed of working, while accounting for relationships between item and person characteristics. In this study, person-fit statistics are proposed for joint models to detect aberrant response accuracy and/or response time patterns. The person-fit tests…
Descriptors: Accuracy, Reaction Time, Statistics, Test Items
Durak, Ismail; Karagoz, Yalcin – International Journal of Assessment Tools in Education, 2021
The aim of this study is to adapt the Statistics Anxiety Scale (SAS) developed by Vigil-Colet et al. (2008) to Turkish. This study is expected to fill an important gap in the literature, since no valid and reliable statistics anxiety scale has been developed or adapted in Turkish for undergraduate students. The sample…
Descriptors: Foreign Countries, Affective Measures, Statistics, Mathematics Anxiety
Tekalmaz, Gözde; Kezer, Fatih – Participatory Educational Research, 2021
The aim of the research is to develop an online environment where teachers can access test and item statistics and a comprehensible report on the tests they have used, to let them store these reports in their own online environment, and to get an idea of the utility of that environment. The research is planned as…
Descriptors: Computer Uses in Education, Statistics, Tests, Test Items
Walker, A. Adrienne; Jennings, Jeremy Kyle; Engelhard, George, Jr. – Educational Assessment, 2018
Individual person fit analyses provide important information regarding the validity of test score inferences for an "individual" test taker. In this study, we use data from an undergraduate statistics test (N = 1135) to illustrate a two-step method that researchers and practitioners can use to examine individual person fit. First, person…
Descriptors: Test Items, Test Validity, Scores, Statistics
Patton, Jeffrey M.; Cheng, Ying; Hong, Maxwell; Diao, Qi – Journal of Educational and Behavioral Statistics, 2019
In psychological and survey research, the prevalence and serious consequences of careless responses from unmotivated participants are well known. In this study, we propose to iteratively detect careless responders and cleanse the data by removing their responses. The careless responders are detected using person-fit statistics. In two simulation…
Descriptors: Test Items, Response Style (Tests), Identification, Computation
Hidayati, Kana; Budiyono; Sugiman – Eurasian Journal of Educational Research, 2019
Purpose: Essay tests in mathematics, in both restricted-response and extended-response form, generally consist of polytomously scored items. However, the essay tests used by teachers in Indonesia have not been fully supported by sufficient evidence of quality. There have been many studies focusing on the development of the essay test, but not many…
Descriptors: Alignment (Education), Item Response Theory, Statistics, Essay Tests
Sandoval-Bravo, Salvador; Celso-Arellano, Pedro Luis; Gualajara, Victor; Coronado, Semei – European Journal of Contemporary Education, 2019
The objective of this study is to analyze the ability of students from different economic-administrative undergraduate programs of the University Center for the Economic Administrative Sciences, which forms part of the University of Guadalajara, to solve distinct problems in the area of probability, applying a multiple-choice instrument aligned to…
Descriptors: Probability, Undergraduate Students, Economics Education, Problem Solving