NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1,261 to 1,275 of 9,530 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Ali, Ikram; Haral, Muhammad Nouman; Tahira, Fatima; Ali, Mirza; Imran, Aqeel – Education and Urban Society, 2020
Foundation of quality examination is based on the key features of validity and reliability of question papers. Question papers of examination boards in Pakistan--usually these features as contents of a question paper--are written by a single paper setter. To address these issues, Federal Board of Intermediate and Secondary Education (FBISE)…
Descriptors: Foreign Countries, Multiple Choice Tests, Test Items, Difficulty Level
Peer reviewed Peer reviewed
Direct linkDirect link
Shin, Jinnie; Bulut, Okan; Gierl, Mark J. – Journal of Experimental Education, 2020
The arrangement of response options in multiple-choice (MC) items, especially the location of the most attractive distractor, is considered critical in constructing high-quality MC items. In the current study, a sample of 496 undergraduate students taking an educational assessment course was given three test forms consisting of the same items but…
Descriptors: Foreign Countries, Undergraduate Students, Multiple Choice Tests, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Dirlik, Ezgi Mor – International Journal of Progressive Education, 2020
Mokken models have recently started to become the preferred method of researchers from different fields in studies of nonparametric item response theory (NIRT). Despite increasing application of these models, some features of this type of modelling need further study and explanation. Invariant item ordering (IIO) is one of these areas, which the…
Descriptors: Item Response Theory, Test Items, Nonparametric Statistics, Scoring
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sahin, Melek Gulsah – International Journal of Assessment Tools in Education, 2020
Computer Adaptive Multistage Testing (ca-MST), which take the advantage of computer technology and adaptive test form, are widely used, and are now a popular issue of assessment and evaluation. This study aims at analyzing the effect of different panel designs, module lengths, and different sequence of a parameter value across stages and change in…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Bokander, Lars; Bylund, Emanuel – Language Learning, 2020
Over the past decade, the LLAMA language aptitude test battery has come to play an increasingly important role as an instrument in research on individual differences in language development. However, a potentially serious problem that has been pointed out by several scholars is that the LLAMA has not yet been carefully validated. We addressed this…
Descriptors: Item Analysis, Language Tests, Test Items, Individual Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Martin, Jessica L.; Zamboanga, Byron L.; Haase, Richard F.; Buckner, Lindsay C. – Measurement and Evaluation in Counseling and Development, 2020
The purpose of this study was to assess measurement equivalence of the 15-item Protective Behavioral Strategies Scale (PBSS) across White and Black college students. Results partially supported measurement equivalence across racial groups. Clinicians and researchers should be cautious in using the PBSS to make comparisons between White and Black…
Descriptors: Likert Scales, White Students, African American Students, Drinking
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Al-Mahrooqi, Rahma; Denman, C. J. – International Journal of Instruction, 2020
The level of critical thinking skills of Omani tertiary-level students is an area that has received only a limited amount of investigative attention. This study employed an adapted version of the Cornell Class-Reasoning Test, Form X to assess the critical thinking skills of students in the humanities- and science-based colleges of Sultan Qaboos…
Descriptors: Critical Thinking, Thinking Skills, Humanities, College Students
Peer reviewed Peer reviewed
Direct linkDirect link
Brocato, Nicole; Hix, Laura; Jayawickreme, Eranda – Journal of Moral Education, 2020
University settings present a unique opportunity for young adults to develop characteristics constitutive of wisdom. One challenge for educators working to support this development involves effectively measuring these characteristics. In this article, we present results from a secondary analysis of cognitive interviews to examine challenges that…
Descriptors: Undergraduate Students, Young Adults, Personality, Individual Characteristics
Peer reviewed Peer reviewed
Direct linkDirect link
Trace, Jonathan – Language Testing, 2020
Originally designed to measure reading and passage comprehension in L1 readers, cloze tests continue to be used for L2 assessment purposes. However, there remain disputes about whether or not cloze items can measure beyond local comprehension information, as well as whether or not they are purely a test of reading alone, or if performance can be…
Descriptors: Cloze Procedure, Second Language Learning, Reading Comprehension, Native Language
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
In this study, the efficiency of various random sampling methods to reduce the number of items rated by judges in an Angoff standard-setting study was examined and the methods were compared with each other. Firstly, the full-length test was formed by combining Placement Test 2012 and 2013 mathematics subsets. After then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Rachel A. Gross – ProQuest LLC, 2020
The present study was motivated by the theory-method mismatch between heterotypic continuity (aspects of development that manifest differently across the lifespan thus cannot be measured the same way over time) and longitudinal measurement equivalence (the statistical assumption that the developmental phenomenon studied is measured on the same…
Descriptors: Robustness (Statistics), Structural Equation Models, Longitudinal Studies, Error of Measurement
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Akbay, Lokman; Kilinç, Mustafa – International Journal of Assessment Tools in Education, 2018
Measurement models need to properly delineate the real aspect of examinees' response processes for measurement accuracy purposes. To avoid invalid inferences, fit of examinees' response data to the model is studied through "person-fit" statistics. Misfit between the examinee response data and measurement model may be due to invalid…
Descriptors: Reliability, Goodness of Fit, Cognitive Measurement, Models
Sinharay, Sandip; Jensen, Jens Ledet – Grantee Submission, 2018
In educational and psychological measurement, researchers and/or practitioners are often interested in examining whether the ability of an examinee is the same over two sets of items. Such problems can arise in measurement of change, detection of cheating on unproctored tests, erasure analysis, detection of item preknowledge etc. Traditional…
Descriptors: Test Items, Ability, Mathematics, Item Response Theory
Benton, Tom – Cambridge Assessment, 2018
One of the questions with the longest history in educational assessment is whether it is possible to increase the reliability of a test simply by altering the way in which scores on individual test items are combined to make the overall test score. Most usually, the score available on each item is communicated to the candidate within a question…
Descriptors: Test Items, Scoring, Predictive Validity, Test Reliability
Pages: 1  |  ...  |  81  |  82  |  83  |  84  |  85  |  86  |  87  |  88  |  89  |  ...  |  636