Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Finch, Holmes; Habing, Brian – Journal of Educational Measurement, 2005
This study examines the performance of a new method for assessing and characterizing dimensionality in test data using the NOHARM model, and comparing it with DETECT. Dimensionality assessment is carried out using two goodness-of-fit statistics that are compared to reference X[2] distributions. A Monte Carlo study is used with item parameters…
Descriptors: Program Effectiveness, Monte Carlo Methods, Item Response Theory, Comparative Analysis
Bedore, Lisa M.; Pena, Elizabeth D.; Garcia, Melissa; Cortez, Celina – Language, Speech, and Hearing Services in Schools, 2005
Purpose: This study evaluates the extent to which bilingual children produce the same or overlapping responses on tasks assessing semantic skills in each of their languages and whether classification analysis based on monolingual or conceptual scoring can accurately classify the semantic development of typically developing (TD) bilingual children.…
Descriptors: Monolingualism, Semantics, Skill Development, Young Children
Liu, Xiufeng; MacIsaac, Dan – Journal of Science Education and Technology, 2005
This study investigates factors affecting the degree of novice physics students application of the naive impetus theory. Six hundred and fourteen first-year university engineering physics students answered the Force Concept Inventory as a pre-test for their calculus-based course. We examined the degree to which students consistently applied the…
Descriptors: Prediction, Familiarity, Physics, College Freshmen
Wang, Wen-Chung; Cheng, Ying-Yao; Wilson, Mark – Educational and Psychological Measurement, 2005
A parallel design, in which items across different scales within an instrument share common stimuli and subjects respond to the common stimulus for each scale, is sometimes used in questionnaires or inventories. Because the items across scales share the same stimuli, the assumption of local item independence may not hold, thereby violating the…
Descriptors: Stimuli, Psychometrics, Test Items, Item Response Theory
Peer reviewedYoung, Ellie L.; Sudweeks, Richard R. – Measurement and Evaluation in Counseling and Development, 2005
Differential item functioning (DIF) in the Multidimensional Self Concept Scale (B. A. Bracken, 1992) was evaluated using 2 different methods to identify and describe DIF. Of 149 items from the Multidimensional Self Concept Scale, 42% exhibited gender DIF. Nonuniform, crossover DIF was evident in items throughout the instrument.
Descriptors: Early Adolescents, Measures (Individuals), Self Concept, Test Items
Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004
The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…
Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items
Pommerich, Mary – Journal of Technology, Learning, and Assessment, 2004
As testing moves from paper-and-pencil administration toward computerized administration, how to present tests on a computer screen becomes an important concern. Of particular concern are tests that contain necessary information that cannot be displayed on screen all at once for an item. Ideally, the method of presentation should not interfere…
Descriptors: Test Content, Computer Assisted Testing, Multiple Choice Tests, Computer Interfaces
Powers, Robert A.; Allison, Dean E.; Grassl, Richard M. – International Journal for Technology in Mathematics Education, 2005
This study investigated the impact of the TI-92 handheld Computer Algebra System (CAS) on student achievement in a discrete mathematics course. Specifically, the researchers examined the differences between a CAS section and a control section of discrete mathematics on students' in-class examinations. Additionally, they analysed student approaches…
Descriptors: Control Groups, Mathematics Education, Test Items, Investigations
Van den Noortgate, Wim; De Boeck, Paul – Journal of Educational and Behavioral Statistics, 2005
Although differential item functioning (DIF) theory traditionally focuses on the behavior of individual items in two (or a few) specific groups, in educational measurement contexts, it is often plausible to regard the set of items as a random sample from a broader category. This article presents logistic mixed models that can be used to model…
Descriptors: Test Bias, Item Response Theory, Educational Assessment, Mathematical Models
Oliver, Renee; Williams, Robert L. – Journal of Behavioral Education, 2005
Students in four sections of an undergraduate educational course (two large and two small sections) took out-of-class practice exams prior to actual exams for each of five course units. Each course unit consisted of five class sessions focusing on a specific developmental theme. Some sections received practice-exam credit based on the number of…
Descriptors: Undergraduate Students, Predictor Variables, Contingency Management, Education Courses
van Barneveld, Christina – Alberta Journal of Educational Research, 2003
The purpose of this study was to examine the potential effect of false assumptions regarding the motivation of examinees on item calibration and test construction. A simulation study was conducted using data generated by means of several models of examinee item response behaviors (the three-parameter logistic model alone and in combination with…
Descriptors: Simulation, Motivation, Computation, Test Construction
Sheehan, Kathleen M.; Kostin, Irene; Futagi, Yoko – ETS Research Report Series, 2007
This paper explores alternative approaches for facilitating efficient, evidence-centered item development for a new type of verbal reasoning item developed for use on the GRE® General Test. Results obtained in two separate studies are reported. The first study documented the development and validation of a fully automated approach for locating the…
Descriptors: College Entrance Examinations, Graduate Study, Test Items, Item Analysis
Kennedy, Lauren Culzean – Online Submission, 2007
This research paper describes the benefits of using an activity-based rhetorical perspective to develop English for specific purposes (ESP) test specifications. This approach expands the potential of ESP test specifications to analyze and describe target language use (TLU) situations, TLU tasks, and ESP test tasks. Multiple activity systems are…
Descriptors: Freshman Composition, Tests, English for Academic Purposes, Rhetorical Theory
Beckert, Troy E.; Strom, Robert D.; Strom, Paris S.; Yang, Cheng-Ta; Singh, Archana – Educational and Psychological Measurement, 2007
This study examined whether the original factor structure of the Parent Success Indicator (PSI) could be replicated with scores from generational views on both the English- and Mandarin-language versions of the instrument. The 60-item PSI was evaluated using responses from 840 Taiwanese parents (n = 429) and their 10- to 14-year-old adolescents (n…
Descriptors: Goodness of Fit, Adolescents, Factor Structure, Success
Deane, Paul; Graf, Edith Aurora; Higgins, Derrick; Futagi, Yoko; Lawless, René – ETS Research Report Series, 2006
This study focuses on the relationship between item modeling and evidence-centered design (ECD); it considers how an appropriately generalized item modeling software tool can support systematic identification and exploitation of task-model variables, and then examines the feasibility of this goal, using linear-equation items as a test case. The…
Descriptors: Test Items, Models, Computer Software, Equations (Mathematics)

Direct link
