Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 6 |
| Since 2007 (last 20 years) | 9 |
Descriptor
| Test Items | 30 |
| Testing | 30 |
| Testing Problems | 30 |
| Test Construction | 12 |
| Multiple Choice Tests | 8 |
| Test Reliability | 8 |
| Elementary Secondary Education | 7 |
| Test Format | 7 |
| Difficulty Level | 6 |
| Higher Education | 6 |
| Item Analysis | 6 |
| More ▼ | |
Source
Author
| Herndon, Enid B. | 2 |
| Altepeter, Tom | 1 |
| Barlow, Lisa | 1 |
| Boldt, R. F. | 1 |
| Bramley, Tom | 1 |
| Brown, Alan S. | 1 |
| Canning, Christine | 1 |
| Casserly, Michael | 1 |
| Chastain, Kenneth D. | 1 |
| Corcoran, Amanda | 1 |
| Crisp, Victoria | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 2 |
| Adult Education | 1 |
| Elementary Education | 1 |
| Higher Education | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Postsecondary Education | 1 |
| Secondary Education | 1 |
Audience
| Practitioners | 2 |
| Teachers | 2 |
Location
| Indonesia | 1 |
| New Jersey | 1 |
| United Arab Emirates | 1 |
| United Kingdom | 1 |
| United Kingdom (Great Britain) | 1 |
| United States | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Perkins Loan Program | 1 |
Assessments and Surveys
| National Assessment of… | 2 |
| Expressive One Word Picture… | 1 |
| SAT (College Admission Test) | 1 |
| State Trait Anxiety Inventory | 1 |
What Works Clearinghouse Rating
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019
The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…
Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)
Reed, Deborah K.; Stevenson, Nathan; LeBeau, Brandon C. – Elementary School Journal, 2019
This study investigated the effects of imposing task- or process-oriented reading behaviors on reading comprehension assessment performance. Students in grades 5-8 (N = 275) were randomly assigned to hear multiple-choice items read aloud before or after reading a test passage and when they were and were not allowed access to the passage while…
Descriptors: Reading Comprehension, Reading Tests, Multiple Choice Tests, Reading Aloud to Others
Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019
For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). There has been little published research on choice of exam questions in recent years in the UK. In this article we distinguish different scenarios in which choice…
Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries
Sinharay, Sandip – Journal of Educational Measurement, 2017
Person-fit assessment (PFA) is concerned with uncovering atypical test performance as reflected in the pattern of scores on individual items on a test. Existing person-fit statistics (PFSs) include both parametric and nonparametric statistics. Comparison of PFSs has been a popular research topic in PFA, but almost all comparisons have employed…
Descriptors: Goodness of Fit, Testing, Test Items, Scores
Henning, Grant – English Teaching Forum, 2012
To some extent, good testing procedure, like good language use, can be achieved through avoidance of errors. Almost any language-instruction program requires the preparation and administration of tests, and it is only to the extent that certain common testing mistakes have been avoided that such tests can be said to be worthwhile selection,…
Descriptors: Testing, English (Second Language), Testing Problems, Student Evaluation
National Council on Measurement in Education, 2012
Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…
Descriptors: State Programs, Integrity, Testing, Test Preparation
Hart, Ray; Casserly, Michael; Uzzell, Renata; Palacios, Moses; Corcoran, Amanda; Spurgeon, Liz – Council of the Great City Schools, 2015
There has been little data collected on how much testing actually goes on in America's schools and how the results are used. So in the Spring of 2014, the Council staff developed and launched a survey of assessment practices. This report presents the findings from that survey and subsequent Council analysis and review of the data. It also offers…
Descriptors: Urban Schools, Student Evaluation, Testing Programs, Testing
Peer reviewedRusch, Reuben; Steiner, Judith – Journal of Experimental Education, 1979
The Selected Marker Tests were examined for scoring problems and internal consistency and were administered orally to sixth and seventh graders. Scoring problems were discovered and changes were suggested. The problem was found to be item reliability rather than interrater reliability. (Author/MH)
Descriptors: Cognitive Tests, Elementary Education, Item Analysis, Problem Solving
Lee, Jo Ann; And Others – 1984
The difficulty of test items administered by paper and pencil were compared with the difficulty of the same items administered by computer. The study was conducted to determine if an interaction exists between mode of test administration and ability. An arithmetic reasoning test was constructed for this study. All examinees had taken the Armed…
Descriptors: Adults, Comparative Analysis, Computer Assisted Testing, Difficulty Level
Haenn, Joseph F. – 1981
Procedures for conducting functional level testing have been available for use by practitioners for some time. However, the Title I Evaluation and Reporting System (TIERS), developed in response to the educational amendments of 1974 to the Elementary and Secondary Education Act (ESEA), has provided the impetus for widespread adoption of this…
Descriptors: Achievement Tests, Difficulty Level, Scores, Scoring
Peer reviewedHillman, R. A. H.; And Others – School Science Review, 1981
Describes a study which explored some difficulties related to technical and nontechnical vocabulary and the structure of the examination questions in electrochemistry. Includes results from a sample of 1,500 students in the fourth forms. (DS)
Descriptors: Chemistry, Multiple Choice Tests, Science Education, Science Instruction
Peer reviewedAltepeter, Tom – School Psychology Review, 1983
A critical review of the Expressive One-Word Picture Vocabulary Test (Gardner) is offered. The reviewer feels that the instrument cannot be recommended in its present form. Further research concerning the manual, and theoretical issues, (particularly test-retest stability) is strongly recommended. (Author/PN)
Descriptors: Error of Measurement, Intelligence Tests, Item Analysis, Pictorial Stimuli
Peer reviewedTalmir, Pinchas – Biochemical Education, 1991
Describes how multiple-choice items can be designed and used as an effective diagnostic tool by avoiding their pitfalls and by taking advantage of their potential benefits. The following issues are discussed: correct' versus best answers; construction of diagnostic multiple-choice items; the problem of guessing; the use of justifications of…
Descriptors: Biochemistry, Educational Research, Evaluation, Higher Education
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
