NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 1,231 to 1,245 of 3,093 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Allalouf, Avi; Rapp, Joel; Stoller, Reuven – International Journal of Testing, 2009
When a test is adapted from a source language (SL) into a target language (TL), the two forms are usually not psychometrically equivalent. If linking between test forms is necessary, those items that have had their psychometric characteristics altered by the translation (differential item functioning [DIF] items) should be eliminated from the…
Descriptors: Test Items, Test Format, Verbal Tests, Psychometrics
New York State Education Department, 2014
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. The purpose of this report is to document the technical aspects of the 2013-14 NYSAA.…
Descriptors: Alternative Assessment, Educational Assessment, State Departments of Education, Student Evaluation
Christensen, Laurene L. – ProQuest LLC, 2010
This study investigated the inclusion of English language learners (ELLs) in state standards and assessments, as measured by comments made by peer reviewers in the federal evaluation of states' standards and assessments. As required by the Elementary and Secondary Education Act (ESEA), reauthorized in 2004 as No Child Left Behind (NCLB), states…
Descriptors: Elementary Secondary Education, Federal Legislation, Research Methodology, State Standards
Peer reviewed Peer reviewed
Direct linkDirect link
Taguchi, Naoko – Modern Language Journal, 2008
This study developed an original instrument that measures pragmatic comprehension in Japanese as a foreign language (JFL). It examined the ability to comprehend implied meaning encoded in conventional and nonconventional features and the effect of proficiency on comprehension. There were 63 college students of Japanese at 2 proficiency levels who…
Descriptors: Test Format, Scores, Second Language Learning, Japanese
Peer reviewed Peer reviewed
Direct linkDirect link
Marshall, Robert C.; Wright, Heather Harris – American Journal of Speech-Language Pathology, 2007
Purpose: The Kentucky Aphasia Test (KAT) is an objective measure of language functioning for persons with aphasia. This article describes materials, administration, and scoring of the KAT; presents the rationale for development of test items; reports information from a pilot study; and discusses the role of the KAT in aphasia assessment. Method:…
Descriptors: Aphasia, Test Format, Language Tests, Expressive Language
National Assessment Governing Board, 2009
As the ongoing national indicator of what American students know and can do, the National Assessment of Educational Progress (NAEP) in Reading regularly collects achievement information on representative samples of students in grades 4, 8, and 12. The information that NAEP provides about student achievement helps the public, educators, and…
Descriptors: National Competency Tests, Reading Tests, Test Items, Test Format
Peer reviewed Peer reviewed
Direct linkDirect link
Girard, Todd A.; Christensen, Bruce K. – Psychological Assessment, 2008
The correlation between a short-form (SF) test and its full-scale (FS) counterpart is a mainstay in the evaluation of SF validity. However, in correcting for overlapping error variance in this measure, investigators have overattenuated the validity coefficient through an intuitive misapplication of P. Levy's (1967) formula. The authors of the…
Descriptors: Error of Measurement, Computation, Psychiatric Services, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
von Davier, Alina A.; Wilson, Christine – Applied Psychological Measurement, 2008
Dorans and Holland (2000) and von Davier, Holland, and Thayer (2003) introduced measures of the degree to which an observed-score equating function is sensitive to the population on which it is computed. This article extends the findings of Dorans and Holland and of von Davier et al. to item response theory (IRT) true-score equating methods that…
Descriptors: Advanced Placement, Advanced Placement Programs, Equated Scores, Calculus
Peer reviewed Peer reviewed
Direct linkDirect link
Anakwe, Bridget – Journal of Education for Business, 2008
The author investigated the impact of assessment methods on student performance on accounting tests. Specifically, the author used analysis of variance to determine whether the use of computer-based tests instead of paper-based tests affects students' traditional test scores in accounting examinations. The author included 2 independent variables,…
Descriptors: Student Evaluation, Testing, Statistical Analysis, Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Glenda C. Rakes – Journal of Interactive Online Learning, 2008
One continuing concern associated with online courses is assessment of student performance. One option for online assessment is the use of open book tests. This study investigated the impact of training in open book test-taking strategies on student test performance in online, timed, unproctored, open book tests. When the tutorial was required…
Descriptors: Online Courses, Electronic Learning, Test Format, Test Wiseness
Peer reviewed Peer reviewed
Molina, Maria Teresa Lopez-Mezquita – Indian Journal of Applied Linguistics, 2009
Lexical competence is considered to be an essential step in the development and consolidation of a student's linguistic ability, and thus the reliable assessment of such competence turns out to be a fundamental aspect in this process. The design and construction of vocabulary tests has become an area of special interest, as it may provide teachers…
Descriptors: Student Evaluation, Second Language Learning, Computer Assisted Testing, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007
Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…
Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Balch, William R. – Teaching of Psychology, 2007
Undergraduates studied the definitions of 16 psychology terms, expecting either a multiple-choice (n = 132) or short-answer (n = 122) test. All students then received the same multiple-choice test, requiring them to recognize the definitions as well as novel examples of the terms. Compared to students expecting a multiple-choice test, those…
Descriptors: Expectation, Definitions, Multiple Choice Tests, Undergraduate Students
Tanguma, Jesus – 2000
This paper describes four commonly used designs in equating test scores. These designs are: (1) single-group; (2) random-group; (3) equivalent-group; and (4) anchor-test. Each design requires that its data be collected according to specific guidelines. Three of the four methods are illustrated through hypothetical examples. All four methods try to…
Descriptors: Equated Scores, Test Format
Peer reviewed Peer reviewed
Castle, Nicholas G.; Engberg, John – Gerontologist, 2004
Purpose: A factor common to the results of many satisfaction surveys of elders is a lack of response variability. Increasing response variability may be useful if satisfaction surveys of elders are to be productively used in the future. In this paper, we first examine elders' preferences between five response formats and then examine the response…
Descriptors: Surgery, Patients, Test Format
Pages: 1  |  ...  |  79  |  80  |  81  |  82  |  83  |  84  |  85  |  86  |  87  |  ...  |  207