Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 6 |
Descriptor
Scores | 27 |
Test Format | 27 |
Test Use | 27 |
Achievement Tests | 9 |
Test Construction | 8 |
Test Validity | 7 |
Adaptive Testing | 5 |
Correlation | 5 |
Elementary Secondary Education | 5 |
Standardized Tests | 5 |
Testing Programs | 5 |
More ▼ |
Source
Author
Baldwin, Janet, Ed. | 2 |
Hambleton, Ronald K. | 2 |
Arter, Judith A. | 1 |
Barrett, Michael J. | 1 |
Bielinski, John | 1 |
Bortnik, Boris | 1 |
Bunch, Michael B. | 1 |
Buser, Karen | 1 |
Bush, Martin | 1 |
Cheng, Liying | 1 |
Cheung, Fanny M. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
High Schools | 1 |
More ▼ |
Audience
Practitioners | 2 |
Teachers | 1 |
Location
Hong Kong | 1 |
Indonesia | 1 |
Kentucky | 1 |
New York (Albany) | 1 |
New York (Buffalo) | 1 |
New York (New York) | 1 |
New York (Rochester) | 1 |
New York (Syracuse) | 1 |
Russia | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
General Educational… | 4 |
Minnesota Multiphasic… | 2 |
Armed Services Vocational… | 1 |
College Board Achievement… | 1 |
Test of English as a Foreign… | 1 |
Watson Glaser Critical… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021
This test review focuses on the current version (2009) of [Chinese characters omitted] (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…
Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making
Yulianto, Ahmad; Pudjitriherwanti, Anastasia; Kusumah, Chevy; Oktavia, Dies – International Journal of Language Testing, 2023
The increasing use of computer-based mode in language testing raises concern over its similarities with and differences from paper-based format. The present study aimed to delineate discrepancies between TOEFL PBT and CBT. For that objective, a quantitative method was employed to probe into scores equivalence, the performance of male-female…
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Scores
Bortnik, Boris; Stozhko, Natalia; Pervukhina, Irina – Education Sciences, 2021
Testing as an assessment technique has been widely used at all levels of education--from primary to higher school. The main purpose of the paper is to evaluate the effect of context-based testing in teaching and learning of analytical chemistry in a Russian university. The paper formulates the objectives of context-based testing, discusses its…
Descriptors: Chemistry, Science Instruction, Science Tests, Undergraduate Students
Otoyo, Lucia; Bush, Martin – Practical Assessment, Research & Evaluation, 2018
This article presents the results of an empirical study of "subset selection" tests, which are a generalisation of traditional multiple-choice tests in which test takers are able to express partial knowledge. Similar previous studies have mostly been supportive of subset selection, but the deduction of marks for incorrect responses has…
Descriptors: Multiple Choice Tests, Grading, Test Reliability, Test Format
New York State Education Department, 2015
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. By comparing the intent of the NYSAA with its process and design, the validity of the…
Descriptors: Alternative Assessment, Grade 3, Grade 4, Grade 5
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Herring, Warren – Journal of Correctional Education, 1999
Analyzes the relationship between success on the two new practice-test forms (EE and FF) developed by Steck-Vaughn and success on the General Educational Development (GED) test. Success with practice-form EE correlated with GED test scores; form FF failed to correlate. (JOW)
Descriptors: Adult Education, Eligibility, Prediction, Scores

Stern, Paul C.; Guagnano, Gregory A.; Dietz, Thomas – Educational and Psychological Measurement, 1998
A brief version of the instrument developed by S. Schwartz (1992, 1994) to measure the structure and content of human values was developed. Studies with 199 adults and 420 adults support the reliability of scores produced by the brief inventory's four three-item scales. Uses of the brief form are discussed. (SLD)
Descriptors: Adults, Reliability, Scores, Test Construction
Koretz, Daniel; Hamilton, Laura – 1999
An earlier study (D. Koretz, 1997) found that Kentucky had been unusually successful in testing most students with disabilities, but it also found numerous signs of poor measurement, including differential item functioning (DIF) in mathematics, apparently excessive use of accommodations, and implausibly high mean scores for some groups of students…
Descriptors: Disabilities, Elementary Secondary Education, Item Bias, Scores

Loo, S. Robert; Thorpe, Karran – Educational and Psychological Measurement, 1999
Used samples of 142 management and 123 nursing undergraduates to evaluate the psychometric properties and factor structure of the newly developed Form S (short form) of the Watson-Glaser Critical Thinking Appraisal (G. Watson and E. Glaser, 1964, 1994). Results provide only limited support for Form S, and further refinement is suggested. (SLD)
Descriptors: Administration, Critical Thinking, Higher Education, Nursing

Cheung, Fanny M.; Ho, Ringo M. – Psychological Assessment, 1997
The Chinese Minnesota Multiphasic Personality Inventory-Adolescents (MMPI-A) was applied in Hong Kong to a normative sample of 565 male and 664 female students aged 14 to 18. In conjunction with previous research, findings support the possibility of cultural differences in item interpretation, which should be considered in clinical interpretations…
Descriptors: Adolescents, Chinese, Cultural Differences, Foreign Countries

Gaston, Michele F.; And Others – Assessment, 1994
Comparability of the Minnesota Multiphasic Personality Inventory (MMPI) and the MMPI-2 was explored by examining T-score means, profile configurations, score distribution, and rank-order correlations on validity scales for 84 undergraduates. Equivalency of the two forms was generally supported. (SLD)
Descriptors: Comparative Analysis, Correlation, Higher Education, Personality Assessment
Bielinski, John; Thurlow, Martha; Minnema, Jane; Scott, Jim – 2000
This report is a review and analysis of the psychometric literature on the topic of out-of-level testing. Out-of-level testing refers to the practice of using a level of the test other than the test taken by most of the students in a student's current grade level. Much of the research on out-of-level testing was conducted in the 1970s and 1980s,…
Descriptors: Achievement Tests, Elementary Secondary Education, Equated Scores, Error of Measurement

Mattis, Paul J.; And Others – Psychological Assessment, 1992
The predictive power of the short-form Wechsler Adult Intelligence Scale of P. Satz and S. Mogel to provide equivalent information about IQ scores and age-corrected scale scores was not differentially affected by the side of the lesion for 63 patients with brain tumors. (SLD)
Descriptors: Adults, Brain Hemisphere Functions, Correlation, Diagnostic Tests
Wise, Lauress – 1993
As high-stakes use of tests increases, it becomes vital that test developers and test users communicate clearly about the accuracy and limitations of the scores generated by a test after it is assembled and used. A procedure is described for portraying the accuracy of test scores. It can be used in setting accuracy targets during form construction…
Descriptors: Classification, High Stakes Tests, Item Response Theory, Military Personnel
Previous Page | Next Page ยป
Pages: 1 | 2