NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 57 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Noa Weiss-Klayman; Mark T. Greenberg; Daphne Kopelman-Rubin – International Journal of Emotional Education, 2024
In recent years, there has been increasing awareness on the benefits of social-emotional competencies (SEC) on Israeli students. A self-report SEL measure tailored to the Israeli context, however, has yet to be developed. This research aims to validate the Social-Emotional Questionnaire for Grades 4-6 (SEQ [G4-6]), a new self-report questionnaire…
Descriptors: Foreign Countries, Social Emotional Learning, Self Management, Emotional Development
Sarah Stevens – ProQuest LLC, 2021
Early identification of at-risk students is a critical element of Response to Intervention (RTI) but there are few validated universal screeners for high school mathematics. This study evaluated the effectiveness of the screening system currently in use at one urban Texas public high school and compared its effectiveness to alternate screening…
Descriptors: Urban Schools, Public Schools, High School Students, At Risk Students
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Totten, Jeff W. – Journal of Learning in Higher Education, 2014
The original SOCO Scale was reduced to 10 items by Thomas, Soutar, and Ryan (2001). The author conducted a pretest and a posttest in his Personal Selling class during the Fall 2009 semester. Significant differences by gender, student sales experience and family member in the sales field were identified. The author once again pretested the…
Descriptors: Test Construction, Program Validation, Pretests Posttests, Questionnaires
Peer reviewed Peer reviewed
Direct linkDirect link
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Peer reviewed Peer reviewed
Feingold, Alan – Journal of Clinical Psychology, 1982
Analyzed published data on the Wechsler Adult Intelligence Scale (WAIS) to ascertain whether the Information and Vocabulary subtests can function as measures of intelligence. Concluded that the addition of more WAIS subtests will not result in any increase in predictive validity and these additional tests, therefore, lack incremental validity.…
Descriptors: Adults, Cognitive Measurement, Comparative Testing, Intelligence Tests
Peer reviewed Peer reviewed
Wolf, Fredric M.; And Others – Evaluation and the Health Professions, 1991
Five studies involving medical students over a four-year period were conducted to examine the reliability and validity of the Medical Helping Relationship Inventory (MHRI)--an instrument for evaluating medical communication skills. Evidence of the MHRI's construct validity, convergent validity, and discriminant validity is discussed. (SLD)
Descriptors: Communication Skills, Comparative Testing, Concurrent Validity, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Do-Hong; Huynh, Huynh – Educational and Psychological Measurement, 2008
The current study compared student performance between paper-and-pencil testing (PPT) and computer-based testing (CBT) on a large-scale statewide end-of-course English examination. Analyses were conducted at both the item and test levels. The overall results suggest that scores obtained from PPT and CBT were comparable. However, at the content…
Descriptors: Reading Comprehension, Computer Assisted Testing, Factor Analysis, Comparative Testing
Page, Ronald C.; De Puga, Ignacio Suarez – 1992
This paper addresses the cross-cultural development and use of the Behavior Description Index (BDI). The development of culturally-transportable scales; cross-cultural comparisons of results for 12 Spanish, 15 Hungarian, 15 Indonesian, and U.S. managers; and the transportability of competencies as constructs are considered. A competency is an…
Descriptors: Administrator Evaluation, Adults, Comparative Testing, Competence
Peer reviewed Peer reviewed
Quereshi, M. Y.; Seitz, Rainer – Intelligence, 1993
Letter series and number series tests of items based on identical rules were administered to 160 male and 160 female undergraduates to determine comparability by testing equality of means, variances, and validity coefficients. Results indicate that letter and number series tests are not equivalent. Number series tests are easier, probably because…
Descriptors: Comparative Testing, Higher Education, Test Construction, Test Validity
Borg, Walter R. – 1985
Two approaches to testing student achievement are discussed: (1) traditional standardized tests; and (2) testing achievement on an instructional unit. Content validity is a major problem with standardized achievement tests. Even though test publishers typically consider a number of curriculum guides and textbooks in producing a test which is…
Descriptors: Achievement Tests, Alternative Assessment, Comparative Testing, Content Validity
Mowsesian, Richard; Hays, William L. – 1985
The predictive validity of the revised Graduate Record Examination Analytical Test (GRE-A) was compared with the experimental form of the GRE-A, in terms of graduate school admissions as well as advancement to Ph.D. candidacy decisions. Prior to 1974 the Graduate Record Examination included just a verbal and a quantitative test; in 1974 it was…
Descriptors: Admission Criteria, Cognitive Tests, College Admission, College Entrance Examinations
Benes, Patricia; Dusewicz, Russell A. – 1975
This study was undertaken to determine reliability and validity estimates for a newly developed preschool inventory of cognitive functioning which provides many advantages over traditionally utilized measures. A predominantly pictorial stimulus-psychomotor response set was the format for the test. The test consisted of a series of 61 items divided…
Descriptors: Cognitive Development, Comparative Testing, Pictorial Stimuli, Preschool Education
Peer reviewed Peer reviewed
Cudeck, Robert – Multivariate Behavioral Research, 1985
Twelve structural models of similarity were fitted to data from conventional and computer adaptive test (CAT) batteries measuring the same aptitude in a double cross-validation design. Three of the 12 models, including a multiplicative structure model, performed well, providing support for using CATs as replacements for conventional tests. (NSF)
Descriptors: Adaptive Testing, Aptitude Tests, Comparative Testing, Computer Assisted Testing
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4