Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 125 |
| Since 2017 (last 10 years) | 302 |
| Since 2007 (last 20 years) | 619 |
Descriptor
| Comparative Analysis | 835 |
| Computer Assisted Testing | 835 |
| Foreign Countries | 237 |
| Scores | 193 |
| Second Language Learning | 142 |
| Adaptive Testing | 139 |
| Test Format | 135 |
| Test Items | 133 |
| English (Second Language) | 121 |
| Language Tests | 114 |
| Statistical Analysis | 108 |
| More ▼ | |
Source
Author
| Dodd, Barbara G. | 12 |
| Chang, Hua-Hua | 7 |
| Attali, Yigal | 5 |
| De Ayala, R. J. | 5 |
| Weiss, David J. | 5 |
| Coniam, David | 4 |
| Ponsoda, Vicente | 4 |
| Sinharay, Sandip | 4 |
| Veldkamp, Bernard P. | 4 |
| Alt, Mary | 3 |
| Anderson, Paul S. | 3 |
| More ▼ | |
Publication Type
Education Level
Location
| China | 24 |
| Australia | 20 |
| Germany | 19 |
| United Kingdom | 18 |
| Turkey | 15 |
| Canada | 13 |
| Netherlands | 12 |
| Taiwan | 12 |
| Iran | 11 |
| Japan | 11 |
| Massachusetts | 10 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Every Student Succeeds Act… | 2 |
| Individuals with Disabilities… | 2 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Nietfeld, John L.; Enders, Craig K; Schraw, Gregory – Educational and Psychological Measurement, 2006
Researchers studying monitoring accuracy currently use two different indexes to estimate accuracy: relative accuracy and absolute accuracy. The authors compared the distributional properties of two measures of monitoring accuracy using Monte Carlo procedures that fit within these categories. They manipulated the accuracy of judgments (i.e., chance…
Descriptors: Monte Carlo Methods, Test Items, Computation, Metacognition
Bebell, Damian; Kay, Rachel – Journal of Technology, Learning, and Assessment, 2010
This paper examines the educational impacts of the Berkshire Wireless Learning Initiative (BWLI), a pilot program that provided 1:1 technology access to all students and teachers across five public and private middle schools in western Massachusetts. Using a pre/post comparative study design, the current study explores a wide range of program…
Descriptors: Research Design, Middle Schools, Pilot Projects, Research Methodology
Shermis, Mark D.; And Others – 1992
The reliability of four branching algorithms commonly used in computer adaptive testing (CAT) was examined. These algorithms were: (1) maximum likelihood (MLE); (2) Bayesian; (3) modal Bayesian; and (4) crossover. Sixty-eight undergraduate college students were randomly assigned to one of the four conditions using the HyperCard-based CAT program,…
Descriptors: Adaptive Testing, Algorithms, Bayesian Statistics, Comparative Analysis
Puhan, Gautam; Boughton, Keith A.; Kim, Sooyeon – ETS Research Report Series, 2005
The study evaluated the comparability of two versions of a teacher certification test: a paper-and-pencil test (PPT) and computer-based test (CBT). Standardized mean difference (SMD) and differential item functioning (DIF) analyses were used as measures of comparability at the test and item levels, respectively. Results indicated that effect sizes…
Descriptors: Comparative Analysis, Test Items, Statistical Analysis, Teacher Certification
Nutter, Norma; And Others – 1989
The East Tennessee State University College of Education has begun removing objective testing from regularly scheduled class time and has begun using Macintosh computers for content tutoring and the evaluation of students at their convenience. The new testing program requires students to meet criteria for the number of correct responses within…
Descriptors: Comparative Analysis, Computer Assisted Instruction, Computer Assisted Testing, Conventional Instruction
Anderson, Paul S. – 1987
Seven formats of educational testing were compared for student test preferences and how well each evaluated learning. The formats were: (1) true/false; (2) multiple choice; (3) matching; (4) MDT Multiple Digit Testing, in which a machine scores fill-in-the-blanks; (5) fill-in-the-blanks; (6) short answers; and (7) essay. A total of 1,440 survey…
Descriptors: College Students, Comparative Analysis, Computer Assisted Testing, Essay Tests
Federico, Pat-Anthony – 1989
To determine the relative reliabilities and validities of paper-based and computer-based measurement procedures, 83 male student pilots and radar intercept officers were administered computer and paper-based tests of aircraft recognition. The subject matter consisted of line drawings of front, side, and top silhouettes of aircraft. Reliabilities…
Descriptors: Armed Forces, Comparative Analysis, Computer Assisted Testing, Discriminant Analysis
Kingsbury, G. Gage; Weiss, David J. – 1981
Conventional mastery tests designed to make optimal mastery classifications were compared with fixed-length and variable-length adaptive mastery tests. Comparisons between the testing procedures were made across five content areas in an introductory biology course from tests administered to volunteers. The criterion was the student's standing in…
Descriptors: Achievement Tests, Adaptive Testing, Biology, Comparative Analysis
Hansen, Duncan N.; And Others – 1977
A computerized adaptive testing model was applied to a hierarchically arranged series of subtests within the instructional context of a technical education system. The model was a modification of Lord's flexilevel paradigm; however, it did not allow for individualized entry. Two achievement tests, each divided into five hierarchically related…
Descriptors: Achievement Tests, Adaptive Testing, Branching, Comparative Analysis
OECD Publishing (NJ1), 2005
The original data strategy on which PISA was based suggested that, after the completion of the first three assessments in 2000, 2003 and 2006, the cycle would repeat itself with three-yearly assessments in the areas of reading, mathematics and science. However, in light of newly emerging policy priorities and the experience gained with PISA so…
Descriptors: Achievement Tests, Educational Testing, Test Construction, Long Range Planning
Sireci, Stephen G.; Foster, David F.; Robin, Frederic; Olsen, James – 1997
Evaluating the comparability of a test administered in different languages is a difficult, if not impossible, task. Comparisons are problematic because observed differences in test performance between groups who take different language versions of a test could be due to a difference in difficulty between the tests, to cultural differences in test…
Descriptors: Adaptive Testing, Adults, Certification, Comparative Analysis
Peer reviewedFulcher, Glenn – ELT Journal, 1999
Considers the computerization of an English-language placement test for delivery on the World Wide Web. Describes a pilot study to investigate potential bias against students who lack computer familiarity or have negative attitudes towards technology, and assesses the usefulness of the test as a placement instrument by comparing the accuracy of…
Descriptors: Comparative Analysis, Computer Assisted Testing, Computer Literacy, English (Second Language)
Kyte, Zoe A.; Goodyer, Ian M.; Sahakian, Barbara J. – Journal of Child Psychology and Psychiatry, 2005
Background: To investigate whether recent first episode major depression in adolescence is characterised by selected executive difficulties in attentional flexibility, behavioural inhibition and decision-making. Methods: Selected executive functions were compared in adolescents with recent (past year) first episode major depression (n = 30) and…
Descriptors: Stimuli, Adolescents, Depression (Psychology), Cognitive Ability
Poggio, John; Glasnapp, Douglas R.; Yang, Xiangdong; Poggio, Andrew J. – Journal of Technology, Learning, and Assessment, 2005
The present study reports results from a quasi-controlled empirical investigation addressing the impact on student test scores when using fixed form computer based testing (CBT) versus paper and pencil (P&P) testing as the delivery mode to assess student mathematics achievement in a state's large scale assessment program. Grade 7 students…
Descriptors: Mathematics Achievement, Measures (Individuals), Program Effectiveness, Measurement
Maher, Thomas G. – 1993
A general evaluation design was developed to examine the effectiveness of a computer-based, multimedia simulation test on California smog check mechanics. The simulation test operated on an Apple Macintosh IIci, with a single touchscreen color monitor controlling a videodisc player; it had three parts: introduction-tutorial-help, data, and test.…
Descriptors: Adult Education, Auto Mechanics, Comparative Analysis, Computer Assisted Testing

Direct link
