Gorney, Kylie; Sinharay, Sandip – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Gorney, Kylie – ProQuest LLC, 2023
Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…
Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias
Sinharay, Sandip; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020
Response time models (RTMs) are of increasing interest in educational and psychological testing. This article focuses on the lognormal model for response times, which is one of the most popular RTMs. Several existing statistics for testing normality and the fit of factor analysis models are repurposed for testing the fit of the lognormal model. A…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Factor Analysis
Sinharay, Sandip; van Rijn, Peter – Grantee Submission, 2020
Response-time models are of increasing interest in educational and psychological testing. This paper focuses on the lognormal model for response times (van der Linden, 2006), which is one of the most popular response-time models. Several existing statistics for testing normality and the fit of factor-analysis models are repurposed for testing the…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Factor Analysis
Beck, Klaus – Frontline Learning Research, 2020
Many test developers try to ensure the content validity of their tests by having external experts review the items, e.g., in terms of relevance, difficulty, or clarity. Although this approach is widely accepted, a closer look reveals several pitfalls that need to be avoided if experts' advice is to be truly helpful. The purpose of this paper is to…
Descriptors: Content Validity, Psychological Testing, Educational Testing, Student Evaluation
Mousavi, Amin; Cui, Ying – Education Sciences, 2020
Often, important decisions regarding accountability and placement of students in performance categories are made on the basis of test scores; therefore, it is important to evaluate the validity of the inferences derived from test results. One of the threats to the validity of such inferences is aberrant responding. Several…
Descriptors: Student Evaluation, Educational Testing, Psychological Testing, Item Response Theory
Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010
Item calibration is an essential issue in modern psychological or educational testing based on item response theory. Due to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than they were when paper-and-pencil test administration was the norm. There are many calibration…
Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement
Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009
The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…
Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods
Piotrowski, Chris; Perdue, Bob – Behavioral & Social Sciences Librarian, 1999 (peer reviewed)
Presents an overview of the major contemporary reference sources (print, online, and electronic) for scholarly information about psychological/educational tests. Stresses books and compendia that will assist reference librarians, and includes a compilation of texts that provide actual test items. Contains 53 references. (Author/LRW)
Descriptors: Educational Testing, Electronic Text, Library Materials, Online Searching
Optimal Assembly of Educational and Psychological Tests, with a Bibliography. Research Report 98-05.
van der Linden, Wim J. – 1998
The advent of computers in educational and psychological measurement has led to the need for algorithms for optimal assembly of tests from item banks. This paper reviews the literature on optimal test assembly and introduces the contributions to this report on the topic. Four different approaches to computerized test assembly are discussed:…
Descriptors: Algorithms, Computer Assisted Testing, Educational Testing, Equated Scores
Westers, Paul; Kelderman, Henk – Psychometrika, 1992 (peer reviewed)
A method for analyzing test-item responses is proposed to examine differential item functioning (DIF) in multiple-choice items within the latent class framework. Different models for detection of DIF are formulated, defining the subgroup as a latent variable. An efficient estimation method is described and illustrated. (SLD)
Descriptors: Chi Square, Difficulty Level, Educational Testing, Equations (Mathematics)
Educational Testing Service, Princeton, NJ. – 1957
This conference focused on the broad theme, improving the quality and scope of measurement. The first session centered on improving criteria for educational and psychological measurement, with papers on Criteria for Complex Mental Processes by Robert C. Wilson and on Criteria of Nonintellectual Aspects of Personality by Morris I. Stein. The second…
Descriptors: Achievement Tests, Cognitive Tests, Educational Testing, Elementary Secondary Education