Showing all 7 results
Peer reviewed
Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017
The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…
Descriptors: Scores, Item Analysis, Classification, Decision Making
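
A minimal sketch of the basic Angoff calculation this entry refers to, assuming hypothetical rating data: each judge estimates, for every item, the probability that a minimally proficient examinee answers correctly, and the cut score is the sum over items of the mean judged probability.

```python
# Sketch of a basic (unmodified) Angoff cut score calculation.
# Rows = judges, columns = items; each entry is the judged probability that a
# minimally proficient examinee answers the item correctly (values hypothetical).
import numpy as np

ratings = np.array([
    [0.60, 0.75, 0.40, 0.85],  # judge 1
    [0.55, 0.80, 0.35, 0.90],  # judge 2
    [0.65, 0.70, 0.45, 0.80],  # judge 3
])

item_means = ratings.mean(axis=0)   # mean judged probability per item
cut_score = item_means.sum()        # expected raw score of the borderline examinee

print("Per-item means:", np.round(item_means, 3))
print("Angoff cut score (raw-score metric):", round(cut_score, 2))
```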
Peer reviewed
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2016
To measure the response style of acquiescence, researchers recommend the use of at least 15 items with heterogeneous content. Such an approach is consistent with its theoretical definition and is a substantial improvement over traditional methods. Nevertheless, measurement of acquiescence can be enhanced by two additional considerations: first, to…
Descriptors: Test Items, Response Style (Tests), Test Content, Measurement
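
A hedged sketch of the measurement approach described in this entry: an acquiescence index computed as a respondent's mean level of agreement across a set of content-heterogeneous Likert items (the abstract recommends at least 15). The 1-5 response scale and the simulated responses are assumptions for illustration only.

```python
# Sketch: acquiescence index as mean agreement across heterogeneous items.
# Assumes 1-5 Likert responses where higher values indicate stronger agreement;
# the response matrix is simulated (rows = respondents, columns = items).
import numpy as np

rng = np.random.default_rng(0)
responses = rng.integers(1, 6, size=(100, 15))  # 100 respondents, 15 heterogeneous items

# Mean response per person across all items, regardless of content:
# habitual agreers score high irrespective of what the items ask.
acquiescence_index = responses.mean(axis=1)

print("First five acquiescence scores:", np.round(acquiescence_index[:5], 2))
```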
Peer reviewed
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing
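
The descriptors for this entry mention multidimensional scaling; the sketch below shows one way alignment-style judgments might be analyzed with MDS, assuming a hypothetical matrix of expert-rated dissimilarities among tested and curricular objectives. This is not the authors' procedure, only an illustration of MDS on precomputed dissimilarities.

```python
# Sketch: multidimensional scaling of expert-judged dissimilarities among
# tested and curricular objectives, to inspect how closely test content
# sits near curriculum content. The dissimilarity matrix is hypothetical.
import numpy as np
from sklearn.manifold import MDS

labels = ["test_obj_1", "test_obj_2", "test_obj_3",
          "curric_obj_1", "curric_obj_2", "curric_obj_3"]

# Symmetric dissimilarities (0 = identical coverage), made up for illustration.
D = np.array([
    [0.0, 0.3, 0.6, 0.2, 0.5, 0.7],
    [0.3, 0.0, 0.5, 0.4, 0.3, 0.6],
    [0.6, 0.5, 0.0, 0.7, 0.6, 0.2],
    [0.2, 0.4, 0.7, 0.0, 0.4, 0.8],
    [0.5, 0.3, 0.6, 0.4, 0.0, 0.5],
    [0.7, 0.6, 0.2, 0.8, 0.5, 0.0],
])

mds = MDS(n_components=2, dissimilarity="precomputed", random_state=0)
coords = mds.fit_transform(D)

for name, (x, y) in zip(labels, coords):
    print(f"{name:>12s}: ({x:+.2f}, {y:+.2f})")
```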
Peer reviewed
Breithaupt, Krista; Hare, Donovan R. – Educational and Psychological Measurement, 2007
Many challenges exist for high-stakes testing programs offering continuous computerized administration. The automated assembly of test questions to exactly meet content and other requirements, provide uniformity, and control item exposure can be modeled and solved by mixed-integer programming (MIP) methods. A case study of the computerized…
Descriptors: Testing Programs, Psychometrics, Certification, Accounting
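
A minimal sketch of the kind of mixed-integer programming formulation this entry describes for automated test assembly, written here with the open-source PuLP library and its default CBC solver (an assumption; the case study's own tooling is not identified in the snippet). Binary variables select items, constraints fix form length and per-content-area counts, and the objective maximizes total item information at a target ability. All item data and targets are hypothetical.

```python
# Sketch: select items for a fixed-length form subject to content constraints,
# maximizing information, via mixed-integer programming (PuLP + CBC).
from pulp import LpProblem, LpVariable, LpMaximize, lpSum, LpBinary, value

pool = {  # item id -> (content area, information at the target ability); hypothetical
    "i1": ("algebra", 0.42), "i2": ("algebra", 0.35), "i3": ("algebra", 0.28),
    "i4": ("geometry", 0.40), "i5": ("geometry", 0.31), "i6": ("geometry", 0.22),
    "i7": ("stats", 0.38),   "i8": ("stats", 0.30),   "i9": ("stats", 0.25),
}
test_length = 4
content_targets = {"algebra": 2, "geometry": 1, "stats": 1}  # exact counts per area

prob = LpProblem("test_assembly", LpMaximize)
x = {i: LpVariable(f"x_{i}", cat=LpBinary) for i in pool}   # 1 = item selected

prob += lpSum(pool[i][1] * x[i] for i in pool)              # maximize total information
prob += lpSum(x.values()) == test_length                    # fixed form length
for area, count in content_targets.items():
    prob += lpSum(x[i] for i in pool if pool[i][0] == area) == count

prob.solve()
selected = [i for i in pool if value(x[i]) > 0.5]
print("Selected items:", selected)
```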
Peer reviewed
Leung, Chi-Keung; Chang, Hua-Hua; Hau, Kit-Tai – Educational and Psychological Measurement, 2003
Studied three stratification designs for computerized adaptive testing in conjunction with three well-developed content balancing methods. Simulation study results show substantial differences in item overlap rate and pool utilization among different methods. Recommends an optimal combination of stratification design and content balancing method.…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Simulation
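
A sketch of the two evaluation statistics named in this entry, computed from hypothetical simulated CAT administrations: item overlap rate as the average proportion of items shared by pairs of examinees, and pool utilization as the proportion of the pool administered at least once.

```python
# Sketch: item overlap rate and pool utilization from simulated CAT records.
# `administrations` maps examinee -> set of administered item ids (hypothetical).
from itertools import combinations

pool_size = 10
test_length = 4
administrations = {
    "p1": {1, 2, 3, 4},
    "p2": {2, 3, 5, 6},
    "p3": {1, 3, 6, 7},
    "p4": {2, 4, 5, 8},
}

# Average proportion of common items over all examinee pairs.
pairs = list(combinations(administrations.values(), 2))
overlap_rate = sum(len(a & b) for a, b in pairs) / (len(pairs) * test_length)

# Proportion of the pool administered to at least one examinee.
used = set().union(*administrations.values())
pool_utilization = len(used) / pool_size

print(f"Item overlap rate: {overlap_rate:.3f}")
print(f"Pool utilization:  {pool_utilization:.2f}")
```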
Peer reviewed
Ludlow, Larry H.; Bell, Karen N. – Educational and Psychological Measurement, 1996
Fifty education majors in two sections responded to an Attitudes toward Mathematics and Its Teaching (ATMAT) scale. Results with two psychometric models, classical true-score theory and the one-parameter Rasch model, supported the ATMAT's reliability, content and construct validity, and invariance over three time points. (SLD)
Descriptors: College Students, Construct Validity, Education Majors, Elementary Education
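
A sketch contrasting the two psychometric models named in this entry for a single examinee: the classical true-score summary (a raw sum score) and the dichotomous one-parameter Rasch model, under which the probability of endorsing item i is exp(theta - b_i) / (1 + exp(theta - b_i)). The responses and parameter values are hypothetical, and the dichotomous form is used purely for illustration.

```python
# Sketch: classical sum score versus one-parameter (Rasch) model expectations.
# Dichotomous responses and parameter values below are hypothetical.
import numpy as np

def rasch_prob(theta, b):
    """P(endorse) under the Rasch model: exp(theta - b) / (1 + exp(theta - b))."""
    return 1.0 / (1.0 + np.exp(-(theta - b)))

item_difficulties = np.array([-1.0, -0.3, 0.2, 0.8])  # item locations (logits)
responses = np.array([1, 1, 0, 1])                    # one examinee's responses

# Classical true-score summary: the raw sum score.
sum_score = responses.sum()

# Rasch: expected score for a person at a given location on the latent trait.
theta = 0.5
expected_score = rasch_prob(theta, item_difficulties).sum()

print("Raw sum score:", sum_score)
print(f"Rasch expected score at theta={theta}: {expected_score:.2f}")
```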
Peer reviewed
Abedi, Jamal; Baker, Eva L. – Educational and Psychological Measurement, 1995
Results from a performance assessment in which 68 high school students wrote essays support the use of latent variable modeling for estimating reliability, concurrent validity, and generalizability of a scoring rubric. The latent variable modeling approach overcomes the limitations of certain conventional statistical techniques in handling…
Descriptors: Criteria, Essays, Estimation (Mathematics), Generalizability Theory
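
A hedged sketch of one latent-variable reliability estimate in the spirit of this entry: composite reliability (McDonald's omega) computed from the standardized loadings of a single-factor model of the rubric's score dimensions. The loadings are hypothetical and the one-factor structure is assumed for illustration, not taken from the study.

```python
# Sketch: composite reliability (omega) from standardized single-factor loadings.
# omega = (sum of loadings)^2 / ((sum of loadings)^2 + sum of error variances),
# where each error variance is 1 - loading^2 for standardized indicators.
import numpy as np

loadings = np.array([0.72, 0.65, 0.80, 0.58])  # one hypothetical loading per rubric dimension
error_var = 1.0 - loadings**2

omega = loadings.sum()**2 / (loadings.sum()**2 + error_var.sum())
print(f"Composite reliability (omega): {omega:.3f}")
```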