Showing 16 to 30 of 150 results
Peer reviewed
PDF on ERIC: Download full text
Xu, Peng; Desmarais, Michel C. – International Educational Data Mining Society, 2018
In most contexts of student skills assessment, whether the test material is administered by the teacher or within a learning environment, there is a strong incentive to minimize the number of questions or exercises administered in order to get an accurate assessment. This minimization objective can be framed as a Q-matrix design problem: given a…
Descriptors: Test Items, Accuracy, Test Construction, Skills
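For context, the Q-matrix named in this abstract is the standard binary item-by-skill incidence matrix of diagnostic assessment. A minimal sketch, assuming a hypothetical three-skill item pool (the skills and items below are illustrative, not from the study):

    import numpy as np

    # Hypothetical Q-matrix: rows are items, columns are skills.
    # Q[i, k] = 1 means item i requires skill k.
    Q = np.array([
        [1, 0, 0],  # item 1 measures skill A only
        [0, 1, 0],  # item 2 measures skill B only
        [1, 1, 0],  # item 3 measures skills A and B
        [0, 0, 1],  # item 4 measures skill C only
        [1, 0, 1],  # item 5 measures skills A and C
    ])

    # A basic design check: every skill should be covered by enough
    # items to be assessed accurately with as few items as possible.
    print("items per skill:", Q.sum(axis=0))  # -> [3 2 2]

Choosing which rows to keep while preserving skill coverage is one way to frame the minimization problem the abstract describes.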
Peer reviewed
Direct link
Hill, Andrew P.; Donachie, Tracy – Journal of Psychoeducational Assessment, 2020
The measurement of perfectionistic cognitions has recently caused disagreement among researchers. Flett, Hewitt, Blankstein, and Gray proposed that perfectionistic cognitions are unidimensional. However, after re-examining the factor structure of the instrument used to measure perfectionistic automatic thoughts (Perfectionism Cognitions Inventory…
Descriptors: Factor Structure, Test Length, Cognitive Processes, Personality Traits
Peer reviewed
Direct link
Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019
This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics
Peer reviewed
Direct link
Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019
This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…
Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory
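Number-correct routing, one of the two routing methods compared in the study, reduces to a cut-score rule on the raw score from the routing stage. A minimal sketch, with invented cut scores and module names (operational cuts would be set psychometrically):

    # Hypothetical rule for a 10-item routing module.
    def route_by_number_correct(num_correct: int) -> str:
        if num_correct <= 4:
            return "easy second-stage module"
        elif num_correct <= 7:
            return "medium second-stage module"
        return "hard second-stage module"

    print(route_by_number_correct(6))  # -> "medium second-stage module"

An IRT routing rule would instead compare a provisional ability estimate against cut points on the theta scale.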
Peer reviewed
Direct link
Mameli, Consuelo; Passini, Stefano – Journal of Psychoeducational Assessment, 2019
The elusive character of student agency makes it a relevant construct to be investigated and measured. An initial effort in this direction was the Agentic Engagement Scale, a five-item instrument designed to assess the degree to which students constructively contribute to the flow of the instruction they receive from the teacher.…
Descriptors: Measures (Individuals), Test Construction, Test Validity, Learner Engagement
Peer reviewed
PDF on ERIC: Download full text
Manna, Venessa F.; Gu, Lixiong – ETS Research Report Series, 2019
When using the Rasch model, equating with a nonequivalent groups anchor test design is commonly achieved by adjustment of new form item difficulty using an additive equating constant. Using simulated 5-year data, this report compares 4 approaches to calculating the equating constants and the subsequent impact on equating results. The 4 approaches…
Descriptors: Item Response Theory, Test Items, Test Construction, Sample Size
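For readers unfamiliar with additive Rasch equating: one common variant computes the constant as the mean difficulty difference on the anchor items. The abstract does not detail the report's four approaches, so the sketch below is only that common variant, with invented values:

    import numpy as np

    # Rasch difficulties of the anchor items as calibrated on the
    # old form and, separately, on the new form (invented values).
    b_anchor_old = np.array([-0.50, 0.10, 0.80, 1.20])
    b_anchor_new = np.array([-0.35, 0.30, 0.95, 1.40])

    # Additive equating constant: mean anchor-difficulty difference.
    c = b_anchor_old.mean() - b_anchor_new.mean()

    # Shift all new-form difficulties onto the old form's scale.
    b_new_form = np.array([-1.00, -0.20, 0.55, 1.10, 1.90])
    b_new_equated = b_new_form + c
    print(round(c, 3))  # -> -0.175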
Peer reviewed
Direct link
Xiao, Yang; Koenig, Kathleen; Han, Jing; Liu, Jing; Liu, Qiaoyi; Bao, Lei – Physical Review Physics Education Research, 2019
Standardized concept inventories (CIs) have been widely used in science, technology, engineering, and mathematics education for assessment of student learning. In practice, there have been concerns regarding the length of the test and possible test-retest memory effect. To address these issues, a recent study developed a method to split a CI into…
Descriptors: Scientific Concepts, Science Tests, Energy, Magnets
Peer reviewed
Direct link
Smith, William Zachary; Dickenson, Tammiee S.; Rogers, Bradley David – AERA Online Paper Repository, 2017
Questionnaire refinement and a process for selecting items for elimination are important tools for survey developers. One of the major obstacles in questionnaire refinement and elimination in surveys lies in one's ability to adequately and appropriately reconstruct a survey. Oftentimes, surveys can be long and strenuous on the respondent,…
Descriptors: Surveys, Psychometrics, Test Construction, Test Reliability
Peer reviewed
Direct link
Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017
There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…
Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests
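As a concrete illustration of mastery/nonmastery classification (the DINA-style item rule and parameter values below are illustrative assumptions, not the authors' model):

    from itertools import product

    # With K attributes there are 2**K candidate mastery profiles.
    K = 3
    profiles = list(product([0, 1], repeat=K))
    print(len(profiles))  # -> 8

    # DINA-style rule: an item requiring the first two attributes is
    # answered correctly with high probability only if both are
    # mastered; otherwise the examinee is reduced to guessing.
    def p_correct(profile, required=(1, 1, 0), guess=0.2, slip=0.1):
        has_all = all(p >= r for p, r in zip(profile, required))
        return 1 - slip if has_all else guess

    print(p_correct((1, 1, 0)))  # -> 0.9
    print(p_correct((1, 0, 1)))  # -> 0.2

Classification then assigns each examinee the profile that best explains their observed responses.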
Peer reviewed
PDF on ERIC: Download full text
Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021
MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…
Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students
Peer reviewed
Direct link
Feher, Anita; Smith, Martin M.; Saklofske, Donald H.; Plouffe, Rachel A.; Wilson, Claire A.; Sherry, Simon B. – Journal of Psychoeducational Assessment, 2020
The Big Three Perfectionism Scale (BTPS) is a 45-item self-report measure of perfectionism with three overarching factors: rigid, self-critical, and narcissistic perfectionism. Our objective was to create a brief version of the BTPS, the Big Three Perfectionism Scale--Short Form (BTPS-SF). Sixteen items were selected, and confirmatory factor…
Descriptors: Personality Measures, Personality Traits, Test Construction, Measurement Techniques
Peer reviewed
PDF on ERIC: Download full text
Sahin, Alper; Anil, Duygu – Educational Sciences: Theory and Practice, 2017
This study investigates the effects of sample size and test length on item-parameter estimation in test development utilizing three unidimensional dichotomous models of item response theory (IRT). For this purpose, a real language test composed of 50 items was administered to 6,288 students. Data from this test was used to obtain data sets of…
Descriptors: Test Length, Sample Size, Item Response Theory, Test Construction
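For reference, the most general of the three dichotomous IRT models is the three-parameter logistic (3PL); setting the guessing parameter c to 0 gives the 2PL, and additionally fixing a = 1 gives the Rasch model. A minimal simulation sketch with invented item parameters (only the sample size of 6,288 is taken from the abstract):

    import numpy as np

    rng = np.random.default_rng(0)

    def p_3pl(theta, a, b, c):
        # P(correct) = c + (1 - c) / (1 + exp(-a * (theta - b)))
        return c + (1 - c) / (1 + np.exp(-a * (theta - b)))

    a, b, c = 1.2, 0.0, 0.2         # discrimination, difficulty, guessing
    thetas = rng.normal(size=6288)  # simulated examinee abilities
    responses = rng.random(6288) < p_3pl(thetas, a, b, c)
    print(responses.mean())         # observed proportion correct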
Peer reviewed
Direct link
Zaporozhets, Olga; Fox, Christine M.; Beltyukova, Svetlana A.; Laux, John M.; Piazza, Nick J.; Salyers, Kathleen – Measurement and Evaluation in Counseling and Development, 2015
This study aimed to develop a linear measure of change using University of Rhode Island Change Assessment items that represented Prochaska and DiClemente's theory. The resulting Toledo Measure of Change is short, is easy to use, and provides reliable scores for identification of individuals' stage of change and progression within that stage.
Descriptors: Item Response Theory, Change, Measures (Individuals), Test Construction
Jacob, Brian A. – Center on Children and Families at Brookings, 2016
Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…
Descriptors: Scores, Common Core State Standards, Test Length, Test Content
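To make the point concrete: under an IRT model the reported score is an ability estimate whose value depends on which items were administered, not just on the percent answered correctly. A minimal sketch of maximum-likelihood scoring under the Rasch model (item difficulties invented):

    import numpy as np

    def theta_mle(x, b, grid=np.linspace(-4, 4, 801)):
        # Grid-search maximum-likelihood ability under the Rasch model.
        p = 1 / (1 + np.exp(-(grid[:, None] - b[None, :])))
        loglik = (x * np.log(p) + (1 - x) * np.log(1 - p)).sum(axis=1)
        return grid[np.argmax(loglik)]

    x = np.array([1, 1, 1, 0, 0])  # 60% correct on either form
    easy_form = np.array([-2.0, -1.5, -1.0, -0.5, 0.0])
    hard_form = np.array([0.0, 0.5, 1.0, 1.5, 2.0])

    print(theta_mle(x, easy_form))  # lower ability estimate
    print(theta_mle(x, hard_form))  # higher ability estimate

The same 60% correct yields different reported scores on forms of different difficulty, which is the designer-dependence the abstract highlights.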
Peer reviewed
PDF on ERIC: Download full text
Li, Feifei – ETS Research Report Series, 2017
An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…
Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement
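The underestimation described here follows from how IRT standard errors are computed. Under conditional independence, a 2PL test's information is the sum of the item informations (a standard result, stated for context):

    I(\theta) = \sum_j a_j^2 \, P_j(\theta)\,[1 - P_j(\theta)],
    \qquad
    \mathrm{SE}(\hat{\theta}) = 1 / \sqrt{I(\theta)}

When items within a testlet share variance beyond theta, this sum overstates the information actually available, so the standard error comes out too small; the GT-based correction adjusts for that dependence.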