ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	8

Descriptor

Reliability	12
Validity	9
Construct Validity	4
Foreign Countries	4
Test Theory	4
Correlation	3
Factor Structure	3
Item Response Theory	3
Scores	3
Classification	2
Error of Measurement	2
Factor Analysis	2
Models	2
Multiple Regression Analysis	2
Probability	2
Psychometrics	2
Self Concept	2
Ability	1
Academic Achievement	1
Adaptive Testing	1
Adult Education	1
Adult Students	1
Asians	1
Bias	1
Chinese	1
More ▼

Source

International Journal of…

Publication Type

Journal Articles	12
Reports - Research	6
Reports - Evaluative	5
Reports - Descriptive	1

Education Level

Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Secondary Education	1

Audience

Location

Hong Kong	2
Italy	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Motivated Strategies for…	1
Self Description Questionnaire	1
United States Medical…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

The Effect of Data Format on Integration of Performance Data into Angoff Judgments

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013

This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard settings (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…

Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability

Validating the Chinese Version of the Inventory of School Motivation

Peer reviewed

Direct link

King, Ronnel B.; Watkins, David A. – International Journal of Testing, 2013

The aim of this study is to assess the cross-cultural applicability of the Chinese version of the Inventory of School Motivation (ISM; McInerney & Sinclair, 1991) in the Hong Kong context using both within-network and between-network approaches to construct validation. The ISM measures four types of achievement goals: mastery, performance,…

Descriptors: Factor Analysis, Reliability, Learning Motivation, Foreign Countries

Revising the Rorschach Ego Impairment Index to Accommodate Recent Recommendations about Improving Rorschach Validity

Peer reviewed

Direct link

Viglione, Donald J.; Perry, William; Giromini, Luciano; Meyer, Gregory J. – International Journal of Testing, 2011

We used multiple regression to calculate a new Ego Impairment Index (EII-3). The aim was to incorporate changes in the component variables and distribution of the number of responses as found in the new Rorschach Performance Assessment System, while sustaining the validity and reliability of previous EIIs. The EII-3 formula was derived from a…

Descriptors: Test Items, Self Concept, Validity, Evaluation

Adaptation and Analysis of Motivated Strategies for Learning Questionnaire in the Chinese Setting

Peer reviewed

Direct link

Lee, John Chi-kin; Yin, Hongbiao; Zhang, Zhonghua – International Journal of Testing, 2010

This article reports the adaptation and analysis of Pintrich's Motivated Strategies for Learning Questionnaire (MSLQ) in Hong Kong. First, this study examined the psychometric qualities of the existing Chinese version of MSLQ (MSLQ-CV). Based on this examination, this study developed a revised Chinese version of MSLQ (MSLQ-RCV) for junior…

Descriptors: Foreign Countries, Questionnaires, Psychometrics, Secondary School Students

A Monte Carlo Simulation Investigating the Validity and Reliability of Ability Estimation in Item Response Theory with Speeded Computer Adaptive Tests

Peer reviewed

Direct link

Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010

Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…

Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing

Correcting Fallacies in Validity, Reliability, and Classification

Peer reviewed

Direct link

Sijtsma, Klaas – International Journal of Testing, 2009

This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…

Descriptors: Construct Validity, Reliability, Classification, Test Theory

Cross-Cultural Validity of the TIMSS-1999 Mathematics Test: Verification of a Cognitive Model

Peer reviewed

Direct link

Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008

As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…

Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores

Does It Matter if You "Kill" the Patient or Order Too Many Tests? Scoring Alternatives for a Test of Clinical Reasoning Skill

Peer reviewed

Direct link

Childs, Ruth A.; Dunn, Jennifer L.; van Barneveld, Christina; Jaciw, Andrew P. – International Journal of Testing, 2007

This study compares five scoring approaches for a test of clinical reasoning skills. All of the approaches incorporate information about the correct item responses selected and the errors, such as selecting too many responses or selecting a response that is inappropriate and/or harmful to the patient. The approaches are combinations of theoretical…

Descriptors: Scoring, Clinical Diagnosis, Thinking Skills, Reliability

IRT Item Parameters and the Reliability and Validity of Pretest, Posttest, and Gain Scores

Peer reviewed

Direct link

May, Kim; Jackson, Tameika S. – International Journal of Testing, 2005

The effect of different combinations of item response theory (IRT) item parameters (item difficulty, item discrimination, and the guessing probability) on the reliability and construct validity (correlation with the latent trait being measured) of pretest, posttest, and gain scores was analytically examined using the 3-parameter logistic (3PL)…

Descriptors: Pretests Posttests, Guessing (Tests), Probability, Scores

The Geometry of Probability, Statistics, and Test Theory.

Peer reviewed

Zimmerman, Donald W.; Zumbo, Bruno D. – International Journal of Testing, 2001

Presents a model of tests and measurement that identifies test scores with Hilbert space vectors and true and error components of scores with linear operators. This geometric point of view brings to light relations among elementary concepts in test theory, including reliability, validity, and parallel tests. (Author/SLD)

Descriptors: Models, Probability, Reliability, Scores

Italian Version of the Self-Description Questionnaire-III.

Peer reviewed

Maggi, Stefania – International Journal of Testing, 2001

Developed an Italian version of the Self-Description Questionnaire (SDQ-III) and studied the reliability and factorial validity of this translated instrument. Results show that the translated version has psychometric properties similar to those of the original English version. (SLD)

Descriptors: Factor Structure, Foreign Countries, Psychometrics, Reliability

Evaluation of Web-Based Educational Programs via the Demand-Driven Learning Model: A Measure of Web-Based Learning.

Peer reviewed

MacDonald, Colla J.; Breithaupt, Krista; Stodel, Emma J.; Farres, Laura G.; Gabriel, Martha A. – International Journal of Testing, 2002

Developed and tested an online survey to assess Web-based learning (WBL) educational programs, extending theoretical work on the Demand Driven Learning Model. Data from 93 adult learners from 3 WBL programs found high internal reliability and adequate construct validity for the 5 scales of the online measure. (SLD)

Descriptors: Adult Education, Adult Students, Distance Education, Educational Demand

Breithaupt, Krista	1
Chen, Yi-Hsin	1
Childs, Ruth A.	1
Clauser, Brian E.	1
Dunn, Jennifer L.	1
Farres, Laura G.	1
Gabriel, Martha A.	1
Giromini, Luciano	1
Gorin, Joanna S.	1
Jaciw, Andrew P.	1
Jackson, Tameika S.	1
King, Ronnel B.	1
Lee, John Chi-kin	1
MacDonald, Colla J.	1
Maggi, Stefania	1
Margolis, Melissa J.	1
May, Kim	1
Mee, Janet	1
Meyer, Gregory J.	1
Perry, William	1
Sass, D. A.	1
Schmitt, T. A.	1
Sijtsma, Klaas	1
Stodel, Emma J.	1
Sullivan, J. R.	1
More ▼