ERIC - Search Results

Publication Date

In 2026	0
Since 2025	12
Since 2022 (last 5 years)	114
Since 2017 (last 10 years)	375
Since 2007 (last 20 years)	1130

Descriptor

Comparative Analysis	1943
Reliability	880
Test Reliability	792
Foreign Countries	554
Test Validity	443
Correlation	350
Validity	332
Interrater Reliability	327
Statistical Analysis	321
Scores	280
Measures (Individuals)	236
Evaluation Methods	212
Higher Education	201
Psychometrics	180
Questionnaires	165
Factor Analysis	161
Test Construction	160
College Students	159
English (Second Language)	149
Student Attitudes	141
Test Items	136
Second Language Learning	133
Scoring	130
Rating Scales	127
Student Evaluation	125
More ▼

Education Level

Higher Education	360
Postsecondary Education	285
Secondary Education	150
Elementary Education	135
Elementary Secondary Education	73
High Schools	68
Middle Schools	61
Early Childhood Education	41
Junior High Schools	34
Grade 8	29
Preschool Education	25
Grade 7	24
Intermediate Grades	24
Grade 4	22
Grade 5	20
Grade 6	20
Kindergarten	20
Primary Education	20
Adult Education	19
Grade 10	16
Grade 11	12
Grade 12	10
Grade 2	10
Grade 3	10
Grade 9	10
More ▼

Audience

Researchers	35
Practitioners	29
Teachers	15
Administrators	9
Policymakers	6
Counselors	2
Media Staff	2
Parents	1
Support Staff	1

Location

Turkey	59
United States	47
Australia	36
China	33
Canada	32
United Kingdom (England)	32
United Kingdom	28
Germany	25
Netherlands	24
Taiwan	22
Hong Kong	20
Iran	20
Spain	17
Belgium	15
California	15
Florida	13
Finland	12
Greece	12
Sweden	12
Texas	12
Indonesia	11
Japan	11
Jordan	11
Malaysia	11
Portugal	11
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	6
Every Student Succeeds Act…	2
Individuals with Disabilities…	2
Americans with Disabilities…	1
Comprehensive Employment and…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Race to the Top	1
Temporary Assistance for…	1

What Works Clearinghouse Rating

Meets WWC Standards with or without Reservations	1
Does not meet standards	1

Comparative Analysis X

Showing 1,081 to 1,095 of 1,943 results Save | Export

A Study of the Comparability of Speaking Proficiency Interview Ratings across Three Government Language Training Agencies.

Download full text

Clark, John L. D. – 1986

A study of the reliability of the proficiency ratings scale and techniques used by three federal government agencies--the Central Intelligence Agency, the Defense Language Institute, and the Foreign Service Institute (FSI)--to test employees' oral language proficiency in French and German had two randomly selected two-person teams of testers from…

Descriptors: Comparative Analysis, Federal Government, French, German

Rater Reliability of the ACTFL Oral Proficiency Interview.

Peer reviewed

Magnan, Sally Sieloff – Canadian Modern Language Review, 1987

Differences in procedures used by academic institutions and government agencies in administering the American Council on the Teaching of Foreign Languages' Oral Proficiency Interview test are examined, and results and implications of two studies of interrater reliability are discussed. (MSE)

Descriptors: Comparative Analysis, Correlation, Evaluation Methods, Evaluators

Why Generalizability Theory Yields Better Results than Classical Test Theory.

Download full text

Eason, Sandra H. – 1989

Generalizability theory provides a technique for accurately estimating the reliability of measurements. The power of this theory is based on the simultaneous analysis of multiple sources of error variances. Equally important, generalizability theory considers relationships among the sources of measurement error. Just as multivariate inferential…

Descriptors: Comparative Analysis, Generalizability Theory, Test Reliability, Test Theory

A Modified Rule of Thumb for Evaluating Scale Reproducibilities Determined by Electronic Computers

Peer reviewed

Hofmann, Richard J. – Educational and Psychological Measurement, 1978

The Goodenough technique for determining scale error is compared to the Guttman technique and demonstrated to be more conservative than the Guttman technique. Implications with regard to Guttman's evaluative rule of thumb for evaluating a reproducibility are noted. (Author)

Descriptors: Comparative Analysis, Rating Scales, Statistical Analysis, Test Reliability

Anonymous Reports of Child Physical Abuse: Are They as Serious as Reports from Other Sources?

Zuravin, Susan J.; And Others – Child Abuse and Neglect: The International Journal, 1987

Anonymous reports (n=155) of child physical abuse in Baltimore (MD) were compared with reports made by professionals (n=588) and nonprofessionals (n=262) in terms of substantiation rate, seriousness of substantiated incidents, and severity of allegations. While anonymous reports were more likely to be unfounded, those that were substantiated were…

Descriptors: Child Abuse, Comparative Analysis, Professional Personnel, Reliability

Comparison of the Strong Vocational Interest Blank and the Kuder Occupational Interest Survey Scoring Procedures

Lefkowitz, David M. – J Counseling Psychol, 1970

Comparisons made in errors of classifications, percentage of overlapping, correlation of identical scales scored by both scoring procedures, intercorrelations of scales, and the ranking of scale scores within each subject showed that the two scoring systems produced different interest scores. (Author)

Descriptors: Comparative Analysis, Evaluation, Interest Inventories, Reliability

A Comparison of Three Indexes of Agreement between Observers: Proportion of Agreement, G-Index, and Kappa.

Peer reviewed

Green, Samuel B. – Educational and Psychological Measurement, 1981

The proportion of agreement, G, and kappa indexes are shown to differ in how they correct for chance agreements between two observers. On the basis of the findings, it is suggested that no single agreement index is appropriate for all sets of data. (Author/BW)

Descriptors: Comparative Analysis, Measurement Techniques, Test Reliability, Testing Problems

Testing the Equality of Two Related Intraclass Reliability Coefficients.

Peer reviewed

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1994

An approximate statistical test of the equality of two intraclass reliability coefficients based on the same sample of people is derived. Such a test is needed when a researcher wishes to compare the reliability of two measurement procedures, and both procedures can be applied to results from the same group. (SLD)

Descriptors: Comparative Analysis, Measurement Techniques, Reliability, Sampling

Screening for Pragmatic Language Impairment: The Potential of the Children's Communication Checklist

Peer reviewed

Direct link

Ketelaars, Mieke P.; Cuperus, Juliane M.; van Daal, John; Jansonius, Kino; Verhoeven, Ludo – Research in Developmental Disabilities: A Multidisciplinary Journal, 2009

The present study examines the validity of the Dutch Children's Communication Checklist (CCC) for children in kindergarten in a community sample, in order to assess the feasibility of using it as a screening instrument in the general population. Teachers completed the CCC for a representative sample of 1396 children at kindergarten level, taken…

Descriptors: Check Lists, Emotional Problems, Language Impairments, Construct Validity

Exploring Rater Behaviour with Rasch Techniques.

Download full text

McNamara, T. F.; Adams, R. J. – 1991

A preliminary study is reported of the use of new multifaceted Rasch measurement mechanisms for investigating rater characteristics in language testing. Ratings from four judges of scripts from 50 candidates taking the International English Language Testing System test, a test of English for Academic Purposes, are analyzed. The analysis…

Descriptors: Comparative Analysis, English (Second Language), Foreign Countries, Interrater Reliability

An Empirical Comparison of Three- and Four-Choice Items and Tests: Susceptibility to Testwiseness and Internal Consistency Reliability.

Peer reviewed

Rogers, W. Todd; Harley, Dwight – Educational and Psychological Measurement, 1999

Examined item-level and test-level characteristics for items in a high-stakes school-leaving mathematics examination. Results from 158 students show that the influence of testwiseness is lessened when three-option items are used. Tests of three-option items are at least equivalent to four-option item tests in terms of internal-consistency score…

Descriptors: Comparative Analysis, High School Students, High Schools, High Stakes Tests

Specificity of a Maximal Step Exercise Test

Peer reviewed

Direct link

Darby, Lynn A.; Marsh, Jennifer L.; Shewokis, Patricia A.; Pohlman, Roberta L. – Measurement in Physical Education and Exercise Science, 2007

To adhere to the principle of "exercise specificity" exercise testing should be completed using the same physical activity that is performed during exercise training. The present study was designed to assess whether aerobic step exercisers have a greater maximal oxygen consumption (max VO sub 2) when tested using an activity specific, maximal step…

Descriptors: Metabolism, Physical Activities, Exercise Physiology, Females

Comparing the Impact of Accountability Examinations on Mississippi and Tennessee Social Studies Teachers' Instructional Practices

Peer reviewed

Direct link

Vogler, Kenneth E. – Educational Assessment, 2008

This study compared the impact of state accountability examinations on social studies teachers' instructional practices. Data were obtained from a survey instrument given to a representative sample of Mississippi teachers who teach the same content tested on their state's high-stakes high school graduation examination and a representative sample…

Descriptors: Academic Achievement, High Stakes Tests, Accountability, Social Studies

The Serial Use of Child Neurocognitive Tests: Development versus Practice Effects

Peer reviewed

Direct link

Slade, Peter D.; Townes, Brenda D.; Rosenbaum, Gail; Martins, Isabel P.; Luis, Henrique; Bernardo, Mario; Martin, Michael D.; DeRouen, Timothy A. – Psychological Assessment, 2008

When serial neurocognitive assessments are performed, 2 main factors are of importance: test-retest reliability and practice effects. With children, however, there is a third, developmental factor, which occurs as a result of maturation. Child tests recognize this factor through the provision of age-corrected scaled scores. Thus, a ready-made…

Descriptors: Validity, Diagnostic Tests, Test Reliability, Children

A Scale to Assist the Diagnosis of Autism and Asperger's Disorder in Adults (RAADS): A Pilot Study

Peer reviewed

Direct link

Ritvo, Riva Ariella; Ritvo, Edward R.; Guthrie, Donald; Yuwiler, Arthur; Ritvo, Max Joseph; Weisbender, Leo – Journal of Autism and Developmental Disorders, 2008

An empirically based 78 question self-rating scale based on DSM-IV-TR and ICD-10 criteria was developed to assist clinicians' diagnosis of adults with autism and Asperger's Disorder-the Ritvo Autism and Asperger's Diagnostic Scale (RAADS). It was standardized on 17 autistic and 20 Asperger's Disorder and 57 comparison subjects. Both autistic and…

Descriptors: Autism, Asperger Syndrome, Content Validity, Test Reliability

« Previous Page | Next Page »

Pages: 1 | ... | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | ... | 130

Educational and Psychological…	64
ProQuest LLC	59
Journal of Speech, Language,…	31
Online Submission	27
Journal of Educational…	22
Language Testing	21
Measurement in Physical…	21
ETS Research Report Series	17
Journal of Autism and…	16
Journal of Psychoeducational…	16
Educational Research and…	15
Assessment & Evaluation in…	14
Measurement and Evaluation in…	14
Psychology in the Schools	14
Journal of Consulting and…	12
International Education…	11
Journal of Education and…	11
Psychological Assessment	11
Research in Developmental…	11
Applied Measurement in…	10
Applied Psychological…	10
Educational Sciences: Theory…	10
Advances in Health Sciences…	9
Assessment in Education:…	9
Psychometrika	9
More ▼

Reckase, Mark D.	6
Attali, Yigal	5
Coniam, David	5
Brennan, Robert L.	4
Crehan, Kevin D.	4
Feldt, Leonard S.	4
Hakstian, A. Ralph	4
Jones, Ian	4
Kolen, Michael J.	4
Lunz, Mary E.	4
August, Diane	3
Bashaw, W. L.	3
Bennett, Randy Elliot	3
Benson, Jeri	3
Betz, Nancy E.	3
Ebel, Robert L.	3
Fletcher, Jack M.	3
Francis, David J.	3
Frisbie, David A.	3
Haberman, Shelby	3
Haladyna, Tom	3
Hambleton, Ronald K.	3
Henk, William A.	3
Iwata, Brian A.	3
More ▼

Journal Articles	1365
Reports - Research	1333
Reports - Evaluative	286
Speeches/Meeting Papers	165
Tests/Questionnaires	81
Reports - Descriptive	63
Dissertations/Theses -…	61
Information Analyses	55
Opinion Papers	30
Numerical/Quantitative Data	19
Collected Works - General	8
Books	7
Collected Works - Proceedings	5
Guides - Non-Classroom	5
Book/Product Reviews	4
Dissertations/Theses -…	4
Collected Works - Serials	3
Guides - General	2
Collected Works - Serial	1
Dissertations/Theses	1
Guides - Classroom - Teacher	1
Historical Materials	1
Non-Print Media	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Wechsler Intelligence Scale…	16
Peabody Picture Vocabulary…	13
Woodcock Johnson Tests of…	11
SAT (College Admission Test)	10
Test of English as a Foreign…	10
Wechsler Adult Intelligence…	10
Program for International…	9
Minnesota Multiphasic…	8
National Assessment of…	8
Torrance Tests of Creative…	7
Trends in International…	7
Wide Range Achievement Test	7
Autism Diagnostic Observation…	6
ACT Assessment	5
Raven Progressive Matrices	5
Self Directed Search	5
Center for Epidemiologic…	4
Dynamic Indicators of Basic…	4
Early Childhood Environment…	4
General Educational…	4
Graduate Record Examinations	4
Iowa Tests of Basic Skills	4
Metropolitan Achievement Tests	4
Rosenberg Self Esteem Scale	4
Social Skills Rating System	4
More ▼