ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	5

Descriptor

Evaluation Methods	26
Mathematical Models	26
Test Reliability	12
Interrater Reliability	9
Statistical Analysis	9
Reliability	7
Correlation	5
Latent Trait Theory	5
Higher Education	4
Measurement Techniques	4
Performance Based Assessment	4
Probability	4
Rating Scales	4
Scores	4
Test Validity	4
Comparative Analysis	3
Computer Assisted Testing	3
Difficulty Level	3
Educational Assessment	3
Elementary Secondary Education	3
Equations (Mathematics)	3
Error of Measurement	3
Estimation (Mathematics)	3
Evaluators	3
Foreign Countries	3
More ▼

Source

Multivariate Behavioral…	2
Applied Psychological…	1
Educational Measurement:…	1
Educational and Psychological…	1
European Journal of…	1
Evaluation and Program…	1
Florida Journal of…	1
International Association for…	1
International Journal of…	1
Journal of Educational…	1
Language Learning	1
Research Synthesis Methods	1
More ▼

Publication Type

Journal Articles	12
Reports - Evaluative	10
Reports - Research	10
Speeches/Meeting Papers	4
Collected Works - Proceedings	2
Guides - Non-Classroom	2
Opinion Papers	2
Reports - Descriptive	2
Collected Works - Serials	1
Guides - General	1
Numerical/Quantitative Data	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1

Audience

Practitioners	1
Researchers	1

Location

Germany	2
Asia	1
Australia	1
Brazil	1
California	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Florida	1
Greece	1
Hawaii	1
Ireland	1
Israel	1
Italy	1
Japan	1
Kazakhstan	1
Netherlands	1
Norway	1
Ohio	1
Pakistan	1
Pennsylvania	1
Philippines	1
Portugal	1
Russia	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

NEO Personality Inventory

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Enhancing Recall in Automated Record Screening: A Resampling Algorithm

Peer reviewed

Direct link

Zhipeng Hou; Elizabeth Tipton – Research Synthesis Methods, 2024

Literature screening is the process of identifying all relevant records from a pool of candidate paper records in systematic review, meta-analysis, and other research synthesis tasks. This process is time consuming, expensive, and prone to human error. Screening prioritization methods attempt to help reviewers identify most relevant records while…

Descriptors: Meta Analysis, Research Reports, Identification, Evaluation Methods

Trust Decision Model and Trust Evaluation Model for Quality Web Service Identification in Web Service Lifecycle Using QSW Data Analysis

Peer reviewed

Direct link

Raj, Gaurav; Mahajan, Manish; Singh, Dheerendra – International Journal of Web-Based Learning and Teaching Technologies, 2020

In secure web application development, the role of web services will not continue if it is not trustworthy. Retaining customers with applications is one of the major challenges if the services are not reliable and trustworthy. This article proposes a trust evaluation and decision model where the authors have defined indirect attribute, trust,…

Descriptors: Trust (Psychology), Models, Decision Making, Computer Software

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Download full text

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

Lexical Frequency Profiles and Zipf's Law

Peer reviewed

Direct link

Edwards, Roderick; Collins, Laura – Language Learning, 2011

Laufer and Nation (1995) proposed that the Lexical Frequency Profile (LFP) can estimate the size of a second-language writer's productive vocabulary. Meara (2005) questioned the sensitivity and the reliability of LFPs for estimating vocabulary sizes, based on the results obtained from probabilistic simulations of LFPs. However, the underlying…

Descriptors: Mathematical Models, Word Frequency, Profiles, Second Language Learning

Agreement of Personality Profiles across Observers.

Peer reviewed

McCrae, Robert R. – Multivariate Behavioral Research, 1993

To assess cross-observer agreement on personality profiles, an Index of Profile Agreement and an associated coefficient are proposed that take into account both the difference between the ratings and the extremes of their mean. Data from the Revised NEO Personality Inventory for 250 peer ratings/self-reports and 68 spouse ratings/self-reports…

Descriptors: Adults, Comparative Analysis, Equations (Mathematics), Evaluation Methods

Latent Structure Agreement Analysis. A RAND Note.

Download full text

Uebersax, John; Grove, Will – 1989

Methods of probability modeling to analyze rater agreement are described, emphasizing their basic similarities and viewing them as variants of a common methodology. Statistical techniques for analyzing agreement data are described to address questions such as how many opinions are required to make a medical diagnosis with necessary accuracy. Kappa…

Descriptors: Clinical Diagnosis, Correlation, Estimation (Mathematics), Evaluation Methods

Considerations Related to the Usefulness of the Performance Indicators in Dyer's Student Change Model of an Educational System.

Download full text

Forsyth, Robert A. – 1972

In Dyer's Student Change Model, performance indicators (PIs) are utilized as a measure of the effectiveness of educational programs. These PIs are examined and analyzed in some detail in this paper. In particular, the reliability of the residuals, the sensitivity of the measure to sample size, the random half reliability, and the stability of the…

Descriptors: Accountability, Evaluation Methods, Input Output Analysis, Mathematical Models

Measuring Academic Efficiency at the School Level.

Marzano, Robert J.; Hutchins, C. L. – 1981

In this paper, academic efficiency is operationally defined and a methodology for measuring it at the school level is described. Academic efficiency is defined as the extent to which a school utilizes its time for the academic development of all its students. The measure of academic efficiency must include three elements: time, students, and…

Descriptors: Classroom Observation Techniques, Elementary Secondary Education, Evaluation Methods, Mathematical Models

A Study on the Consistency of Stakeholder Preferences for Different Types of Information in Evaluating Police Services.

Peer reviewed

Deutsch, Stuart Jay; Malmborg, Charles J. – Evaluation and Program Planning, 1986

A questionnaire is designed to allow assessment of a simple additive value function for testing respondent preferences for different types of information used in evaluating police services. Responses are analyzed to determine what types of information different stakeholder groups consider useful. (Author/LMO)

Descriptors: Adults, Analysis of Variance, Data Collection, Evaluation Methods

A Critical Examination of the "Reliability" and "Abnormality" Approaches to the Evaluation of Subtest Score Differences.

Peer reviewed

Cahan, Sorel – Educational and Psychological Measurement, 1989

Statistical significance and "abnormality" have been used as criteria for the evaluation of intra-individual subtest score differences. Shortcomings of these criteria are identified, and improved estimates of the true score differences are suggested. The applicability of the abnormality criterion to these improved estimates is reviewed.…

Descriptors: Estimation (Mathematics), Evaluation Methods, Individual Differences, Mathematical Models

A Theory-based Comparison of the Reliabilities of Fixed-length and Trials-to-criterion Scoring of Physical Education Skills Tests.

Download full text

Feldt, Leonard S. – 1983

This paper considers, from a theoretical point of view, two measurement approaches used in measuring success and failure in skills tests in physical education. The first, "fixed length" (FL) testing, entails counting the number of successful performances in a fixed number of trials. The second, "trials-to-criterion" (TTC)…

Descriptors: Evaluation Methods, Mathematical Formulas, Mathematical Models, Measurement Techniques

Rater Stringency Error in Performance Rating: A Contrast of Three Models.

Download full text

Cason, Gerald J.; Cason, Carolyn L. – 1989

The use of three remedies for errors in the measurement of ability that arise from differences in rater stringency is discussed. Models contrasted are: (1) Conventional; (2) Handicap; and (3) deterministic Rater Response Theory (RRT). General model requirements, power, bias of measures, computing cost, and complexity are contrasted. Contrasts are…

Descriptors: Ability, Achievement Rating, Error of Measurement, Evaluation Methods

Analysis of Multitrait-Multirater Performance Appraisal Data: Composite Direct Product Method versus Confirmatory Factor Analysis.

Peer reviewed

Goffin, Richard D.; Jackson, Douglas N. – Multivariate Behavioral Research, 1992

The way in which trait and rater variance combine in multitrait-multirater (MTMR) performance appraisal data is explored. Implications of the confirmatory factor analytic model and the composite direct product (CDP) model for MTMR data are examined. Superior fit of the CDP model for four data sets is discussed. (SLD)

Descriptors: Equations (Mathematics), Evaluation Methods, Goodness of Fit, Interrater Reliability

R. & D. in Psychometrics: Technical Reports on Latent Structure Models.

Download full text

Wilcox, Rand R. – 1982

This document contains three papers from the Methodology Project of the Center for the Study of Evaluation. Methods for characterizing test accuracy are reported in the first two papers. "Bounds on the K Out of N Reliability of a Test, and an Exact Test for Hierarchically Related Items" describes and illustrates how an extension of a…

Descriptors: Educational Testing, Evaluation Methods, Guessing (Tests), Latent Trait Theory

Selection of Judges for Standard-Setting.

Peer reviewed

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991

Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)

Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners

Previous Page | Next Page »

Pages: 1 | 2

Cason, Gerald J.	2
Cahan, Sorel	1
Cason, Carolyn L.	1
Cohen, Allan S., Comp.	1
Collins, Laura	1
Deutsch, Stuart Jay	1
Edwards, Roderick	1
Elizabeth Tipton	1
Feldt, Leonard S.	1
Forsyth, Robert A.	1
Goffin, Richard D.	1
Gorbunova, Tatiana N.	1
Grossen, Neal E.	1
Grove, Will	1
Houston, Walter M.	1
Hutchins, C. L.	1
Izard, J. F.	1
Jackson, Douglas N.	1
Jaeger, Richard M.	1
Mahajan, Manish	1
Malmborg, Charles J.	1
Marzano, Robert J.	1
McCrae, Robert R.	1
McLarty, Joyce	1
More ▼