Descriptor
Publication Type
| Journal Articles | 3 |
| Reports - Research | 2 |
| Opinion Papers | 1 |
| Reports - Descriptive | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedYang, Yiming; Wilbur, John – Journal of the American Society for Information Science, 1996
Studies aggressive automated word removal in text categorization in large databases based on corpus statistics to reduce the noise in free texts and to enhance the computational efficiency of categorization. Topics include stop word identification, categorization methods for comparison, tests on four document collections, and evaluation…
Descriptors: Comparative Analysis, Databases, Evaluation Methods, Information Retrieval
Peer reviewedWilbur, W. John – Journal of the American Society for Information Science, 1992
Describes a procedure for information retrieval testing that is based on the comparison of statistically independent methods of retrieval applied to the same database. The probability ranking principle is discussed, the statistical meaning of relevance is examined, and the methodology is illustrated on a large database of MEDLINE records. (19…
Descriptors: Comparative Analysis, Databases, Information Retrieval, Mathematical Formulas
Peer reviewedLosee, Robert M., Jr. – Information Processing and Management, 1994
Studies the performance of probabilistic information retrieval systems using differing statistical dependence assumptions when estimating the probabilities inherent in the retrieval model. Experimental results using the Bahadur Lazarsfeld expansion on the Cystic Fibrosis database are discussed that suggest that incorporating term dependence…
Descriptors: Cystic Fibrosis, Databases, Information Retrieval, Information Systems


