Showing all 3 results
Peer reviewed
Yang, Yiming; Wilbur, John – Journal of the American Society for Information Science, 1996
Studies aggressive automated word removal in text categorization in large databases based on corpus statistics to reduce the noise in free texts and to enhance the computational efficiency of categorization. Topics include stop word identification, categorization methods for comparison, tests on four document collections, and evaluation…
Descriptors: Comparative Analysis, Databases, Evaluation Methods, Information Retrieval
Peer reviewed
Wilbur, W. John – Journal of the American Society for Information Science, 1992
Describes a procedure for information retrieval testing that is based on the comparison of statistically independent methods of retrieval applied to the same database. The probability ranking principle is discussed, the statistical meaning of relevance is examined, and the methodology is illustrated on a large database of MEDLINE records. (19…
Descriptors: Comparative Analysis, Databases, Information Retrieval, Mathematical Formulas
Peer reviewed
Losee, Robert M., Jr. – Information Processing and Management, 1994
Studies the performance of probabilistic information retrieval systems using differing statistical dependence assumptions when estimating the probabilities inherent in the retrieval model. Experimental results using the Bahadur Lazarsfeld expansion on the Cystic Fibrosis database are discussed that suggest that incorporating term dependence…
Descriptors: Cystic Fibrosis, Databases, Information Retrieval, Information Systems