NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Hamers, Lieve; And Others – Information Processing and Management, 1989
Describes two similarity measures used in citation and co-citation analysis--the Jaccard index and Salton's cosine formula--and investigates the relationship between the two measures. It is shown that Salton's formula yields a numerical value that is twice Jaccard's index in most cases, and an explanation is offered. (13 references) (CLB)
Descriptors: Citation Analysis, Comparative Analysis, Mathematical Formulas, Measurement
Peer reviewed Peer reviewed
Uratani, Noriyoshi; Takeda, Masayuki – Information Processing and Management, 1993
Describes a string-searching algorithm for multiple patterns in a text string; explains the construction of a pattern-matching machine; presents a theoretical analysis and empirical evidence that supports the sublinearity of the algorithm; and compares this algorithm with the Boyer-Moore algorithm for a single pattern. (Contains 10 references.)…
Descriptors: Algorithms, Comparative Analysis, Information Retrieval, Mathematical Formulas
Peer reviewed Peer reviewed
Howard, Paul G.; Vitter, Jeffrey Scott – Information Processing and Management, 1992
Identifies four components of a good predictive lossless image compression method: (1) pixel sequence, (2) image modeling and prediction, (3) error modeling, and (4) error coding. Highlights include Laplace distribution and a comparison of the multilevel progressive method for image coding with the prediction by partial precision matching method.…
Descriptors: Coding, Comparative Analysis, Information Processing, Mathematical Formulas
Peer reviewed Peer reviewed
Eastman, Caroline M. – Information Processing and Management, 1989
Compares the performance of inverted and signature file organizations in handling incrementally specified Boolean queries to an information retrieval system. The discussion covers the impact of more sophisticated signature file organizations, related problems, and possible future work. (65 references) (CLB)
Descriptors: Algorithms, Comparative Analysis, Database Management Systems, Information Retrieval
Peer reviewed Peer reviewed
Bookstein, Abraham; Klein, Shmuel T. – Information Processing and Management, 1992
Presents new methods for compressing bit matrices in large information retrieval systems which exploit possible correlations between rows of words and columns of documents. Three encoding methods are tested and compared--Shannon-Fano, arithmetic, and Huffman--and an appendix discusses binomial coefficients. (20 references) (LRW)
Descriptors: Coding, Comparative Analysis, Correlation, Information Processing
Peer reviewed Peer reviewed
Danilowicz, Czeslaw – Information Processing and Management, 1994
Discusses end-user searching in Boolean information retrieval systems considers the role of search intermediaries and proposes a model of user preferences that incorporates a user's profile. Highlights include document representation; information queries; document output ranking; calculating user profiles; and selecting documents for a local…
Descriptors: Algorithms, Comparative Analysis, Databases, Information Retrieval
Peer reviewed Peer reviewed
Bollmann, Peter; And Others – Information Processing and Management, 1992
Discusses the PRECALL, PRR (probability of relevance given retrieval), and EP (expected precision) approaches for dealing with the problem of weak ordering in information retrieval systems. Findings of two experiments comparing evaluation results obtained by PRR and EP are reported. Several mathematical formulas and proofs are included. (20…
Descriptors: Comparative Analysis, Evaluation Methods, Information Retrieval, Mathematical Formulas
Peer reviewed Peer reviewed
Lee, Joon Ho; And Others – Information Processing and Management, 1994
Investigates document ranking methods in thesaurus-based Boolean information retrieval systems and proposes a new thesaurus-based ranking algorithm called the Extended Relevance algorithm. Performance comparisons are made between the Extended Relevance algorithm and previous thesaurus-based ranking algorithms. (Contains 20 references.) (LRW)
Descriptors: Algorithms, Comparative Analysis, Correlation, Information Retrieval
Peer reviewed Peer reviewed
Wong, Wai Yee Peter; Lee, Dik Lun – Information Processing and Management, 1993
Discussion of search strategies in information retrieval systems focuses on the implementation of document ranking based on inverted files. Highlights include descriptions of three heuristic methods for implementing a weighting strategy and a comparison of two methods for estimating retrieval accuracy, including document movement and linear…
Descriptors: Comparative Analysis, Graphs, Heuristics, Information Retrieval
Peer reviewed Peer reviewed
Rousseau, Ronald – Information Processing and Management, 1994
Discussion of informetric distributions shows that generalized Leimkuhler functions give proper fits to a large variety of Bradford curves, including those exhibiting a Groos droop or a rising tail. The Kolmogorov-Smirnov test is used to test goodness of fit, and least-square fits are compared with Egghe's method. (Contains 53 references.) (LRW)
Descriptors: Bibliometrics, Comparative Analysis, Goodness of Fit, Least Squares Statistics
Peer reviewed Peer reviewed
Kwok, K. L.; Kuan, William – Information Processing and Management, 1988
Describes an investigation of the use of document components, such as terms, sentences, and whole documents, for indexing and retrieval. A number of probabilistic similarity measures based on document components are studied, as well as a new method of handling probability estimations involving small sample sizes. (24 references) (Author/CLB)
Descriptors: Comparative Analysis, Indexing, Information Retrieval, Mathematical Formulas
Peer reviewed Peer reviewed
Doreian, Patrick – Information Processing and Management, 1994
Discusses the use of citation networks for constructing measures of the relative standing of journals in scientific journal-to-journal networks. Topics addressed include eigenvectors; measuring the standing of national scientific communities; and an empirical comparison of two measures of standing, one considering the environment and one without…
Descriptors: Citation Analysis, Comparative Analysis, Environment, Mathematical Formulas
Peer reviewed Peer reviewed
Fuhr, Norbert – Information Processing and Management, 1989
Describes three models for probabilistic indexing, all based on the Darmstadt automatic indexing approach, and presents experimental evaluation results for each. The discussion covers the improved retrieval effectiveness of probabilistic indexing over binary indexing, and suggestions for using this automatic indexing method with free text terms.…
Descriptors: Automatic Indexing, Comparative Analysis, Information Retrieval, Mathematical Formulas
Peer reviewed Peer reviewed
Can, Fazli – Information Processing and Management, 1994
Discussion of relevancy in information retrieval systems focuses on an analysis of the efficiency of various cluster-based retrieval (CBR) strategies. A method for combining CBR and inverted index search is proposed that is cost effective in terms of time efficiency; and results of experiments are reported. (Contains 32 references.) (LRW)
Descriptors: Algorithms, Cluster Grouping, Comparative Analysis, Cost Effectiveness