Peer reviewed. Bookstein, Abraham; And Others – Information Processing and Management, 1992
Discusses the problems of compressing a large textual database for storage on CD-ROM. A text-compression algorithm is presented, new algorithms for compression of indices are described, and the ARTFL (American and French Research on the Treasury of the French Language) database is used as an example. (14 references) (LRW)
Descriptors: Algorithms, Coding, Full Text Databases, Indexes

Peer reviewed. Ellis, David; And Others – Journal of Documentation, 1994
Describes a study in which several different sets of hypertext links are inserted by different people in full-text documents. The degree of similarity between the sets is measured using coefficients and topological indices. As in comparable studies of inter-indexer consistency, the sets of links used by different people showed little similarity.…
Descriptors: Full Text Databases, Hypermedia, Information Retrieval, Mathematical Formulas

Peer reviewed. Pearce, Claudia; Nicholas, Charles – Journal of the American Society for Information Science, 1996
Presents experimentation results for the TELLTALE system, a dynamic hypertext environment that provides full-text search from a hypertext-style user interface for text corpora that may be garbled by OCR (optical character recognition) or transmission errors, and that may contain languages other than English. (Author/LRW)
Descriptors: Character Recognition, Full Text Databases, Hypermedia, Information Retrieval

Peer reviewed. Baeza-Yates, Ricardo; And Others – Information Systems, 1996
Discusses indexes for text databases and presents an efficient implementation of an index for text searching called PAT array, or suffix array, where the database is stored on secondary storage devices such as magnetic or optical disks. Additional hierarchical index structures and searching algorithms are proposed that improve searching time, and…
Descriptors: Algorithms, Full Text Databases, Indexes, Information Storage

Peer reviewed. Losee, Robert M. – Information Processing & Management, 1996
Discusses the nature of term groupings, phrases, and text windows in full-text documents and computes the statistical significance of windows. Topics include classifying documents within disciplines or on a theory versus practice spectrum; and grammatical characteristics for automatic classification of documents, for information retrieval, and for…
Descriptors: Classification, Full Text Databases, Information Retrieval, Intellectual Disciplines

Peer reviewed. Moffat, Alistair; And Others – Information Processing & Management, 1994
Describes an approximate document ranking process that uses a compact array of in-memory, low-precision approximations for document length. Combined with another rule for reducing the memory required by partial similarity accumulators, the approximation heuristic allows the ranking of large document collections using less than one byte of memory…
Descriptors: Database Design, Database Management Systems, Full Text Databases, Information Retrieval