Tan, Chade-Meng; Wang, Yuan-Fang; Lee, Chan-Do – Information Processing & Management, 2002 (peer reviewed)
Presents an efficient text categorization (or text classification) algorithm for document retrieval of natural language texts that generates bigrams (two-word phrases) and uses the information gain metric, combined with various frequency thresholds. Experimental results suggest that the bigrams can substantially raise the quality of feature sets.…
Descriptors: Algorithms, Classification, Information Retrieval, Natural Language Processing
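The abstract above names three ingredients: bigram generation, the information gain metric, and frequency thresholds. The following is a minimal sketch of how these could fit together for a two-class problem. It is not the authors' published algorithm: the function names, the document-frequency threshold `min_df`, and the gain cutoff `min_gain` are hypothetical choices made for illustration only.

```python
import math
from collections import Counter


def bigrams(tokens):
    """Return the set of adjacent two-word phrases in one tokenized document."""
    return {f"{a} {b}" for a, b in zip(tokens, tokens[1:])}


def entropy(pos, total):
    """Binary entropy in bits for `pos` positive items out of `total`."""
    if total == 0 or pos == 0 or pos == total:
        return 0.0
    p = pos / total
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def select_bigrams(docs, labels, min_df=2, min_gain=0.01):
    """Rank bigrams by information gain with respect to binary labels.

    docs:     list of token lists
    labels:   list of 0/1 class labels, one per document
    min_df:   hypothetical threshold, minimum number of documents containing the bigram
    min_gain: hypothetical threshold, minimum information gain to keep the bigram
    """
    n = len(docs)
    n_pos = sum(labels)
    doc_bigrams = [bigrams(d) for d in docs]

    # Document frequency overall and within the positive class.
    df = Counter(b for bs in doc_bigrams for b in bs)
    pos_df = Counter(b for bs, y in zip(doc_bigrams, labels) if y for b in bs)

    base = entropy(n_pos, n)
    scored = []
    for b, count in df.items():
        if count < min_df:
            continue  # frequency threshold: drop rare bigrams
        absent = n - count
        absent_pos = n_pos - pos_df[b]
        # Class entropy after splitting documents on presence of the bigram.
        conditional = (count / n) * entropy(pos_df[b], count) \
            + (absent / n) * entropy(absent_pos, absent)
        gain = base - conditional
        if gain >= min_gain:
            scored.append((b, gain))

    # Highest gain first; ties broken alphabetically for a stable order.
    return sorted(scored, key=lambda item: (-item[1], item[0]))
```

Calling `select_bigrams(docs, labels)` on a list of tokenized documents with 0/1 labels returns `(bigram, gain)` pairs, which could then be added to a unigram feature set before training a classifier.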
Miller, Uri – Information Processing & Management, 1997 (peer reviewed)
Discusses general problems in the theory and practice of thesaurus construction. Highlights include lexical control and its tools in various databases; natural language versus conceptual networks; the systems approach; thesaurus versus classification, including associative relations; and the role of the thesaurus in information storage and retrieval. (110 references)…
Descriptors: Classification, Databases, Information Retrieval, Information Storage


