Peer reviewed: Tan, Chade-Meng; Wang, Yuan-Fang; Lee, Chan-Do – Information Processing & Management, 2002
Presents an efficient text categorization (or text classification) algorithm for document retrieval of natural language texts that generates bigrams (two-word phrases) and uses the information gain metric, combined with various frequency thresholds. Experimental results suggest that the bigrams can substantially raise the quality of feature sets.…
Descriptors: Algorithms, Classification, Information Retrieval, Natural Language Processing
Peer reviewed: Hersh, William; Turpin, Andrew; Price, Susan; Kraemer, Dale; Olson, Daniel; Chan, Benjamin; Sacherek, Lynetta – Information Processing & Management, 2001
Describes research conducted at the TREC (Text Retrieval Conference) interactive track that compared Boolean and natural language searching, showing they achieved comparable results; and assessed the validity of batch-oriented retrieval evaluations, showing that the results from batch evaluations were not comparable to those obtained in…
Descriptors: Comparative Analysis, Evaluation Methods, Information Retrieval, Natural Language Processing
Peer reviewed: Shim, Junhyeok; Kim, Dongseok; Cha, Jeongwon; Lee, Gary Geunbae; Seo, Jungyun – Information Processing & Management, 2002
Discussion of natural language processing focuses on a multi-strategic integrated text preprocessing method for difficult problems of sentence boundary disambiguation and word boundary disambiguation of Web texts. Describes an evaluation of the method using Korean Web document collections. (Author/LRW)
Descriptors: Evaluation Methods, Korean, Mathematical Formulas, Natural Language Processing
Peer reviewed: Lewis, David D.; Knowles, Kimberly A. – Information Processing & Management, 1997
Discussion of electronic mail message processing focuses on threads, which are conversations among two or more people carried out by exchange of messages. Suggests that effective threading systems should rely on conventions in human communication rather than on software communication, and shows that information retrieval techniques can be used…
Descriptors: Computer Software, Dialogs (Language), Electronic Mail, Futures (of Society)
Peer reviewed: McKeown, Kathleen; And Others – Information Processing & Management, 1995
Presents an approach to summarization that combines information from multiple facts into a single sentence using linguistic constructions. Describes two applications: one produces summaries of basketball games, and the other contains summaries of telephone network planning activity. Both summarize input data as opposed to full text. Discusses…
Descriptors: Basketball, Communications, Computational Linguistics, Information Sources
Peer reviewed: Kim, Dongseok; Jung, Hanmin; Lee, Gary Geunbae – Information Processing & Management, 2003
Presents a new extraction pattern, modified Document Type Definition (mDTD), which relies on analytical interpretation to identify extraction targets from the contents of Web documents. Experiments with 330 Korean and 220 English Web documents on audio and video shopping sites yielded an average extraction precision of 91.3% for Korean and 81.9%…
Descriptors: Computer System Design, English, Information Retrieval, Korean
Peer reviewed: Turtle, Howard; Flood, James – Information Processing & Management, 1995
Discusses two query evaluation strategies used in large text retrieval systems: (1) term-at-a-time; and (2) document-at-a-time. Describes optimization techniques that can reduce query evaluation costs. Presents simulation results that compare the performance of these optimization techniques when applied to natural language query evaluation. (JMV)
Descriptors: Access to Information, Comparative Analysis, Cost Effectiveness, Evaluation Methods
Peer reviewed: Strzalkowski, Tomek – Information Processing & Management, 1995
Describes an information retrieval system in which advanced natural language processing is used to enhance the effectiveness of term-based document retrieval by preprocessing the documents; discovering interterm dependencies and building a conceptual hierarchy specific to the database domain; and processing the user's natural language requests into…
Descriptors: Databases, Information Processing, Information Retrieval, Information Seeking
Peer reviewed: Maybury, Mark T. – Information Processing & Management, 1995
Describes and evaluates a system that selects key information from an event database by reasoning about event frequencies, frequencies of relations between events, and domain-specific importance measures. The system aggregates similar information and plans a summary tailored to a stereotypical user. (AEF)
Descriptors: Abstracting, Data Processing, Databases, Electronic Text
Peer reviewed: Losee, Robert M. – Information Processing & Management, 2001
Improving information retrieval performance by using phrases and part-of-speech (POS) information is one example of a type of decision-making performance that benefits from this linguistic information. The relative effectiveness of using multi-term phrases as opposed to individual terms is shown, as well as the relative worth of POS tagged…
Descriptors: Decision Making, Form Classes (Languages), Improvement, Information Retrieval
Peer reviewed: Rowe, Neil C.; Frew, Brian – Information Processing & Management, 1998
Explores an indirect method of locating, for indexing purposes, the likely explicit and implicit captions of photographs, using multimodal clues including the specific words used, syntax, surrounding layout of the Web page, and general appearance of the associated image. The MARIE-3 system thus avoids full image processing and full natural-language…
Descriptors: Captions, Computer System Design, Indexing, Information Processing


