Descriptor
| Natural Language Processing | 3 |
| World Wide Web | 3 |
| Computer System Design | 2 |
| Information Retrieval | 2 |
| Korean | 2 |
| Captions | 1 |
| English | 1 |
| Evaluation Methods | 1 |
| Indexing | 1 |
| Information Processing | 1 |
| Information Seeking | 1 |
| More ▼ | |
Source
| Information Processing &… | 3 |
Author
| Kim, Dongseok | 2 |
| Lee, Gary Geunbae | 2 |
| Cha, Jeongwon | 1 |
| Frew, Brian | 1 |
| Jung, Hanmin | 1 |
| Rowe, Neil C. | 1 |
| Seo, Jungyun | 1 |
| Shim, Junhyeok | 1 |
Publication Type
| Journal Articles | 3 |
| Reports - Research | 3 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedShim, Junhyeok; Kim, Dongseok; Cha, Jeongwon; Lee, Gary Geunbae; Seo, Jungyun – Information Processing & Management, 2002
Discussion of natural language processing focuses on a multi-strategic integrated text preprocessing method for difficult problems of sentence boundary disambiguation and word boundary disambiguation of Web texts. Describes an evaluation of the method using Korean Web document collections. (Author/LRW)
Descriptors: Evaluation Methods, Korean, Mathematical Formulas, Natural Language Processing
Peer reviewedKim, Dongseok; Jung, Hanmin; Lee, Gary Geunbae – Information Processing & Management, 2003
Presents a new extraction pattern, modified Document Type Definition (mDTD), which relies on analytical interpretation to identify extraction target from the contents of Web documents. Experiments with 330 Korean and 220 English Web documents on audio and video shopping sites yielded an average extraction precision of 91.3% for Korean and 81.9%…
Descriptors: Computer System Design, English, Information Retrieval, Korean
Peer reviewedRowe, Neil C.; Frew, Brian – Information Processing & Management, 1998
Explores the indirect method of locating for indexing the likely explicit and implicit captions of photographs, using multimodal clues including the specific words used, syntax, surrounding layout of the Web page, and general appearance of the associated image. The MARIE-3 system thus avoids full image processing and full natural-language…
Descriptors: Captions, Computer System Design, Indexing, Information Processing


