ERIC Number: ED148303
Record Type: RIE
Publication Date: 1973-Jun
Pages: 11
Abstractor: N/A
ISBN: N/A
ISSN: N/A
EISSN: N/A
Available Date: N/A
A Method Defining a Limited Set of Character-Strings with Maximal Coverage of a Sample of Text.
Hultgren, Jan; Larsson, Rolf
This is a progress report on a project attempting to design a system of compacting text for storage appropriate to disc oriented demand searching. After noting a number of previously designed methods of compression, it offers a tentative solution which couples a dictionary of most frequent character-strings with a set of variable-length codes. The dictionary is not restricted to natural language words or to content-bearing strings. A suite of IBM 360 assembly language programs has been constructed to count frequencies and to compute a set of parameters which will mitigate constructing a dictionary. A program to sort and print the file resulting from the other programs has also been developed. (WBC)
Publication Type: Reports - Research
Education Level: N/A
Audience: N/A
Language: N/A
Sponsor: N/A
Authoring Institution: Royal Inst. of Tech., Stockholm (Sweden). Library.
Grant or Contract Numbers: N/A
Author Affiliations: N/A