Intelligent document format: A text encoding scheme.

dc.contributor.advisor: Skuce, D.
dc.contributor.author: Tran, Charles.
dc.date.accessioned: 2009-03-19T14:13:19Z
dc.date.available: 2009-03-19T14:13:19Z
dc.date.created: 1997
dc.date.issued: 1997
dc.degree.level: Masters
dc.degree.name: M.Comp.Sc.
dc.description.abstract: Text representation is a central issue in text retrieval and natural language processing. How the data is represented can significantly affect the efficiency of storage, retrieval, routing techniques, query formulation, and information extraction. This thesis describes a potential solution for encoding documents efficiently and effectively so that information can be easily retrieved. We present a technique for encoding textual data in a representation format called the Intelligent Document Format (IDF). The IDF encodes English text so that it is stored efficiently, can be retrieved at the sentence, paragraph, and document levels, and supports term search and retrieval as well as linguistic processing such as morphological analysis and sense disambiguation. To illustrate the IDF encoding method, we describe IDFconvert, an IDF encoder and decoder program, and carry out encoding experiments on the electronic-text version of Dracula (Stoker 1897).
dc.format.extent: 82 p.
dc.identifier.citation: Source: Masters Abstracts International, Volume: 36-01, page: 0212.
dc.identifier.isbn: 9780612209565
dc.identifier.uri: http://hdl.handle.net/10393/4480
dc.identifier.uri: http://dx.doi.org/10.20381/ruor-13877
dc.publisher: University of Ottawa (Canada)
dc.subject.classification: Computer Science.
dc.title: Intelligent document format: A text encoding scheme.
dc.type: Thesis

Files

Original bundle

Name: MQ20956.PDF
Size: 2.81 MB
Format: Adobe Portable Document Format