Intelligent document format: A text encoding scheme.
| dc.contributor.advisor | Skuce, D., | |
| dc.contributor.author | Tran, Charles. | |
| dc.date.accessioned | 2009-03-19T14:13:19Z | |
| dc.date.available | 2009-03-19T14:13:19Z | |
| dc.date.created | 1997 | |
| dc.date.issued | 1997 | |
| dc.degree.level | Masters | |
| dc.degree.name | M.Comp.Sc. | |
| dc.description.abstract | The issue of text representation is very important in text retrieval and natural language processing. The way the data is represented can significantly affect the efficiency of storage, retrieval, routing techniques, query formulation, and information extraction. This thesis describes a potential solution to encode documents efficiently and effectively so that information may be easily retrieved. In this thesis, we present a technique for encoding textual data in a representation format called the Intelligent Document Format (IDF). The IDF encodes English text so as to store textual data efficiently, permitting retrieval of text at sentence-, paragraph-, and document levels, and assisting term searching and retrieving as well as providing linguistic processing such as morphological analysis and sense disambiguation. To illustrate the IDF encoding method, we describe IDFconvert, an IDF encoder and decoder program, and carry out encoding experiments on the electronic-text version of Dracula (Stoker 1897). | |
| dc.format.extent | 82 p. | |
| dc.identifier.citation | Source: Masters Abstracts International, Volume: 36-01, page: 0212. | |
| dc.identifier.isbn | 9780612209565 | |
| dc.identifier.uri | http://hdl.handle.net/10393/4480 | |
| dc.identifier.uri | http://dx.doi.org/10.20381/ruor-13877 | |
| dc.publisher | University of Ottawa (Canada) | |
| dc.subject.classification | Computer Science. | |
| dc.title | Intelligent document format: A text encoding scheme. | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
