Repository logo

An evaluation of a rule-based parser of English sentences.

dc.contributor.advisorSzpakowicz, Stan,
dc.contributor.authorScarlett, Elizabeth A.
dc.date.accessioned2009-03-23T18:20:47Z
dc.date.available2009-03-23T18:20:47Z
dc.date.created2000
dc.date.issued2000
dc.degree.levelMasters
dc.degree.nameM.C.S.
dc.description.abstractDIPETT (Domain Independent Parser of English Technical Text) is a broad-coverage parser of English technical text that is used primarily in the TANKA (Text Analysis for Knowledge Acquisition) project. The TANKA project seeks to build a model of a technical domain by semi-automatically processing written text that describes the domain. No other source of domain-specific knowledge is available. The accuracy and completeness of a semantic representation generated by TANKA is partly determined by the accuracy of DIPETT's syntactic analysis of the text. The thesis argues that a test suite for a broad coverage natural language parser must necessarily be systematic, broad in its coverage of phenomena tested, and corpus-like in its coverage of phenomenon interaction. A test suite of example sentences extracted from Quirk et. al.'s comprehensive English grammar is proposed, and the results of evaluating DIPETT on that suite are compared with the evaluation results on a publicly available test suite, TSNLP (Test Suites for Natural Language Processing). (Abstract shortened by UMI.)
dc.format.extent206 p.
dc.identifier.citationSource: Masters Abstracts International, Volume: 39-05, page: 1409.
dc.identifier.isbn9780612585010
dc.identifier.urihttp://hdl.handle.net/10393/9086
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-16137
dc.publisherUniversity of Ottawa (Canada)
dc.subject.classificationLanguage, Linguistics.
dc.titleAn evaluation of a rule-based parser of English sentences.
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
MQ58501.PDF
Size:
10.07 MB
Format:
Adobe Portable Document Format