An evaluation of a rule-based parser of English sentences.

Scarlett, Elizabeth A.

An evaluation of a rule-based parser of English sentences.

dc.contributor.advisor	Szpakowicz, Stan,
dc.contributor.author	Scarlett, Elizabeth A.
dc.date.accessioned	2009-03-23T18:20:47Z
dc.date.available	2009-03-23T18:20:47Z
dc.date.created	2000
dc.date.issued	2000
dc.degree.level	Masters
dc.degree.name	M.C.S.
dc.description.abstract	DIPETT (Domain Independent Parser of English Technical Text) is a broad-coverage parser of English technical text that is used primarily in the TANKA (Text Analysis for Knowledge Acquisition) project. The TANKA project seeks to build a model of a technical domain by semi-automatically processing written text that describes the domain. No other source of domain-specific knowledge is available. The accuracy and completeness of a semantic representation generated by TANKA is partly determined by the accuracy of DIPETT's syntactic analysis of the text. The thesis argues that a test suite for a broad coverage natural language parser must necessarily be systematic, broad in its coverage of phenomena tested, and corpus-like in its coverage of phenomenon interaction. A test suite of example sentences extracted from Quirk et. al.'s comprehensive English grammar is proposed, and the results of evaluating DIPETT on that suite are compared with the evaluation results on a publicly available test suite, TSNLP (Test Suites for Natural Language Processing). (Abstract shortened by UMI.)
dc.format.extent	206 p.
dc.identifier.citation	Source: Masters Abstracts International, Volume: 39-05, page: 1409.
dc.identifier.isbn	9780612585010
dc.identifier.uri	http://hdl.handle.net/10393/9086
dc.identifier.uri	http://dx.doi.org/10.20381/ruor-16137
dc.publisher	University of Ottawa (Canada)
dc.subject.classification	Language, Linguistics.
dc.title	An evaluation of a rule-based parser of English sentences.
dc.type	Thesis

Fichiers

Trousse originale

Voici les éléments 1 - 1 sur 1

Nom:: MQ58501.PDF
Taille:: 10.07 MB
Format:: Adobe Portable Document Format

Télécharger

Collections

Thèses, 1910 - 2010 // Theses, 1910 - 2010