An evaluation of a rule-based parser of English sentences.

Scarlett, Elizabeth A.

An evaluation of a rule-based parser of English sentences.

Fichiers

MQ58501.PDF (10.07 MB)

Date

2000

Authors

Scarlett, Elizabeth A.

Éditeur

University of Ottawa (Canada)

Résumé

DIPETT (Domain Independent Parser of English Technical Text) is a broad-coverage parser of English technical text that is used primarily in the TANKA (Text Analysis for Knowledge Acquisition) project. The TANKA project seeks to build a model of a technical domain by semi-automatically processing written text that describes the domain. No other source of domain-specific knowledge is available. The accuracy and completeness of a semantic representation generated by TANKA is partly determined by the accuracy of DIPETT's syntactic analysis of the text. The thesis argues that a test suite for a broad coverage natural language parser must necessarily be systematic, broad in its coverage of phenomena tested, and corpus-like in its coverage of phenomenon interaction. A test suite of example sentences extracted from Quirk et. al.'s comprehensive English grammar is proposed, and the results of evaluating DIPETT on that suite are compared with the evaluation results on a publicly available test suite, TSNLP (Test Suites for Natural Language Processing). (Abstract shortened by UMI.)

Citation

Source: Masters Abstracts International, Volume: 39-05, page: 1409.

URI

http://hdl.handle.net/10393/9086
http://dx.doi.org/10.20381/ruor-16137

Collections

Thèses, 1910 - 2010 // Theses, 1910 - 2010

Notice complète

An evaluation of a rule-based parser of English sentences.

Fichiers

Date

Authors

Nom de la revue

ISSN de la revue

Titre du volume

Éditeur

Résumé

Description

Mots-clés

Citation

URI

Collections

Approbation

Évaluation

Complété par

Référencé par