Centre for Artificial Intelligence of UNL

Browse our site

About

People

Research Areas

Projects

Publications

Seminars

News

You are here:

Publication details

Publication details

Main information

Title:	AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX)
Publication date:	December 2012
Citation:	banza2012a
Abstract:	In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of- speech tagging is to assign to each word of a text its part-of-speech. The task is not straightforward, because a large percentage of words has more than one possible part-of-speech, and the right choice is determined by the surrounding word’s part-of-speeches. This means that to solve this problem we need a method to disambiguate a word’s possible tags set. Traditionally there are two groups of methods used to tackle this task. The first group is based on statistical data concerning the different context’s possibilities for a word, while the second group is based on rules, normally designed by human experts, that capture the language properties. In this work we present a solution that tries to incorporate both these approaches.

In proceedings

Authors:	Ana Paula Banza, Irene Rodrigues, José Saias, Filomena Gonçalves
Book title:	proceedings of the International Conference on Historical Corpora 2012
Series:	-
Publisher:	Organised by LOEWE Priority Program
Address:	-
Volume:	-
Pages:	-
ISBN:	-
ISSN:	-
Note:	-
Url address:	-

Export formats

Plain text:	Ana Paula Banza and Irene Rodrigues and José Saias and Filomena Gonçalves, AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX), , proceedings of the International Conference on Historical Corpora 2012, Organised by LOEWE Priority Program, December 2012.
HTML:	Ana Paula Banza, <a href="/people/members/view.php?code=032b48c4371cf1d1523215c3f02c42de" class="author">Irene Rodrigues</a>, José Saias and Filomena Gonçalves, <b>AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX)</b>, <u>proceedings of the International Conference on Historical Corpora 2012</u>, Organised by LOEWE Priority Program, December 2012.
BibTeX:	@inproceedings {banza2012a, author = {Ana Paula Banza and Irene Rodrigues and Jos{\'e} Saias and Filomena Gon\c{c}alves}, title = {AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX)}, booktitle = {proceedings of the International Conference on Historical Corpora 2012}, publisher = {Organised by LOEWE Priority Program}, abstract = {In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of- speech tagging is to assign to each word of a text its part-of-speech. The task is not straightforward, because a large percentage of words has more than one possible part-of-speech, and the right choice is determined by the surrounding word’s part-of-speeches. This means that to solve this problem we need a method to disambiguate a word’s possible tags set. Traditionally there are two groups of methods used to tackle this task. The first group is based on statistical data concerning the different context’s possibilities for a word, while the second group is based on rules, normally designed by human experts, that capture the language properties. In this work we present a solution that tries to incorporate both these approaches.}, month = {December}, year = {2012}, }

Publication's urls

Full url:	/publications/view.php?code=7e58cf1c45f97c24f6e86c5c62a889da
Friendly url:	/publications/view.php?code=banza2012a

Departamento de Informática, FCT/UNL
Quinta da Torre 2829-516 CAPARICA - Portugal
Tel. (+351) 21 294 8536 FAX (+351) 21 294 8541