Browse our site
About
People
Research Areas
Projects
Publications
Books
Book chapters
Journal articles
In proceedings
M. Sc. Dissertations
Ph. D. Dissertations
Technical reports
Seminars
News
You are here:
Home
Publications
View
Publication details
Go back
Publication details
Main information
Title:
AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX)
Publication date:
December 2012
Citation:
banza2012a
Abstract:
In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of- speech tagging is to assign to each word of a text its part-of-speech. The task is not straightforward, because a large percentage of words has more than one possible part-of-speech, and the right choice is determined by the surrounding word’s part-of-speeches. This means that to solve this problem we need a method to disambiguate a word’s possible tags set. Traditionally there are two groups of methods used to tackle this task. The first group is based on statistical data concerning the different context’s possibilities for a word, while the second group is based on rules, normally designed by human experts, that capture the language properties. In this work we present a solution that tries to incorporate both these approaches.
In proceedings
Authors:
Ana Paula Banza,
Irene Rodrigues
, José Saias, Filomena Gonçalves
Book title:
proceedings of the International Conference on Historical Corpora 2012
Series:
-
Publisher:
Organised by LOEWE Priority Program
Address:
-
Volume:
-
Pages:
-
ISBN:
-
ISSN:
-
Note:
-
Url address:
-
Export formats
Plain text:
Ana Paula Banza and Irene Rodrigues and José Saias and Filomena Gonçalves, AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX), , proceedings of the International Conference on Historical Corpora 2012, Organised by LOEWE Priority Program, December 2012.
HTML:
Ana Paula Banza, <a href="/people/members/view.php?code=032b48c4371cf1d1523215c3f02c42de" class="author">Irene Rodrigues</a>, José Saias and Filomena Gonçalves, <b>AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX)</b>, <u>proceedings of the International Conference on Historical Corpora 2012</u>, Organised by LOEWE Priority Program, December 2012.
BibTeX:
@inproceedings {banza2012a, author = {Ana Paula Banza and Irene Rodrigues and Jos{\'e} Saias and Filomena Gon\c{c}alves}, title = {AN HISTORICAL LINGUISTICS CORPUS (Centuries XVI-XIX)}, booktitle = {proceedings of the International Conference on Historical Corpora 2012}, publisher = {Organised by LOEWE Priority Program}, abstract = {In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of- speech tagging is to assign to each word of a text its part-of-speech. The task is not straightforward, because a large percentage of words has more than one possible part-of-speech, and the right choice is determined by the surrounding word’s part-of-speeches. This means that to solve this problem we need a method to disambiguate a word’s possible tags set. Traditionally there are two groups of methods used to tackle this task. The first group is based on statistical data concerning the different context’s possibilities for a word, while the second group is based on rules, normally designed by human experts, that capture the language properties. In this work we present a solution that tries to incorporate both these approaches.}, month = {December}, year = {2012}, }
Publication's urls
Full url:
/publications/view.php?code=7e58cf1c45f97c24f6e86c5c62a889da
Friendly url:
/publications/view.php?code=banza2012a
Go back
Departamento de Informática, FCT/UNL
Quinta da Torre 2829-516 CAPARICA - Portugal
Tel. (+351) 21 294 8536 FAX (+351) 21 294 8541