Back to first pageBack to first page Centre for Artificial Intelligence of UNL
Browse our site

PGR - Selective Access to the information contained in the Opinions of the Portuguese Republic

Project information
Description

Access to the opinions of the Portuguese Republic Attorney (PGR) via web, by incorporating knowledge about Portuguese Language (namely a large lexicon, and multi-word units automatically extracted from the PGR corpus) in the search engine used.

Started in January 1998 and was concluded in 2001.

Project

Participating entities: Heurística, CENTRIA - UNL, Procuradoria Geral da República.

Funding

Funding entity: Agência de Inovação.

CENTRIA

Principal researcher: Gabriel Pereira Lopes.

Results

1) Automatic extraction of thesaurus from partially parsed PGR corpus. The results of this effort were not yet inserted in the search engine used in this pro ject. 2) Supervised and unsupervised classification of documents of this collection of opinions. The first method used a neural network based approach and the key words used in those documents. The unsupervised classification used automatically extracted multi-word lexical units and statistical methods. Both must still be incorporated in the search engine used. 3) Statistically based parallel text alignment and translation equivalents extraction from parallel corpora continued. However it is: still required a large effort in order to enable access to the opinions of the Portuguese General Attorney, using any of the European Community languages. Work in the framework of project TRADAUT-PT will provide a large basis for making this possible, at least for English and French speaking people. #28 publications and a demo, together with final report.

Other information

Project website


Centre for Artificial Intelligence of UNL
Departamento de Informática, FCT/UNL
Quinta da Torre 2829-516 CAPARICA - Portugal
Tel. (+351) 21 294 8536 FAX (+351) 21 294 8541

Fundacao_FCT