Back to first pageBack to first page Centre for Artificial Intelligence of UNL
Browse our site
You are here:

Publication details

Publication details
Main information
Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach
June 2007
RaminhosP07:iceis
The Web is a major source of textual information, with a human-readable semi-structured format, referring to multiple domains. Traditional ETL approaches following the development of specific source code for each data source and based on multiple domain / computer-science experts interactions, become an inadequate solution. This paper presents a novel approach to ETL, based on its decomposition in two phases: ETD (Extraction, Transformation and Data Delivery) and IL (Integration and Loading).
In proceedings
Ricardo Raminhos, João Moura Pires
Jorge Cardoso, José Cordeiro, Joaquim Filipe
Proceedings of the Ninth International Conference on Enterprise Information Systems
-
INSTICC Press
Funchal, Madeira, Portugal
-
199-205
9789728865887
-
-
-
Publication files
- click here to download - pdf 168 KB
Export formats
Ricardo Raminhos and João Moura Pires, Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach, in: Jorge Cardoso and José Cordeiro and Joaquim Filipe (eds), Proceedings of the Ninth International Conference on Enterprise Information Systems, INSTICC Press, Funchal, Madeira, Portugal, ISBN 9789728865887, Pag. 199-205, June 2007.
Ricardo Raminhos and <a href="/people/members/view.php?code=542b14e1830dcf7566974fd36b6fccc7" class="author">João Moura Pires</a>, <b>Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach</b>, in: Jorge Cardoso, José Cordeiro and Joaquim Filipe (eds), <u>Proceedings of the Ninth International Conference on Enterprise Information Systems</u>, <a href="http://www.insticc.net/" title="Link to external entity..." target="_blank" class="publisher">INSTICC Press</a>, Funchal, Madeira, Portugal, ISBN 9789728865887, Pag. 199-205, June 2007.
@inproceedings {RaminhosP07:iceis, author = {Ricardo Raminhos and Jo{\~a}o Moura Pires}, editor = {Jorge Cardoso and Jos{\'e} Cordeiro and Joaquim Filipe}, title = {Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach}, booktitle = {Proceedings of the Ninth International Conference on Enterprise Information Systems}, publisher = {INSTICC Press}, address = {Funchal, Madeira, Portugal}, pages = {199-205}, isbn = {9789728865887}, abstract = {The Web is a major source of textual information, with a human-readable semi-structured format, referring to multiple domains. Traditional ETL approaches following the development of specific source code for each data source and based on multiple domain / computer-science experts interactions, become an inadequate solution. This paper presents a novel approach to ETL, based on its decomposition in two phases: ETD (Extraction, Transformation and Data Delivery) and IL (Integration and Loading).}, keywords = {ETL, Semi-Structured Text Files,}, month = {June}, year = {2007}, }
Publication's urls
/publications/view.php?code=205f55844f99858cdab7459d0d03bb07
/publications/view.php?code=RaminhosP07:iceis

Centre for Artificial Intelligence of UNL
Departamento de Informática, FCT/UNL
Quinta da Torre 2829-516 CAPARICA - Portugal
Tel. (+351) 21 294 8536 FAX (+351) 21 294 8541

Fundacao_FCT