Publication details
Main information
Title:
Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach
Publication date:
June 2007
Citation:
RaminhosP07:iceis
Abstract:
The Web is a major source of textual information, published in human-readable, semi-structured formats and covering multiple domains. Traditional ETL approaches, which require developing specific source code for each data source and depend on repeated interaction between domain experts and computer-science experts, are an inadequate solution. This paper presents a novel approach to ETL, based on its decomposition into two phases: ETD (Extraction, Transformation and Data Delivery) and IL (Integration and Loading).
In proceedings
Authors:
Ricardo Raminhos,
João Moura Pires
Editors:
Jorge Cardoso, José Cordeiro, Joaquim Filipe
Book title:
Proceedings of the Ninth International Conference on Enterprise Information Systems
Publisher:
INSTICC Press
Address:
Funchal, Madeira, Portugal
Pages:
199-205
ISBN:
9789728865887
Export formats
Plain text:
Ricardo Raminhos and João Moura Pires, Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach, in: Jorge Cardoso, José Cordeiro and Joaquim Filipe (eds), Proceedings of the Ninth International Conference on Enterprise Information Systems, INSTICC Press, Funchal, Madeira, Portugal, ISBN 9789728865887, pp. 199-205, June 2007.
HTML:
Ricardo Raminhos and <a href="/people/members/view.php?code=542b14e1830dcf7566974fd36b6fccc7" class="author">João Moura Pires</a>, <b>Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach</b>, in: Jorge Cardoso, José Cordeiro and Joaquim Filipe (eds), <u>Proceedings of the Ninth International Conference on Enterprise Information Systems</u>, <a href="http://www.insticc.net/" title="Link to external entity..." target="_blank" class="publisher">INSTICC Press</a>, Funchal, Madeira, Portugal, ISBN 9789728865887, pp. 199-205, June 2007.
BibTeX:
@inproceedings{RaminhosP07:iceis,
  author    = {Ricardo Raminhos and Jo{\~a}o Moura Pires},
  editor    = {Jorge Cardoso and Jos{\'e} Cordeiro and Joaquim Filipe},
  title     = {Extraction and Transformation of Data From Semi-Structured Text Files Using a Declarative Approach},
  booktitle = {Proceedings of the Ninth International Conference on Enterprise Information Systems},
  publisher = {INSTICC Press},
  address   = {Funchal, Madeira, Portugal},
  pages     = {199--205},
  isbn      = {9789728865887},
  abstract  = {The Web is a major source of textual information, with a human-readable semi-structured format, referring to multiple domains. Traditional ETL approaches following the development of specific source code for each data source and based on multiple domain / computer-science experts interactions, become an inadequate solution. This paper presents a novel approach to ETL, based on its decomposition in two phases: ETD (Extraction, Transformation and Data Delivery) and IL (Integration and Loading).},
  keywords  = {ETL, Semi-Structured Text Files},
  month     = {June},
  year      = {2007}
}
Publication's urls
Full url:
/publications/view.php?code=205f55844f99858cdab7459d0d03bb07
Friendly url:
/publications/view.php?code=RaminhosP07:iceis
Departamento de Informática, FCT/UNL
Quinta da Torre 2829-516 CAPARICA - Portugal
Tel. (+351) 21 294 8536 FAX (+351) 21 294 8541