Efficiency evaluation of open source ETL tools

Majchrzak TA, Jansen T, Kuchen H


Abstract
Business intelligence (BI) is considered to have a high impact on businesses. Research activity has risen in the last years. An important part of BI systems is a well performing implementation of the Extract, Transform, and Load (ETL) process. In typical BI projects, implementing the ETL process can be the task with the greatest effort. However, little work is published on ETL applications and in particular on open source ETL tools. We have analyzed open source ETL tools especially with regard to their performance. In this paper we present the analysis' background and highlight related work. We then sketch the test setup, show the detailed results for Talend Open Studio and Pentaho Data Integration, and discuss our observations. Eventually, we draw a conclusion and point out future work.



Publication type
Research article in proceedings (conference)

Peer reviewed
Yes

Publication status
Published

Year
2011

Conference
ACM Symposium on Applied Computing

Venue
TaiChung, Taiwan

Journal
Proceedings of the ACM Symposium on Applied Computing

Book title
Proceedings of the 2011 ACM Symposium on Applied Computing

Editor
Chu William C. , Wong W. Eric, Palakal Mathew J., Hung Chih-Cheng

Start page
287

End page
294

Publisher
ACM

Language
English

ISBN
978-1-4503-0113-8

DOI

Full text