ETL:
Extract, Transform, Load (ETL) is the general procedure of copying data
from one or more sources into a destination system which represents the data
differently from the source(s). The term comes from the three basic steps
needed: extracting (selecting and exporting) data from the source, transforming
the way the data is represented to the form expected by the destination, and
loading (reading or importing) the transformed data into the destination system.
ETLs tools:
List of common batch run/incumbent ETL tools:
- IBM
InfoSphere DataStage
- Informatica
Power Centre
- Microsoft
SSIS (SQL Server Integration Services)
- Oracle
Data Integrator
- Oracle
Warehouse Builder (OWB)
- SAP
Data Services
- SAS
Data Management
- PowerCenter
Informatica
- Elixir
Repertoire for Data ETL
- Data
Migrator (IBI)
- Talend
Studio for Data Integration
- Sagent
Data Flow
- Actian
DataConnect
- Open
Text Integration Center
- Cognos
Data Manager
- CloverETL
- Centerprise
Data Integrator
- IBM
Infosphere Warehouse Edition
- Pentaho
Data Integration
- Adeptia
Integration Server
- Syncsort
DMX
- QlikView
Expressor
- Relational
Junction ETL Manager (Sesame Software)
Open
source ETL tools
These solutions are the evolutionary middle step between incumbent batch-based tools and fully managed cloud-based solutions. They solve some of the problems that batch run tools do not, for example, handling real-time streaming data.
List
of common open source ETL tools:
- Apache
Kafka
- Apache
NiFi
- CloverETL
- Jaspersoft
- Pentaho Kettle
- Talend Open Studio
Cloud-native
ETL tools
Today's
ETL tools are cloud-based and run in real time. Cloud-based means your ETL
solution is managed and you need not worry about hardware costs, scaling,
replication, or security, because these are usually built-in.
List of common cloud-native ETL tools:
- Alooma
- Fivetran
- Matillion
- Snaplogic
- Stitch
Data
The
demand for real-time support has moved the model from batch processing to one based
on message queues and streams. Kafka has become the leading distributed message
queue, and companies like Alooma have built SaaS or on-prem ETL solutions atop
it.
Batch processing of ETL work makes little sense if your data (or insights from it) are needed instantly. And many applications work this way today — a tweet or social media update goes live immediately, not tomorrow!
Here's a list of common real-time ETL tools:
- Alooma
- Confluent
- StreamSets
- Striim
BI tools
BI
and Analytics tools are about everything you do with the data to get insights
once you've captured it. These include tools for visualization, data science
analysis, analytics and KPIs:
List of common BI and analytics tools:
- SAP
Business Intelligence
- MicroStrategy
- Dundas
BI
- Yellowfin
BI
- TIBCO
Spotfire
- Hevo
Data
- Microsoft
Power BI
- Looker
- Clear
Analytics
- Tableau
- Oracle
BI
- Domo
- QlikView
- Pentaho
- TIBCO
Jaspersoft
- BIRT
- IBM
Cognos Analytics
- Style
Intelligence
- Netlink
No comments:
Post a Comment