Wednesday, 30 August 2023

ETL, ETL Tools, Business Intelligence, and Business Intelligence Tools

ETL:

Extract, Transform, Load (ETL) is the general procedure of copying data from one or more sources into a destination system which represents the data differently from the source(s). The term comes from the three basic steps needed: extracting (selecting and exporting) data from the source, transforming the way the data is represented to the form expected by the destination, and loading (reading or importing) the transformed data into the destination system.
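The three steps can be sketched in a few lines of Python. This is only a minimal illustration using the standard library: the CSV contents, the `payments` table, and the function names are all made up for the example, and an in-memory SQLite database stands in for the destination system.

```python
import csv
import io
import sqlite3

# Hypothetical source data: a CSV export where every value is a string.
SOURCE_CSV = """id,name,amount
1, alice ,10.50
2,BOB,3
3,carol,7.25
"""


def extract(text):
    """Extract: read rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert each row to the representation the destination expects."""
    return [
        (int(r["id"]), r["name"].strip().title(), round(float(r["amount"]) * 100))
        for r in rows
    ]


def load(conn, records):
    """Load: write the transformed records into the destination table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS payments "
        "(id INTEGER PRIMARY KEY, name TEXT, amount_cents INTEGER)"
    )
    conn.executemany("INSERT INTO payments VALUES (?, ?, ?)", records)
    conn.commit()


# In-memory SQLite database standing in for the destination system.
conn = sqlite3.connect(":memory:")
load(conn, transform(extract(SOURCE_CSV)))
rows = conn.execute(
    "SELECT id, name, amount_cents FROM payments ORDER BY id"
).fetchall()
print(rows)
```

Here the source stores names with inconsistent spacing and case and amounts as decimal strings, while the destination stores cleaned names and integer cents, which is the "represents the data differently" part of the definition.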

ETL tools:

List of common batch run/incumbent ETL tools:

  • IBM InfoSphere DataStage
  • Informatica PowerCenter
  • Microsoft SSIS (SQL Server Integration Services)
  • Oracle Data Integrator
  • Oracle Warehouse Builder (OWB)
  • SAP Data Services
  • SAS Data Management
  • Elixir Repertoire for Data ETL
  • Data Migrator (IBI)
  • Talend Studio for Data Integration
  • Sagent Data Flow
  • Actian DataConnect
  • OpenText Integration Center
  • Cognos Data Manager
  • CloverETL
  • Centerprise Data Integrator
  • IBM InfoSphere Warehouse Edition
  • Pentaho Data Integration
  • Adeptia Integration Server
  • Syncsort DMX
  • QlikView Expressor
  • Relational Junction ETL Manager (Sesame Software)

Open source ETL tools

These solutions are the evolutionary middle step between incumbent batch-based tools and fully managed cloud-based solutions. They solve some of the problems that batch run tools do not, for example, handling real-time streaming data.

List of common open source ETL tools:

  • Apache Kafka
  • Apache NiFi
  • CloverETL
  • Jaspersoft ETL
  • Pentaho Kettle
  • Talend Open Studio

Cloud-native ETL tools

Many of today's ETL tools are cloud-based and run in real time. Cloud-based means the ETL solution is managed for you, so you need not worry about hardware costs, scaling, replication, or security, because these are usually built in.

List of common cloud-native ETL tools:

  • Alooma
  • Fivetran
  • Matillion
  • SnapLogic
  • Stitch Data

Real-time ETL tools

The demand for real-time support has moved the model from batch processing to one based on message queues and streams. Kafka has become the leading distributed message queue, and companies like Alooma have built SaaS or on-prem ETL solutions atop it.

Batch processing of ETL work makes little sense if your data, or the insights drawn from it, is needed instantly. Many applications work this way today: a tweet or social media update goes live immediately, not tomorrow!
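To make the queue-based model concrete, here is a rough sketch in which each event is transformed and loaded as soon as it arrives instead of waiting for a scheduled batch. It does not use Kafka or any vendor API: Python's standard `queue.Queue` stands in for the message queue, a plain list stands in for the destination, and the event fields and function names are invented for the example.

```python
import queue
import threading


def transform(event):
    """Normalise one event; the field names are illustrative only."""
    return {"user": event["user"].lower(), "chars": len(event["text"])}


def stream_worker(source, sink, stop):
    """Transform and load each event as soon as it arrives on the queue."""
    while True:
        event = source.get()  # blocks until the next event is available
        if event is stop:
            break
        sink.append(transform(event))


STOP = object()         # sentinel marking the end of the stream
events = queue.Queue()  # stand-in for a message queue such as a Kafka topic
destination = []        # stand-in for the destination system

worker = threading.Thread(target=stream_worker, args=(events, destination, STOP))
worker.start()

# A producer publishes events one at a time; the worker handles each immediately.
events.put({"user": "Ann", "text": "hello"})
events.put({"user": "BEN", "text": "real-time ETL"})
events.put(STOP)
worker.join()
print(destination)
```

A batch job would instead collect all events first and run the transform once on a schedule; in the streaming version, the gap between an event being produced and being loaded is only the time taken to process that one event.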

Here's a list of common real-time ETL tools:

  • Alooma
  • Confluent
  • StreamSets
  • Striim

 

BI tools

BI and analytics tools cover everything you do with the data to get insights once you've captured it. They include tools for visualization, data science analysis, analytics, and KPIs.

List of common BI and analytics tools:

  • SAP Business Intelligence
  • MicroStrategy
  • Dundas BI
  • Yellowfin BI
  • TIBCO Spotfire
  • Hevo Data
  • Microsoft Power BI
  • Looker
  • Clear Analytics
  • Tableau
  • Oracle BI
  • Domo
  • QlikView
  • Pentaho
  • TIBCO Jaspersoft
  • BIRT
  • IBM Cognos Analytics
  • Style Intelligence
  • Netlink
