All task areas

Data Engineering

Pipelines, warehouses, transforms

4 tasks4 tools
1

Orchestrate data pipelines

airflow
Data Engineering

Apache Airflow

The leading workflow orchestration platform for data pipelines. Airflow lets you define, schedule, and monitor complex DAG-based pipelines in Python. The standard for data engineers and ML pipeline orchestration.

PythonOpen SourceSelf-hosted
93
Trust
Excellent
Compare:vs dbt
2

Transform & model warehouse data

dbt
Data Engineering

dbt

The leading data transformation tool for analytics engineers. dbt lets you write SQL SELECT statements and handles materialization, testing, documentation, and lineage. It transformed how data teams work.

PythonSQLOpen Source
52
Trust
Limited
3

Query a cloud data warehouse

snowflake
Data Engineering

Snowflake

The leading cloud data warehouse. Snowflake separates compute from storage, scales elastically, and supports structured and semi-structured data. The primary target warehouse for most modern data stacks paired with dbt.

SQLSaaSPaid
80
Trust
Strong
4

Build a real-time stream pipeline

kafka
Data Engineering

Apache Kafka

The dominant distributed event streaming platform. Kafka handles high-throughput, durable message queues for real-time data pipelines and event-driven architectures. The backbone of modern data infrastructure at scale.

JavaOpen SourceSelf-hosted
92
Trust
Excellent