apache/gobblin

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apache/gobblin)

apache / gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

☆2,270

Alternatives and similar repositories for gobblin

Users that are interested in gobblin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LinkedInAttic / camus
View on GitHub
LinkedIn's previous generation Kafka to HDFS pipeline.
☆881Aug 27, 2020Updated 5 years ago
apache / pinot
View on GitHub
Apache Pinot - A realtime distributed OLAP datastore
☆6,117Updated this week
linkedin / dr-elephant
View on GitHub
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370Aug 22, 2023Updated 2 years ago
apache / incubator-heron
View on GitHub
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629Mar 1, 2023Updated 3 years ago
apache / druid
View on GitHub
Apache Druid: a high performance real-time analytics database.
☆14,034Updated this week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
prestodb / presto
View on GitHub
The official home of the Presto distributed SQL query engine for big data
☆16,719Updated this week
uber-archive / AthenaX
View on GitHub
SQL-based streaming analytics platform at scale
☆1,224Jun 21, 2020Updated 6 years ago
yahoo / CMAK
View on GitHub
CMAK is a tool for managing Apache Kafka clusters
☆11,925Aug 2, 2023Updated 2 years ago
azkaban / azkaban
View on GitHub
Azkaban workflow manager.
☆4,511Jul 3, 2024Updated 2 years ago
apache / kylin
View on GitHub
Apache Kylin
☆3,769Jul 16, 2026Updated last week
datahub-project / datahub
View on GitHub
The Context Platform for your Data and AI Stack
☆12,320Updated this week
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,213Apr 29, 2025Updated last year
apache / zeppelin
View on GitHub
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,645Updated this week
Netflix / metacat
View on GitHub
☆1,687Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
apache / hudi
View on GitHub
Upserts, Deletes And Incremental Processing on Big Data.
☆6,192Updated this week
confluentinc / kafka-connect-hdfs
View on GitHub
Kafka Connect HDFS connector
☆27Updated this week
linkedin / databus
View on GitHub
Source-agnostic distributed change data capture system
☆3,678Sep 28, 2023Updated 2 years ago
apache / drill
View on GitHub
Apache Drill is a distributed MPP query layer for self describing data
☆2,022Jul 15, 2026Updated last week
pinterest / secor
View on GitHub
Secor is a service implementing Kafka log persistence
☆1,857Mar 10, 2026Updated 4 months ago
apache / beam
View on GitHub
Apache Beam is a unified programming model for Batch and Streaming data processing.
☆8,636Updated this week
apache / kudu
View on GitHub
Mirror of Apache Kudu
☆1,904Updated this week
Teradata / kylo
View on GitHub
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…
☆1,111Jan 12, 2023Updated 3 years ago
apache / flink
View on GitHub
Apache Flink
☆26,202Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,837Mar 3, 2026Updated 4 months ago
apache / helix
View on GitHub
Mirror of Apache Helix
☆503Jul 9, 2026Updated last week
linkedin / Burrow
View on GitHub
Kafka Consumer Lag Checking
☆3,953Jul 16, 2026Updated last week
apache / carbondata
View on GitHub
High performance data store solution
☆1,448Jul 4, 2026Updated 2 weeks ago
airbnb / airpal
View on GitHub
Web UI for PrestoDB.
☆2,746May 20, 2021Updated 5 years ago
delta-io / delta
View on GitHub
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆8,924Updated this week
apache / calcite
View on GitHub
Apache Calcite
☆5,160Updated this week
apache / samza
View on GitHub
Mirror of Apache Samza
☆846May 15, 2026Updated 2 months ago
apache / griffin
View on GitHub
Mirror of Apache griffin
☆1,172Aug 3, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
OryxProject / oryx
View on GitHub
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
☆1,783Aug 16, 2021Updated 4 years ago
apache / ignite
View on GitHub
Apache Ignite
☆5,073Updated this week
uber / marmaray
View on GitHub
Generic Data Ingestion & Dispersal Library for Hadoop
☆483Mar 19, 2023Updated 3 years ago
linkedin / kafka-monitor
View on GitHub
Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived …
☆2,061Mar 9, 2025Updated last year
cloudera / hue
View on GitHub
Open source SQL Query Assistant service for Databases/Warehouses
☆1,413Updated this week
Netflix / genie
View on GitHub
Distributed Big Data Orchestration Service
☆1,763Jul 13, 2026Updated last week
confluentinc / schema-registry
View on GitHub
Confluent Schema Registry for Kafka
☆2,455Updated this week