LinkedInAttic / apache-incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
☆11 · Updated 7 years ago
Alternatives and similar repositories for apache-incubator-gobblin
Users who are interested in apache-incubator-gobblin are comparing it to the libraries listed below.
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 · Updated 2 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆61 · Updated last year
- Demonstration of a Hive Input Format for Iceberg ☆26 · Updated 4 years ago
- Mirror of Apache Tephra (Incubating) ☆32 · Updated 2 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:… ☆70 · Updated 2 years ago
- The Apache Storm implementation of the Bullet backend ☆40 · Updated 2 years ago
- Pulsar IO Kafka Connector ☆24 · Updated 2 years ago
- Example application demonstrating how to integrate all of the components of Hortonworks DataFlow. ☆14 · Updated 8 years ago
- Common utilities for Apache Kafka ☆36 · Updated last year
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths ☆42 · Updated 9 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro ☆18 · Updated last year
- Flink Examples ☆39 · Updated 9 years ago
- Generic Model Serving Implementation leveraging Flink ☆19 · Updated 6 years ago
- Insight Engineering Platform Components ☆91 · Updated 2 weeks ago
- A distributed database with a built in streaming data platform ☆58 · Updated 5 months ago
- Ambari stack service for easily installing and managing Solr on HDP cluster ☆19 · Updated 6 years ago
- Quark is a data virtualization engine over analytic databases. ☆98 · Updated 8 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework ☆26 · Updated 6 years ago
- Spark Connector to read and write with Pulsar ☆115 · Updated last week
- Dione - a Spark and HDFS indexing library ☆52 · Updated last year
- LinkedIn's version of Apache Calcite ☆23 · Updated this week
- Cascading on Apache Flink® ☆54 · Updated last year
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆30 · Updated 9 years ago
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s… ☆13 · Updated 9 years ago
- Temporal_Graph_library ☆25 · Updated 6 years ago
- A shim for using Cassandra as a backend for OpenTSDB. Not to be used as a general Cassandra client. ☆7 · Updated 6 years ago
- A template-based cluster provisioning system ☆61 · Updated 2 years ago
- Fluorite: Apache Calcite trace analyzer ☆12 · Updated 6 years ago
- A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid. ☆130 · Updated 6 months ago
- A composable framework for fast and scalable data analytics ☆57 · Updated 2 years ago