LinkedInAttic / apache-incubator-gobblinLinks
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
☆11Updated 7 years ago
Alternatives and similar repositories for apache-incubator-gobblin
Users that are interested in apache-incubator-gobblin are comparing it to the libraries listed below
Sorting:
- Cascading on Apache Flink®☆54Updated last year
- Mirror of Apache Tephra (Incubating)☆32Updated 2 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆61Updated last year
- Jetstream Esper Processor implementation☆23Updated 9 years ago
- Kaltura's next generation Analytics solution based on Spark, Cassandra and Kafka☆12Updated 2 years ago
- A shim for using Cassandra as a backend for OpenTSDB. Not to be used as a general Cassandra client.☆7Updated 6 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Example using Grafana with Druid☆11Updated 10 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Common utilities for Apache Kafka☆36Updated last year
- Read druid segments from hadoop☆10Updated 8 years ago
- Mirror of Apache DirectMemory☆52Updated last year
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository)☆18Updated 7 years ago
- Annotation driven Java object writer for ORC with runtime code generation for speed.☆21Updated last year
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Hadoop log aggregator and dashboard☆191Updated 11 years ago
- A Kafka Streams process to convert __consumer_offsets to a JSON-readable topic☆13Updated 5 years ago
- Temporal_Graph_library☆25Updated 6 years ago
- Cassandra appenders for Log4j☆20Updated 2 years ago
- This project provides utilities and wrappers around ZooKeeper☆27Updated 10 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…☆70Updated 2 years ago
- Llama - Low Latency Application MAster☆34Updated 2 years ago
- Insight Engineering Platform Components☆91Updated this week
- Apache Tephra: Transactions for HBase.☆157Updated 8 months ago
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s…☆13Updated 9 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- Fast and scalable timeseries database☆25Updated 4 years ago
- Utility code for java and jvm-based languages☆52Updated 2 years ago