LinkedInAttic / apache-incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
☆11Updated 7 years ago
Related projects: ⓘ
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆59Updated 9 months ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated last year
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Updated 5 years ago
- Docker Image for Kudu☆38Updated 5 years ago
- Temporal_Graph_library☆25Updated 5 years ago
- Cascading on Apache Flink®☆54Updated 7 months ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- Apache Amaterasu☆56Updated 4 years ago
- ☆40Updated this week
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…☆71Updated last year
- A shim for using Cassandra as a backend for OpenTSDB. Not to be used as a general Cassandra client.☆7Updated 5 years ago
- ☆47Updated 4 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Demo quering counts of a event stream with Apache Flink☆23Updated 6 years ago
- A template-based cluster provisioning system☆61Updated last year
- A Kafka Streams process to convert __consumer_offsets to a JSON-readable topic☆13Updated 4 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- ☆23Updated this week
- Scripts to build a Docker image with Apache Impala with Kudu support (no HDFS needed)☆17Updated 3 years ago
- Read druid segments from hadoop☆11Updated 7 years ago
- Flink Examples☆39Updated 8 years ago
- ☆18Updated this week
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Updated 7 years ago
- Fast and scalable timeseries database☆25Updated 4 years ago
- Common utilities for Apache Kafka☆36Updated last year
- Mirror of Apache MRQL (Incubating)☆17Updated 7 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Updated 6 years ago