pranab / chombo
Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm
☆102 · Updated last year
Alternatives and similar repositories for chombo
Users who are interested in chombo are comparing it to the repositories listed below.
- ☆49 · Updated 5 years ago
- StreamLine - Streaming Analytics ☆164 · Updated last year
- Flink Examples ☆39 · Updated 9 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆41 · Updated 8 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra ☆62 · Updated 5 years ago
- An example Apache Beam project. ☆111 · Updated 8 years ago
- Build configuration-driven ETL pipelines on Apache Spark ☆159 · Updated 2 years ago
- spark + drools ☆103 · Updated 3 years ago
- High performance HBase / Spark SQL engine ☆28 · Updated 3 years ago
- A visual ETL development and debugging tool for big data ☆153 · Updated 2 years ago
- Apache Flink™ training material website ☆78 · Updated 5 years ago
- Collection of examples integrating NiFi with stream process frameworks. ☆59 · Updated 8 years ago
- DataQuality for BigData ☆144 · Updated last year
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆61 · Updated last year
- Java Client of the Spark Job Server implementing the arranged REST APIs ☆50 · Updated 4 years ago
- Helpful user defined functions / table generating functions for Hive ☆101 · Updated 9 years ago
- Simple example for Spark Streaming over Kafka topic ☆106 · Updated 4 years ago
- Apache Spark based ETL Engine ☆71 · Updated 8 years ago
- Example project showing how to use Hive UDFs in Apache Spark ☆55 · Updated 6 years ago
- ☆105 · Updated 5 years ago
- Testbench for experimenting with Apache Hive at any data scale. ☆64 · Updated 8 years ago
- Quark is a data virtualization engine over analytic databases. ☆98 · Updated 8 years ago
- DataFibers Data Service ☆31 · Updated 3 years ago
- ☆25 · Updated 8 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple… ☆26 · Updated 4 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project. ☆30 · Updated 9 years ago
- Continuous scalable web crawler built on top of Flink and crawler-commons ☆52 · Updated 6 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator ☆73 · Updated 5 years ago
- This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch. ☆41 · Updated 8 years ago
- A demo repository for "streaming etl" with Apache Flink ☆44 · Updated 9 years ago