xmlking / cdc-kafka-hadoopLinks
MySQL to NoSQL real time dataflow
☆18Updated 8 years ago
Alternatives and similar repositories for cdc-kafka-hadoop
Users that are interested in cdc-kafka-hadoop are comparing it to the libraries listed below
Sorting:
- Real-time analytics in Apache Flume☆52Updated 9 years ago
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆103Updated last year
- Ambari stack for easily installing and managing Redis on HDP cluster☆14Updated 10 years ago
- ☆50Updated 5 years ago
- Java Client of the Spark Job Server implementing the arranged Rest APIs☆51Updated 4 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 8 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Updated 9 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Updated 2 months ago
- conbine flume,spark-streaming and redis for real-time computing☆22Updated 11 years ago
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark☆12Updated 9 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 11 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Updated 11 years ago
- Self-contained examples using Apache Spark with the functional features of Java 8☆65Updated 7 years ago
- Flink performance tests☆28Updated 6 years ago
- High performance HBase / Spark SQL engine☆28Updated 3 years ago
- Some extensions to Flume to help with collecting logs and storing as Avro.☆17Updated 11 years ago
- Using Spark SQLContext, HiveContext & Spark DataFrames API with ElasticSearch, Cassandra & MongoDB☆22Updated 9 years ago
- Flink Examples☆38Updated 9 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- StreamLine - Streaming Analytics☆165Updated 2 years ago
- ElasticSearch integration for Apache Spark☆47Updated 9 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- Apache Spark based ETL Engine☆71Updated 9 years ago
- Customer Product search clicks analytics using big data Hadoop, Hive, Oozie, ElasticSearch, Akka, Spring Data☆73Updated 3 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Updated 9 years ago
- ☆56Updated 11 years ago
- Ambari service for Presto☆44Updated 10 months ago
- This is a simple CEP Engine leveraging the Kafka Streams platform☆16Updated 8 years ago
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆10Updated 3 years ago
- CDAP Cube Dataset Guide☆12Updated 8 years ago