cdap-guides / cdap-cube-guide
CDAP Cube Dataset Guide
☆12Updated 7 years ago
Alternatives and similar repositories for cdap-cube-guide:
Users that are interested in cdap-cube-guide are comparing it to the libraries listed below
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Cascading on Apache Flink®☆54Updated last year
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- Spark Connector for Hazelcast☆22Updated 3 years ago
- Muppet☆126Updated 3 years ago
- ByteBuffer collection classes for java and jvm-based languages.☆33Updated 6 years ago
- Flink Examples☆39Updated 8 years ago
- source examples to support the "Cascading for the Impatient" blog post series☆79Updated 8 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.☆29Updated 4 years ago
- Examples of user defined functions for Apache Drill☆19Updated 7 years ago
- Flink performance tests☆28Updated 5 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- Integration for Cascading and Apache Hive☆26Updated 7 years ago
- ☆17Updated 9 years ago
- Collection of generic Apache Flink operators☆17Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Mirror of Apache Tephra (Incubating)☆31Updated last year
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- Sparking Using Java8☆17Updated 9 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- functionstest☆33Updated 8 years ago
- Scala stuff☆18Updated 5 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Updated 6 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- ☆33Updated 10 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Apache Spark jobs such as Principal Coordinate Analysis.☆74Updated 8 years ago