Azure / azure-data-lake-store-java
Microsoft Azure Data Lake Store Filesystem Library for Java
☆20 · Updated last year
Related projects:
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ. ☆32 · Updated 2 years ago
- Apache Beam Site ☆29 · Updated last week
- ☆18 · Updated this week
- TPC-DS benchmark kit with some modifications/additions ☆10 · Updated 8 years ago
- ☆14 · Updated this week
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 · Updated 8 years ago
- Random implementation notes ☆33 · Updated 11 years ago
- proof-of-concept implementation of Pig-on-Spark integrated at the logical node level ☆28 · Updated 2 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight ☆14 · Updated last year
- Apache Amaterasu ☆56 · Updated 4 years ago
- Apache Incubator Proposal for Heron ☆22 · Updated 8 years ago
- ☆37 · Updated this week
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆59 · Updated 9 months ago
- A utility for generating Oozie workflows from a YAML definition ☆48 · Updated 5 years ago
- Library for organizing batch processing pipelines in Apache Spark ☆41 · Updated 7 years ago
- Cascading on Apache Flink® ☆54 · Updated 7 months ago
- Tool for visualizing Apache Oozie pipelines ☆12 · Updated 8 years ago
- Common components used across the datamountaineer kafka connect connectors ☆21 · Updated 3 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 · Updated 7 years ago
- Spark cloud integration: tests, cloud committers and more ☆19 · Updated 6 months ago
- Dione - a Spark and HDFS indexing library ☆49 · Updated 6 months ago
- Schema Registry integration for Apache Spark ☆39 · Updated last year
- Mirror of Apache Slider ☆78 · Updated 5 years ago
- ☆62 · Updated this week
- Use Cascading Taps and Scalding DSL with Spark ☆49 · Updated 7 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A… ☆13 · Updated 7 years ago
- Integration tests for Spark ☆69 · Updated last year
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status ☆35 · Updated 5 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 · Updated 3 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner) ☆88 · Updated 8 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format ☆126 · Updated 2 years ago