uber / uberscriptquery
UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy
☆59Updated 9 months ago
Related projects: ⓘ
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Cascading on Apache Flink®☆54Updated 7 months ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 7 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Druid indexing plugin for using Spark in batch jobs☆101Updated 2 years ago
- Spark SQL index for Parquet tables☆132Updated 3 years ago
- Apache Flink™ training material website☆79Updated 4 years ago
- Generic Model Serving Implementation leveraging Flink☆20Updated 5 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- ☆71Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆64Updated 4 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- Quark is a data virtualization engine over analytic databases.☆98Updated 7 years ago
- Helpful user defined fuctions / table generating functions for Hive☆101Updated 8 years ago
- Spark Example using Phoenix to interact with HBase☆16Updated 7 years ago
- Sample UDF and UDAs for Impala.☆63Updated 4 years ago
- proof-of-concept implementation of Pig-on-Spark integrated at the logical node level☆28Updated 2 years ago
- Mirror of Apache Lens☆60Updated 4 years ago
- ☆54Updated 10 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆51Updated 8 years ago
- Flink Examples☆39Updated 8 years ago
- TeraSort for Spark and Flink which uses a range partitioner based on sampling☆23Updated 8 years ago
- Mirror of Apache Slider☆78Updated 5 years ago
- Apache Calcite Tutorial☆33Updated 8 years ago
- Flowmix is a flexible event processing engine for Apache Storm. It supports complex correlations of events via sliding/tumbling windows. …☆54Updated 8 years ago
- Schema Registry integration for Apache Spark☆39Updated last year
- Flink performance tests☆20Updated 9 years ago