UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy
☆64Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for uberscriptquery
Users that are interested in uberscriptquery are comparing it to the libraries listed below
Sorting:
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Feb 10, 2016Updated 10 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- A Scala version of my `sbtmkdirs` shell script☆11Feb 27, 2021Updated 5 years ago
- A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.☆131Jan 17, 2025Updated last year
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 8 years ago
- Second generation of the ICGC DCC release ETL built on Spark☆10Apr 8, 2019Updated 6 years ago
- ☆16Feb 24, 2017Updated 9 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 5 years ago
- Quick Akka Micro Dag Prototype☆13Apr 8, 2016Updated 9 years ago
- Generic Model Serving Implementation leveraging Flink☆19Jan 3, 2019Updated 7 years ago
- Redis search and indexing in Java☆16Sep 26, 2016Updated 9 years ago
- An experiment to inject a customized parser using SparkSessionExtension☆16Jan 1, 2018Updated 8 years ago
- Spark SQL index for Parquet tables☆134May 6, 2021Updated 4 years ago
- Simple implementations of forward- and backward-mode automatic differentation in Scala☆23Jun 21, 2018Updated 7 years ago
- A quotation-based Scala DSL for scalable data analysis.☆63Jul 7, 2022Updated 3 years ago
- Java chat example app☆11Mar 11, 2022Updated 4 years ago
- RESP (REdis Serialization Protocol) encoder and decoder.☆19Dec 6, 2015Updated 10 years ago
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- Spark SQL DBF Library☆16Jan 2, 2015Updated 11 years ago
- 使用spark + kudu的案例☆15Sep 13, 2017Updated 8 years ago
- ☆13Aug 13, 2018Updated 7 years ago
- customer visualization for splunk using echarts☆15May 11, 2017Updated 8 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆13Mar 20, 2023Updated 3 years ago
- Code examples for my blog posts☆22Nov 7, 2018Updated 7 years ago
- JanusGraph: an open-source, distributed graph database☆14Aug 10, 2017Updated 8 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆280Aug 3, 2018Updated 7 years ago
- Library for organizing batch processing pipelines in Apache Spark☆42Jan 4, 2017Updated 9 years ago
- A DSL for constructing json-like data sturctures☆39Feb 27, 2024Updated 2 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- Scala API for distributed closures on Apache Ignite☆11Jun 6, 2015Updated 10 years ago
- Db2 JDBC connector for Trino☆19Jan 6, 2023Updated 3 years ago
- Apache Parquet reader in Scala without Apache Spark - developed at Purdue University☆12Feb 17, 2017Updated 9 years ago
- Benchmark of common hash functions☆10Sep 15, 2019Updated 6 years ago
- xpath_proto_builder is a library to convert objects (JSON, XML, POJO) into protobuf using xpath notation.☆18Nov 15, 2022Updated 3 years ago
- Big Data Toolkit for the JVM☆147Nov 4, 2020Updated 5 years ago
- A set of tools to ease working with Zookeeper and Kafka.☆23Jan 22, 2016Updated 10 years ago