UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy
☆64Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for uberscriptquery
Users that are interested in uberscriptquery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28May 15, 2020Updated 6 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Feb 10, 2016Updated 10 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.☆132Jan 17, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 8 years ago
- Second generation of the ICGC DCC release ETL built on Spark☆10Apr 8, 2019Updated 7 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 6 years ago
- Quick Akka Micro Dag Prototype☆13Apr 8, 2016Updated 10 years ago
- Generic Model Serving Implementation leveraging Flink☆19Jan 3, 2019Updated 7 years ago
- Redis search and indexing in Java☆16Sep 26, 2016Updated 9 years ago
- Experimenting with Vaadin in OSGi☆14Apr 28, 2010Updated 16 years ago
- Spark SQL index for Parquet tables☆134May 6, 2021Updated 5 years ago
- Simple implementations of forward- and backward-mode automatic differentation in Scala☆23Jun 21, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆14Oct 5, 2022Updated 3 years ago
- RESP (REdis Serialization Protocol) encoder and decoder.☆19Dec 6, 2015Updated 10 years ago
- Cache File System optimized for columnar formats and object stores☆188Aug 11, 2022Updated 3 years ago
- Spark SQL DBF Library☆16Jan 2, 2015Updated 11 years ago
- 使用spark + kudu的案例☆15Sep 13, 2017Updated 8 years ago
- customer visualization for splunk using echarts☆15May 11, 2017Updated 9 years ago
- Code examples for my blog posts☆22Nov 7, 2018Updated 7 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- Library for organizing batch processing pipelines in Apache Spark☆42Jan 4, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A DSL for constructing json-like data sturctures☆39Feb 27, 2024Updated 2 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- Scala API for distributed closures on Apache Ignite☆11Jun 6, 2015Updated 10 years ago
- Db2 JDBC connector for Trino☆19Jan 6, 2023Updated 3 years ago
- Apache Parquet reader in Scala without Apache Spark - developed at Purdue University☆12Feb 17, 2017Updated 9 years ago
- A prototype meta DSL that generates Delite DSL implementations from a specification-like program.☆51Feb 28, 2017Updated 9 years ago
- A set of tools to ease working with Zookeeper and Kafka.☆23Jan 22, 2016Updated 10 years ago
- A tool to validate data, built around Apache Spark.☆102May 13, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Apache Streams☆80Apr 24, 2025Updated last year
- Transporter for integrating OpenLineage with OpenMetadata☆18Sep 10, 2025Updated 8 months ago
- A minimal seed template for an Akka gRPC with Scala build☆19Jan 22, 2026Updated 3 months ago
- Stream computing platform for bigdata☆408Apr 24, 2024Updated 2 years ago
- Look for SQL injection attacks in python source code☆126Mar 5, 2019Updated 7 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆76Apr 24, 2024Updated 2 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆65Apr 18, 2017Updated 9 years ago