Drizzle integration with Apache Spark
☆120Sep 11, 2018Updated 7 years ago
Alternatives and similar repositories for drizzle-spark
Users that are interested in drizzle-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Apr 27, 2017Updated 9 years ago
- Spark Connector for Hazelcast☆22Jun 9, 2021Updated 4 years ago
- Mirror of Apache crail (Incubating)☆151Jul 3, 2022Updated 3 years ago
- JVM integration for Weld☆16Sep 24, 2018Updated 7 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Aug 23, 2017Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Sparkling Water provides H2O functionality inside Spark cluster☆977Nov 5, 2025Updated 6 months ago
- Stream Data Mining Library for Spark Streaming☆497Apr 16, 2023Updated 3 years ago
- ☆13Mar 8, 2018Updated 8 years ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- Apache Incubator Proposal for Heron☆22Feb 17, 2016Updated 10 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,034Nov 21, 2022Updated 3 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆646Dec 17, 2023Updated 2 years ago
- Mirror of Apache Apex core☆350Jun 7, 2021Updated 4 years ago
- An extension of Yahoo's Benchmarks☆109Dec 18, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Oct 5, 2022Updated 3 years ago
- Custom state store providers for Apache Spark☆92Feb 14, 2025Updated last year
- An efficient updatable key-value store for Apache Spark☆255Mar 11, 2017Updated 9 years ago
- Simple Lambda Architecture implementation based on Apache Spark (Core, SQL, Streaming)☆40Feb 19, 2017Updated 9 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 10 years ago
- ☆10Apr 10, 2014Updated 12 years ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆335Sep 29, 2023Updated 2 years ago
- a tailored Apache Calcite for Apache Kylin, more details at http://mail-archives.apache.org/mod_mbox/kylin-dev/201704.mbox/%3CCAF7etT=wEB…☆14Nov 7, 2025Updated 6 months ago
- ☆102Mar 23, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Distributed Prometheus time series database☆1,464May 20, 2026Updated last week
- Templates for projects based on top of H2O.☆39Mar 17, 2025Updated last year
- Distributed Neural Networks for Spark☆610Jul 23, 2020Updated 5 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆110Feb 1, 2018Updated 8 years ago
- a benchmark to test scalability of xgboost4j-spark and relevant projects☆22Dec 20, 2019Updated 6 years ago
- A NiFi client library for JVM languages☆13Mar 18, 2016Updated 10 years ago
- Example of orchestrating dependent Databricks jobs using Airflow☆11Dec 19, 2019Updated 6 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,852Jul 10, 2023Updated 2 years ago
- Data pipeline automation tool☆28Jan 11, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CS294 RISE Course Material☆32Jan 23, 2019Updated 7 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Large-scale event processing with Akka Persistence and Apache Spark☆272Jun 18, 2016Updated 9 years ago
- A chef cookbook for deploying spark☆30Apr 14, 2013Updated 13 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- Sparrow scheduling platform (U.C. Berkeley).☆328Jul 25, 2020Updated 5 years ago
- Mirror of Apache Bahir☆337Jul 7, 2023Updated 2 years ago