amplab / drizzle-spark
Drizzle integration with Apache Spark
☆120 · Sep 11, 2018 · Updated 7 years ago
Alternatives and similar repositories for drizzle-spark
Users who are interested in drizzle-spark are comparing it to the libraries listed below.
- Spark Structured Streaming Kafka 0.8 Source Implementation · ☆35 · Apr 27, 2017 · Updated 8 years ago
- JVM integration for Weld · ☆16 · Sep 24, 2018 · Updated 7 years ago
- Sparkling Water provides H2O functionality inside Spark cluster · ☆977 · Nov 5, 2025 · Updated 3 months ago
- Mirror of Apache crail (Incubating) · ☆151 · Jul 3, 2022 · Updated 3 years ago
- Spark Connector for Hazelcast · ☆22 · Jun 9, 2021 · Updated 4 years ago
- Templates for projects based on top of H2O. · ☆38 · Mar 17, 2025 · Updated 10 months ago
- Stream Data Mining Library for Spark Streaming · ☆500 · Apr 16, 2023 · Updated 2 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support… · ☆108 · Feb 1, 2018 · Updated 8 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… · ☆1,035 · Nov 21, 2022 · Updated 3 years ago
- A Hivemall wrapper for Spark · ☆31 · Apr 21, 2016 · Updated 9 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere · ☆1,007 · Oct 5, 2022 · Updated 3 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ... · ☆647 · Dec 17, 2023 · Updated 2 years ago
- CS294 RISE Course Material · ☆32 · Jan 23, 2019 · Updated 7 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization · ☆47 · Aug 23, 2017 · Updated 8 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming · ☆531 · Oct 24, 2019 · Updated 6 years ago
- Large-scale event processing with Akka Persistence and Apache Spark · ☆273 · Jun 18, 2016 · Updated 9 years ago
- ☆33 · Jan 9, 2016 · Updated 10 years ago
- An efficient updatable key-value store for Apache Spark · ☆254 · Mar 11, 2017 · Updated 8 years ago
- Distributed Neural Networks for Spark · ☆608 · Jul 23, 2020 · Updated 5 years ago
- Serverless proxy for Spark cluster · ☆324 · Oct 29, 2020 · Updated 5 years ago
- Mirror of Apache Bahir · ☆335 · Jul 7, 2023 · Updated 2 years ago
- Distributed Prometheus time series database · ☆1,462 · Feb 4, 2026 · Updated last week
- Mirror of Apache Toree (Incubating) · ☆749 · Feb 7, 2026 · Updated last week
- Distributed Temporal Graph Analytics with Apache Flink · ☆252 · Jan 11, 2026 · Updated last month
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015. · ☆10 · Jun 1, 2015 · Updated 10 years ago
- Example of orchestrating dependent Databricks jobs using Airflow · ☆11 · Dec 19, 2019 · Updated 6 years ago
- Sprint Planning / Scrum Poker online tool (Akka/Socko Websockets) · ☆19 · Dec 22, 2015 · Updated 10 years ago
- Computation using data flow graphs for scalable machine learning · ☆35 · Apr 20, 2017 · Updated 8 years ago
- Data pipeline automation tool · ☆27 · Jan 11, 2024 · Updated 2 years ago
- training material · ☆47 · Oct 24, 2024 · Updated last year
- analytics tool kit · ☆43 · Jan 23, 2017 · Updated 9 years ago
- Write your Spark data to Kafka seamlessly · ☆174 · Jul 10, 2024 · Updated last year
- Source code for Reactive Application Development · ☆61 · Sep 12, 2017 · Updated 8 years ago
- Prescriptive Applications over Kite and Hadoop · ☆12 · Oct 14, 2015 · Updated 10 years ago
- Visualize streaming machine learning in Spark · ☆177 · Jun 29, 2017 · Updated 8 years ago
- Active learning of GP hyperparameters following Garnett, et al., "Active Learning of Linear Embeddings for Gaussian Processes," (UAI 2014… · ☆16 · Aug 4, 2017 · Updated 8 years ago
- ☆50 · Sep 29, 2020 · Updated 5 years ago
- ☆22 · May 31, 2016 · Updated 9 years ago
- The missing MatPlotLib for Scala + Spark · ☆731 · Jan 30, 2022 · Updated 4 years ago