An efficient updatable key-value store for Apache Spark
☆254 · Mar 11, 2017 · Updated 9 years ago
Alternatives and similar repositories for spark-indexedrdd
Users that are interested in spark-indexedrdd are comparing it to the libraries listed below.
- Persistent Adaptive Radix Trees in Java ☆82 · Oct 5, 2020 · Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆475 · Apr 18, 2017 · Updated 8 years ago
- Secondary sort and streaming reduce for Apache Spark ☆78 · Jul 3, 2023 · Updated 2 years ago
- Enabling queries on compressed data. ☆282 · Dec 16, 2023 · Updated 2 years ago
- Spatial In-Memory Big data Analytics ☆125 · Feb 26, 2019 · Updated 7 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark ☆146 · Jan 26, 2016 · Updated 10 years ago
- Distributed Neural Networks for Spark ☆611 · Jul 23, 2020 · Updated 5 years ago
- Distributed Matrix Library ☆72 · Jan 28, 2017 · Updated 9 years ago
- REST job server for Apache Spark ☆2,844 · Mar 3, 2026 · Updated 3 weeks ago
- A library for time series analysis on Apache Spark ☆1,198 · Oct 13, 2020 · Updated 5 years ago
- Scripts to launch cluster used for Strata ☆33 · Feb 11, 2014 · Updated 12 years ago
- Low level integration of Spark and Kafka ☆131 · Mar 15, 2018 · Updated 8 years ago
- Locality Sensitive Hashing for Apache Spark ☆198 · Nov 1, 2016 · Updated 9 years ago
- Recipes and examples for Apache Spark ☆13 · Jan 21, 2015 · Updated 11 years ago
- Base classes to use when writing tests with Spark ☆1,549 · Updated this week
- Stream Data Mining Library for Spark Streaming ☆498 · Apr 16, 2023 · Updated 2 years ago
- Scala client for the Lightning data visualization server (WIP) ☆47 · Jun 25, 2019 · Updated 6 years ago
- [NOTE: Repository has moved to github.com/amplab/spark-ec2] ☆57 · Aug 10, 2015 · Updated 10 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 · Oct 5, 2022 · Updated 3 years ago
- Additional useful algorithms that can be used with spark. ☆24 · Dec 24, 2014 · Updated 11 years ago
- LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data ☆43 · Jan 6, 2017 · Updated 9 years ago
- Large scale query engine benchmark ☆99 · Apr 5, 2016 · Updated 9 years ago
- Spark Connector for Hazelcast ☆22 · Jun 9, 2021 · Updated 4 years ago
- Training materials for Strata, AMP Camp, etc ☆149 · Nov 20, 2015 · Updated 10 years ago
- Fast JVM collection ☆60 · Mar 8, 2015 · Updated 11 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆660 · Feb 6, 2014 · Updated 12 years ago
- 2D R-Tree implementation in Scala ☆115 · Oct 8, 2019 · Updated 6 years ago
- Interactive and Reactive Data Science using Scala and Spark. ☆3,147 · May 16, 2023 · Updated 2 years ago
- functionstest ☆33 · Oct 25, 2016 · Updated 9 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces ☆315 · Apr 12, 2022 · Updated 3 years ago
- Mirror of Apache Toree (Incubating) ☆749 · Mar 20, 2026 · Updated last week
- Large-scale event processing with Akka Persistence and Apache Spark ☆273 · Jun 18, 2016 · Updated 9 years ago
- Distributed Prometheus time series database ☆1,459 · Updated this week
- Scala extensions for the Kryo serialization library ☆619 · Aug 19, 2024 · Updated last year
- Geo Spatial Data Analytics on Spark ☆536 · Aug 26, 2021 · Updated 4 years ago
- Automatic offload of user-written Spark kernels to accelerators ☆18 · Oct 25, 2016 · Updated 9 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark ☆170 · Feb 6, 2017 · Updated 9 years ago
- A Distributed Matrix Operations Library Built on Top of Spark ☆109 · Dec 28, 2016 · Updated 9 years ago
- SparkOnHBase ☆278 · Mar 30, 2021 · Updated 4 years ago