amplab / spark-indexedrdd
An efficient updatable key-value store for Apache Spark
☆254 · Mar 11, 2017 · Updated 8 years ago
Alternatives and similar repositories for spark-indexedrdd
Users who are interested in spark-indexedrdd are comparing it to the libraries listed below.
- Secondary sort and streaming reduce for Apache Spark · ☆78 · Jul 3, 2023 · Updated 2 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. · ☆476 · Apr 18, 2017 · Updated 8 years ago
- Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark · ☆146 · Jan 26, 2016 · Updated 10 years ago
- Enabling queries on compressed data. · ☆282 · Dec 16, 2023 · Updated 2 years ago
- Distributed Neural Networks for Spark · ☆611 · Jul 23, 2020 · Updated 5 years ago
- Distributed Matrix Library · ☆72 · Jan 28, 2017 · Updated 9 years ago
- REST job server for Apache Spark · ☆2,845 · Jul 8, 2025 · Updated 7 months ago
- Spatial In-Memory Big data Analytics · ☆123 · Feb 26, 2019 · Updated 6 years ago
- A library for time series analysis on Apache Spark · ☆1,195 · Oct 13, 2020 · Updated 5 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… · ☆1,034 · Nov 21, 2022 · Updated 3 years ago
- Locality Sensitive Hashing for Apache Spark · ☆196 · Nov 1, 2016 · Updated 9 years ago
- Low level integration of Spark and Kafka · ☆130 · Mar 15, 2018 · Updated 7 years ago
- Stream Data Mining Library for Spark Streaming · ☆500 · Apr 16, 2023 · Updated 2 years ago
- Recipes and examples for Apache Spark · ☆13 · Jan 21, 2015 · Updated 11 years ago
- Base classes to use when writing tests with Spark · ☆1,550 · Dec 22, 2025 · Updated last month
- Spark Connector for Hazelcast · ☆22 · Jun 9, 2021 · Updated 4 years ago
- Benchmarks of BLAS libraries with Scala interface · ☆30 · Jan 21, 2016 · Updated 10 years ago
- A Distributed Matrix Operations Library Built on Top of Spark · ☆109 · Dec 28, 2016 · Updated 9 years ago
- LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data · ☆43 · Jan 6, 2017 · Updated 9 years ago
- Mirror of Apache Toree (Incubating) · ☆749 · Feb 7, 2026 · Updated last week
- Scala extensions for the Kryo serialization library · ☆618 · Aug 19, 2024 · Updated last year
- Livy is an open source REST interface for interacting with Apache Spark from anywhere · ☆1,007 · Oct 5, 2022 · Updated 3 years ago
- Large-scale event processing with Akka Persistence and Apache Spark · ☆273 · Jun 18, 2016 · Updated 9 years ago
- Distributed Prometheus time series database · ☆1,464 · Updated this week
- Real Time Analytics and Data Pipelines based on Spark Streaming · ☆531 · Oct 24, 2019 · Updated 6 years ago
- Interactive and Reactive Data Science using Scala and Spark. · ☆3,151 · May 16, 2023 · Updated 2 years ago
- Additional useful algorithms that can be used with Spark. · ☆24 · Dec 24, 2014 · Updated 11 years ago
- ☆110 · Apr 17, 2017 · Updated 8 years ago
- DistML provides a supplement to mllib to support model-parallel training on Spark · ☆169 · Feb 6, 2017 · Updated 9 years ago
- Scala client for the Lightning data visualization server (WIP) · ☆47 · Jun 25, 2019 · Updated 6 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning · ☆1,786 · Aug 16, 2021 · Updated 4 years ago
- Automatic offload of user-written Spark kernels to accelerators · ☆18 · Oct 25, 2016 · Updated 9 years ago
- MLeap allows for easily putting Spark ML pipelines into production · ☆78 · Oct 27, 2016 · Updated 9 years ago
- Large scale query engine benchmark · ☆99 · Apr 5, 2016 · Updated 9 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces · ☆315 · Apr 12, 2022 · Updated 3 years ago
- Joins for skewed datasets in Spark · ☆57 · Aug 18, 2017 · Updated 8 years ago
- Fast JVM collection · ☆60 · Mar 8, 2015 · Updated 10 years ago
- Glint: High performance Scala parameter server · ☆170 · Jul 20, 2018 · Updated 7 years ago
- ☆92 · Nov 15, 2015 · Updated 10 years ago