An efficient updatable key-value store for Apache Spark
☆255Mar 11, 2017Updated 9 years ago
Alternatives and similar repositories for spark-indexedrdd
Users that are interested in spark-indexedrdd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Persistent Adaptive Radix Trees in Java☆83Oct 5, 2020Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Jul 3, 2023Updated 2 years ago
- Enabling queries on compressed data.☆283Dec 16, 2023Updated 2 years ago
- Spatial In-Memory Big data Analytics☆125Feb 26, 2019Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Distributed Neural Networks for Spark☆610Jul 23, 2020Updated 5 years ago
- Distributed Matrix Library☆73Jan 28, 2017Updated 9 years ago
- REST job server for Apache Spark☆2,843Mar 3, 2026Updated 2 months ago
- A library for time series analysis on Apache Spark☆1,197Oct 13, 2020Updated 5 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,034Nov 21, 2022Updated 3 years ago
- Scripts to launch cluster used for Strata☆33Feb 11, 2014Updated 12 years ago
- Low level integration of Spark and Kafka☆131Mar 15, 2018Updated 8 years ago
- Locality Sensitive Hashing for Apache Spark☆198Nov 1, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Recipes and examples for Apache Spark☆13Jan 21, 2015Updated 11 years ago
- Base classes to use when writing tests with Spark☆1,554Apr 20, 2026Updated last month
- Stream Data Mining Library for Spark Streaming☆497Apr 16, 2023Updated 3 years ago
- Scala client for the Lightning data visualization server (WIP)☆47Jun 25, 2019Updated 6 years ago
- [NOTE: Repository has moved to github.com/amplab/spark-ec2]☆57Aug 10, 2015Updated 10 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Oct 5, 2022Updated 3 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data☆43Jan 6, 2017Updated 9 years ago
- Large scale query engine benchmark☆99Apr 5, 2016Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Spark Connector for Hazelcast☆22Jun 9, 2021Updated 4 years ago
- Training materials for Strata, AMP Camp, etc☆150Nov 20, 2015Updated 10 years ago
- Fast JVM collection☆60Mar 8, 2015Updated 11 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆661Feb 6, 2014Updated 12 years ago
- 2D R-Tree implementation in Scala☆115Oct 8, 2019Updated 6 years ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,150May 16, 2023Updated 3 years ago
- functionstest☆33Oct 25, 2016Updated 9 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆316Apr 12, 2022Updated 4 years ago
- Mirror of Apache Toree (Incubating)☆750Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Large-scale event processing with Akka Persistence and Apache Spark☆272Jun 18, 2016Updated 9 years ago
- Distributed Prometheus time series database☆1,464May 20, 2026Updated last week
- Scala extensions for the Kryo serialization library☆620Aug 19, 2024Updated last year
- Geo Spatial Data Analytics on Spark☆534Aug 26, 2021Updated 4 years ago
- Automatic offload of user-written Spark kernels to accelerators☆18Oct 25, 2016Updated 9 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆170Feb 6, 2017Updated 9 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆110Dec 28, 2016Updated 9 years ago