amplab/spark-indexedrdd

An efficient updatable key-value store for Apache Spark

☆255

Alternatives and similar repositories for spark-indexedrdd

Users interested in spark-indexedrdd are comparing it to the libraries listed below.

ankurdave / part
Persistent Adaptive Radix Trees in Java
☆83 · Oct 5, 2020 · Updated 5 years ago
amplab / keystone
Simplifying robust end-to-end machine learning on Apache Spark.
☆473 · Apr 18, 2017 · Updated 9 years ago
tresata / spark-sorted
Secondary sort and streaming reduce for Apache Spark
☆77 · Jul 3, 2023 · Updated 3 years ago
InitialDLab / Simba
Spatial In-Memory Big data Analytics
☆125 · Feb 26, 2019 · Updated 7 years ago
amplab / succinct
Enabling queries on compressed data.
☆282 · Dec 16, 2023 · Updated 2 years ago
databricks / spark-csv
CSV Data Source for Apache Spark 1.x
☆1,057 · Dec 13, 2018 · Updated 7 years ago
databricks / tensorframes
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744 · Jul 30, 2024 · Updated last year
collectivemedia / spark-ext
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
☆145 · Jan 26, 2016 · Updated 10 years ago
amplab / SparkNet
Distributed Neural Networks for Spark
☆610 · Jul 23, 2020 · Updated 6 years ago
amplab / ml-matrix
Distributed Matrix Library
☆73 · Jan 28, 2017 · Updated 9 years ago
spark-jobserver / spark-jobserver
REST job server for Apache Spark
☆2,836 · Mar 3, 2026 · Updated 4 months ago
databricks / spark-perf
Performance tests for Apache Spark
☆392 · Jul 9, 2018 · Updated 8 years ago
sryza / spark-timeseries
A library for time series analysis on Apache Spark
☆1,197 · Oct 13, 2020 · Updated 5 years ago
TIBCOSoftware / snappydata
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032 · Nov 21, 2022 · Updated 3 years ago
tresata / spark-kafka
Low level integration of Spark and Kafka
☆129 · Mar 15, 2018 · Updated 8 years ago
mrsqueeze / spark-hash
Locality Sensitive Hashing for Apache Spark
☆198 · Nov 1, 2016 · Updated 9 years ago
tresata / spark-columnar
☆15 · Mar 4, 2015 · Updated 11 years ago
amplab / training-scripts
Scripts to launch cluster used for Strata
☆33 · Feb 11, 2014 · Updated 12 years ago
brightcove-archive / ooyala_spark-jobserver
REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…
☆345 · May 19, 2017 · Updated 9 years ago
holdenk / spark-testing-base
Base classes to use when writing tests with Spark
☆1,555 · Apr 20, 2026 · Updated 3 months ago
huawei-noah / streamDM
Stream Data Mining Library for Spark Streaming
☆497 · Apr 16, 2023 · Updated 3 years ago
calrissian / spark-jetty-server
Recipes and examples for Apache Spark
☆13 · Jan 21, 2015 · Updated 11 years ago
databricks / spark-avro
Avro Data Source for Apache Spark
☆537 · Dec 19, 2018 · Updated 7 years ago
cloudera / livy
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,007 · Oct 5, 2022 · Updated 3 years ago
AtlasPilotPuppy / SparkAlgorithms
Additional useful algorithms that can be used with spark.
☆24 · Dec 24, 2014 · Updated 11 years ago
lightning-viz / lightning-scala
Scala client for the Lightning data visualization server (WIP)
☆47 · Jun 25, 2019 · Updated 7 years ago
twitter / algebird
Abstract Algebra for Scala
☆2,299 · Nov 21, 2025 · Updated 8 months ago
purduedb / LocationSpark
LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data
☆43 · Jan 6, 2017 · Updated 9 years ago
twitter / chill
Scala extensions for the Kryo serialization library
☆617 · Aug 19, 2024 · Updated last year
amplab / benchmark
Large scale query engine benchmark
☆99 · Apr 5, 2016 · Updated 10 years ago
rxin / jvm-unsafe-utils
Fast JVM collection
☆60 · Mar 8, 2015 · Updated 11 years ago
meetuparchive / archery
2D R-Tree implementation in Scala
☆115 · Oct 8, 2019 · Updated 6 years ago
sameeragarwal / blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
☆660 · Feb 6, 2014 · Updated 12 years ago
databricks / spark-sql-perf
☆623 · Feb 26, 2022 · Updated 4 years ago
Huawei-Spark / Spark-SQL-on-HBase
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
☆316 · Apr 12, 2022 · Updated 4 years ago
spark-notebook / spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
☆3,142 · May 16, 2023 · Updated 3 years ago
apache / incubator-toree
Mirror of Apache Toree (Incubating)
☆750 · Updated this week
krasserm / akka-analytics
Large-scale event processing with Akka Persistence and Apache Spark
☆271 · Jun 18, 2016 · Updated 10 years ago
hbutani / spark-datetime
functionstest
☆33 · Oct 25, 2016 · Updated 9 years ago