Secondary sort and streaming reduce for Apache Spark
☆78 Jul 3, 2023 Updated 2 years ago
Alternatives and similar repositories for spark-sorted
Users who are interested in spark-sorted are comparing it to the libraries listed below.
- Joins for skewed datasets in Spark ☆57 Aug 18, 2017 Updated 8 years ago
- An efficient updatable key-value store for Apache Spark ☆254 Mar 11, 2017 Updated 8 years ago
- Low level integration of Spark and Kafka ☆130 Mar 15, 2018 Updated 7 years ago
- Benchmarks of artificial neural network library for Spark MLlib ☆11 Dec 3, 2015 Updated 10 years ago
- Additional useful algorithms that can be used with spark. ☆24 Dec 24, 2014 Updated 11 years ago
- something to help you spark ☆64 Oct 23, 2018 Updated 7 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆475 Apr 18, 2017 Updated 8 years ago
- Another, hopefully better, implementation of ALS on Spark ☆14 May 20, 2015 Updated 10 years ago
- MLeap allows for easily putting Spark ML pipelines into production ☆78 Oct 27, 2016 Updated 9 years ago
- A library for time series analysis on Apache Spark ☆1,196 Oct 13, 2020 Updated 5 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆242 Mar 26, 2015 Updated 10 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs. ☆22 Dec 29, 2018 Updated 7 years ago
- spray based client for aws ☆38 May 4, 2016 Updated 9 years ago
- Automatic offload of user-written Spark kernels to accelerators ☆18 Oct 25, 2016 Updated 9 years ago
- Apache Spark jobs such as Principal Coordinate Analysis. ☆75 Jan 30, 2017 Updated 9 years ago
- Distributed Streaming Quantiles (for PySpark) ☆38 Jan 30, 2014 Updated 12 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 Dec 28, 2016 Updated 9 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark ☆146 Jan 26, 2016 Updated 10 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015. ☆10 Jun 1, 2015 Updated 10 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 Sep 9, 2015 Updated 10 years ago
- Regularized latent variable mixed membership modeling ☆13 Aug 12, 2013 Updated 12 years ago
- ☆18 Sep 7, 2014 Updated 11 years ago
- Distributed Tensorflow Implementation of Asynchronous DDPG ☆12 Oct 25, 2017 Updated 8 years ago
- open source version of the Bonsai library ☆26 Feb 4, 2016 Updated 10 years ago
- Scala EDN parser based on Parboiled2 ☆38 Aug 12, 2015 Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆43 Jan 17, 2016 Updated 10 years ago
- Splash Project for parallel stochastic learning ☆93 Jun 16, 2017 Updated 8 years ago
- Skeleton project with static analysis provided by scalac switches, Wartremover, and Scalastyle ☆29 Nov 6, 2016 Updated 9 years ago
- Reactive Factorization Engine ☆104 Feb 18, 2015 Updated 11 years ago
- Argument parsing in Scala ☆84 Mar 27, 2023 Updated 2 years ago
- Power a Spark Stream from anywhere in your Akka Stream Flow ☆12 Mar 1, 2016 Updated 10 years ago
- An R-like GLM package for Apache Spark ☆10 Aug 6, 2015 Updated 10 years ago
- kamon netty integration ☆10 Aug 30, 2020 Updated 5 years ago
- Data science repo to help others ☆12 Feb 10, 2016 Updated 10 years ago
- Command line tool that transpiles scala code into java code. ☆12 Sep 26, 2015 Updated 10 years ago
- unix domain sockets that look just like tcp sockets ☆11 Jun 21, 2018 Updated 7 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support… ☆108 Feb 1, 2018 Updated 8 years ago
- Java 8 and Spark learning through examples ☆43 Nov 10, 2017 Updated 8 years ago
- Akka cluster management using etcd ☆67 Apr 18, 2016 Updated 9 years ago