src-d / sparkpickle
Pure Python implementation of reading SequenceFile-s with pickles written by Spark's saveAsPickleFile()
☆24Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for sparkpickle
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark☆14Updated 8 years ago
- Spark Parameter Optimization and Tuning☆31Updated 6 years ago
- CrowdRec reference framework☆32Updated 7 years ago
- ☆38Updated 8 years ago
- FluRS: A Python library for streaming recommendation algorithms☆109Updated 2 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Building Annoy Index on Apache Spark☆72Updated 3 years ago
- Scala wrapper for Annoy☆58Updated 2 years ago
- Utilities to work with Scala/Java code with py4j☆40Updated 10 months ago
- functionstest☆33Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Ranking algorithms for Spark machine learning pipeline☆14Updated 6 years ago
- Distributed Streaming Quantiles (for PySpark)☆37Updated 10 years ago
- Factorization Machines on Spark and Glint☆25Updated 8 years ago
- Python client for Spark Jobserver Rest API☆39Updated 4 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 7 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- Data Exploration Using Spark 2.0☆14Updated 6 years ago
- Seldon Spark Jobs☆26Updated 9 years ago
- ☆11Updated 5 years ago
- ☆31Updated 4 years ago
- Asynchronous actions for PySpark☆45Updated 2 years ago
- Sketching data structures for scala, including t-digest☆15Updated 3 years ago
- A tool and library for easily deploying applications on Apache YARN☆142Updated 8 months ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- A framework for writing Spark 2.x applications in a pretty way☆84Updated last year
- Spark ML Lib serving library☆48Updated 6 years ago
- Locality Sensitive Hashing for Apache Spark☆88Updated 2 years ago