lensacom / sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
☆1,154Updated 4 years ago
Alternatives and similar repositories for sparkit-learn:
Users that are interested in sparkit-learn are comparing it to the libraries listed below
- Distributed Deep learning with Keras & Spark☆1,574Updated 2 years ago
- MLeap: Deploy ML Pipelines to Production☆1,515Updated 5 months ago
- Sparkling Water provides H2O functionality inside Spark cluster☆972Updated 5 months ago
- A scalable machine learning library on Apache Spark☆793Updated 3 years ago
- Sparkling Pandas☆363Updated last year
- A library for time series analysis on Apache Spark☆1,193Updated 4 years ago
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs☆1,047Updated 2 weeks ago
- ☆522Updated 3 years ago
- Python library for converting Scikit-Learn pipelines to PMML☆692Updated 3 weeks ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,650Updated last year
- Distributed Neural Networks for Spark☆604Updated 4 years ago
- Easy to use library to bring Tensorflow on Apache Spark☆296Updated last year
- Mirror of Apache Toree (Incubating)☆742Updated 2 months ago
- scalable analysis of images and time series☆823Updated 8 years ago
- TensorFlow on Spark☆297Updated 7 years ago
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models☆490Updated 7 years ago
- Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.☆622Updated 6 years ago
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2…☆1,884Updated 2 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,349Updated 2 months ago
- [UNMAINTAINED] Automated machine learning for analytics & production☆1,645Updated 4 years ago
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,528Updated 7 months ago
- Java library and command-line application for converting Apache Spark ML pipelines to PMML☆267Updated 2 months ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,873Updated last year
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆268Updated 8 months ago
- Pandas integration with sklearn☆2,828Updated last year
- Learn the pyspark API through pictures and simple examples☆170Updated 4 years ago
- Machine Learning toolbox for Humans☆696Updated 9 months ago
- Visualize streaming machine learning in Spark☆176Updated 7 years ago
- Stream Data Mining Library for Spark Streaming☆494Updated 2 years ago
- Code base for the Learning PySpark book (in preparation)☆624Updated 6 years ago