lensacom / sparkit-learnLinks
PySpark + Scikit-learn = Sparkit-learn
☆1,154Updated 4 years ago
Alternatives and similar repositories for sparkit-learn
Users that are interested in sparkit-learn are comparing it to the libraries listed below
Sorting:
- A library for time series analysis on Apache Spark☆1,194Updated 4 years ago
- Distributed Deep learning with Keras & Spark☆1,574Updated 2 years ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,653Updated last year
- Sparkling Water provides H2O functionality inside Spark cluster☆974Updated 6 months ago
- Python library for converting Scikit-Learn pipelines to PMML☆693Updated last week
- MLeap: Deploy ML Pipelines to Production☆1,516Updated 6 months ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,355Updated this week
- A scalable machine learning library on Apache Spark☆795Updated 3 years ago
- Sparkling Pandas☆363Updated last year
- Distributed Neural Networks for Spark☆603Updated 4 years ago
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,067Updated 6 years ago
- Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.☆623Updated 6 years ago
- Mirror of Apache Toree (Incubating)☆743Updated 3 weeks ago
- REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models☆583Updated 8 months ago
- PySpark-Tutorial provides basic algorithms using PySpark☆1,223Updated this week
- Tools for exploratory data analysis in Python☆646Updated last year
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs☆1,054Updated this week
- scalable analysis of images and time series☆823Updated 8 years ago
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models☆490Updated 7 years ago
- TensorFlow on Spark☆297Updated 7 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆269Updated 8 months ago
- k-Nearest Neighbors algorithm on Spark☆240Updated last year
- Python interface to Hive and Presto. 🐝☆1,682Updated 9 months ago
- Java library and command-line application for converting Scikit-Learn pipelines to PMML☆538Updated last week
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Updated 2 years ago
- A set of simple Python scripts for pre-processing large files☆273Updated last year
- Interactive and Reactive Data Science using Scala and Spark.☆3,152Updated 2 years ago
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,530Updated 8 months ago
- Mirror of Apache Hivemall (incubating)☆312Updated 2 years ago
- [UNMAINTAINED] Automated machine learning for analytics & production☆1,648Updated 4 years ago