lensacom / sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
☆1,154Updated 3 years ago
Alternatives and similar repositories for sparkit-learn:
Users that are interested in sparkit-learn are comparing it to the libraries listed below
- A library for time series analysis on Apache Spark☆1,193Updated 4 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆967Updated 2 weeks ago
- ☆512Updated 2 years ago
- ☆1,002Updated this week
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,638Updated 8 months ago
- Sparkling Pandas☆362Updated last year
- Distributed Deep learning with Keras & Spark☆1,574Updated last year
- A scalable machine learning library on Apache Spark☆792Updated 3 years ago
- Python library for converting Scikit-Learn pipelines to PMML☆688Updated last month
- Mirror of Apache Toree (Incubating)☆740Updated 3 weeks ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,331Updated last week
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆262Updated 2 months ago
- Distributed Neural Networks for Spark☆604Updated 4 years ago
- MLeap: Deploy ML Pipelines to Production☆1,506Updated this week
- scalable analysis of images and time series☆815Updated 7 years ago
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models☆489Updated 7 years ago
- Pandas integration with sklearn☆2,815Updated last year
- Examples for High Performance Spark☆505Updated last month
- Learn the pyspark API through pictures and simple examples☆168Updated 3 years ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,155Updated last year
- Easy to use library to bring Tensorflow on Apache Spark☆298Updated last year
- Docker build for Apache Spark☆673Updated 2 years ago
- A Time Series Library for Apache Spark☆1,006Updated 4 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,787Updated 3 years ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,485Updated 2 weeks ago