lensacom / sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
☆1,153Updated 4 years ago
Alternatives and similar repositories for sparkit-learn:
Users that are interested in sparkit-learn are comparing it to the libraries listed below
- Distributed Deep learning with Keras & Spark☆1,572Updated last year
- A scalable machine learning library on Apache Spark☆793Updated 3 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,346Updated 2 weeks ago
- MLeap: Deploy ML Pipelines to Production☆1,515Updated 3 months ago
- Mirror of Apache Toree (Incubating)☆741Updated last month
- A library for time series analysis on Apache Spark☆1,192Updated 4 years ago
- ☆520Updated 3 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆966Updated 4 months ago
- Sparkling Pandas☆362Updated last year
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,643Updated last year
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs☆1,032Updated this week
- Distributed Neural Networks for Spark☆603Updated 4 years ago
- Python library for converting Scikit-Learn pipelines to PMML☆691Updated this week
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆268Updated 6 months ago
- Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.☆622Updated 6 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,875Updated last year
- Java library and command-line application for converting Scikit-Learn pipelines to PMML☆537Updated this week
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,531Updated 5 months ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,146Updated last year
- Pandas integration with sklearn☆2,824Updated last year
- Tools for exploratory data analysis in Python☆646Updated last year
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆349Updated 4 years ago
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models☆488Updated 7 years ago
- TensorFlow on Spark☆297Updated 7 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,782Updated 3 years ago
- scalable analysis of images and time series☆821Updated 8 years ago
- k-Nearest Neighbors algorithm on Spark☆239Updated last year
- Java library and command-line application for converting Apache Spark ML pipelines to PMML☆267Updated last month
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,063Updated 5 years ago
- REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models☆581Updated 6 months ago