lensacom / sparkit-learnLinks
PySpark + Scikit-learn = Sparkit-learn
☆1,154Updated 4 years ago
Alternatives and similar repositories for sparkit-learn
Users that are interested in sparkit-learn are comparing it to the libraries listed below
Sorting:
- A library for time series analysis on Apache Spark☆1,195Updated 5 years ago
- Sparkling Pandas☆364Updated 2 years ago
- Distributed Deep learning with Keras & Spark☆1,577Updated 2 years ago
- ☆525Updated 3 weeks ago
- A scalable machine learning library on Apache Spark☆796Updated 4 years ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,667Updated last year
- Sparkling Water provides H2O functionality inside Spark cluster☆977Updated last month
- Python library for converting Scikit-Learn pipelines to PMML☆698Updated 2 weeks ago
- MLeap: Deploy ML Pipelines to Production☆1,528Updated this week
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,531Updated last year
- Learn the pyspark API through pictures and simple examples☆170Updated 4 years ago
- REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models☆587Updated 2 weeks ago
- Mirror of Apache Toree (Incubating)☆749Updated 3 weeks ago
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs☆1,118Updated 2 weeks ago
- Distributed Neural Networks for Spark☆608Updated 5 years ago
- Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.☆623Updated 7 years ago
- scalable analysis of images and time series☆824Updated 8 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Updated 3 months ago
- Java library and command-line application for converting Scikit-Learn pipelines to PMML☆539Updated 2 weeks ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated last year
- Spark 2.0 Python Machine Learning examples☆98Updated 6 years ago
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)☆741Updated 4 months ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆346Updated 4 years ago
- A pure python HDFS client☆859Updated 3 years ago
- Code base for the Learning PySpark book (in preparation)☆628Updated 6 years ago
- Visualize streaming machine learning in Spark☆177Updated 8 years ago
- TensorFlow on Spark☆296Updated 8 years ago
- Stream Data Mining Library for Spark Streaming☆496Updated 2 years ago
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2…☆1,890Updated 3 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,784Updated 4 years ago