danielenricocahall / elephas
Distributed Deep learning with Keras & Spark
☆19Updated 4 months ago
Alternatives and similar repositories for elephas:
Users that are interested in elephas are comparing it to the libraries listed below
- Joblib Apache Spark Backend☆245Updated 2 weeks ago
- Train and run Pytorch models on Apache Spark.☆339Updated last year
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- A simplified version of featuretools for Spark☆31Updated 5 years ago
- Spark ML implementation of SOM algorithm (Kohonen self-organizing map)☆18Updated 3 years ago
- ☆79Updated 4 years ago
- Easy converter pandas -> tfrecords & tfrecords -> pandas☆38Updated 2 years ago
- Pipeline Profiler is a tool for visualizing machine learning pipelines generated by AutoML tools.☆84Updated last year
- Distributed XGBoost on Ray☆148Updated 10 months ago
- ☆98Updated last week
- python library for automated dataset normalization☆114Updated last year
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- A collection of Machine Learning examples to get started with deploying RAPIDS in the Cloud☆141Updated 5 months ago
- General Interpretability Package☆58Updated 2 years ago
- an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)☆17Updated 6 years ago
- 🍦 Deployment tool for online machine learning models☆97Updated 2 years ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated last year
- LightGBM on Ray☆47Updated last year
- Random stuff I've been working on☆28Updated last year
- Using MLflow with a PostgreSQL Database Tracking URI and a Minio Artifact URI, and MLflow Registry☆12Updated 4 years ago
- Automated Data Science and Machine Learning library to optimize workflow.☆104Updated 2 years ago
- Distributed scikit-learn meta-estimators in PySpark☆284Updated last week
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 4 years ago
- A Python wrapper for XGBoost4J-Spark classes.☆47Updated last year
- OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments…☆27Updated last year
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆57Updated 3 years ago
- Time Series Forecasting Framework☆41Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆51Updated 4 years ago
- Nyoka is a Python library that helps to export ML models into PMML (PMML 4.4.1 Standard).☆185Updated last year
- Implementation of the Adaptive XGBoost classifier for evolving data streams☆43Updated 4 years ago