dmmiller612 / sparktorch
Train and run Pytorch models on Apache Spark.
☆339Updated last year
Alternatives and similar repositories for sparktorch:
Users that are interested in sparktorch are comparing it to the libraries listed below
- Joblib Apache Spark Backend☆245Updated 3 weeks ago
- Read and write Tensorflow TFRecord data from Apache Spark.☆293Updated last year
- ☆346Updated 3 years ago
- Easy to use library to bring Tensorflow on Apache Spark☆296Updated last year
- Distributed scikit-learn meta-estimators in PySpark☆284Updated last week
- A scalable nearest neighbor search library in Apache Spark☆262Updated 6 years ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆239Updated last month
- Factorization Machines for Recommendation and Ranking Problems with Implicit Feedback Data☆176Updated 8 months ago
- Distributed XGBoost on Ray☆148Updated 10 months ago
- ThunderGBM: Fast GBDTs and Random Forests on GPUs☆699Updated last month
- Pytorch Lightning Distributed Accelerators using Ray☆210Updated last year
- A deep ranking personalization framework☆134Updated last year
- High performance model preprocessing library on PyTorch☆650Updated last year
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,081Updated 8 months ago
- XGBoost GPU accelerated on Spark example applications☆52Updated 2 years ago
- A Tensorflow 2.0 implementation of TabNet.☆242Updated 2 years ago
- Randomized SVD of large sparse matrices on Spark☆77Updated 2 years ago
- Isolation Forest on Spark☆227Updated 6 months ago
- Solutions to Recommender Systems competitions☆200Updated 2 years ago
- A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch☆1,344Updated 2 months ago
- Universal model exchange and serialization format for decision tree forests☆766Updated last week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,832Updated last year
- ☆95Updated last year
- A collection of Machine Learning examples to get started with deploying RAPIDS in the Cloud☆141Updated 6 months ago
- Drift Detection for your PyTorch Models☆316Updated 2 years ago
- Train TensorFlow models on YARN in just a few lines of code!☆88Updated last year
- Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data☆486Updated 4 years ago
- ☆126Updated last month
- Spark On Angel, arming Spark with a powerful Parameter Server, which enable Spark to train very big models☆84Updated 2 years ago
- MLeap: Deploy ML Pipelines to Production☆1,515Updated 5 months ago