dmmiller612 / sparktorch
Train and run Pytorch models on Apache Spark.
☆339Updated last year
Related projects ⓘ
Alternatives and complementary repositories for sparktorch
- Joblib Apache Spark Backend☆242Updated 2 months ago
- Distributed scikit-learn meta-estimators in PySpark☆285Updated 6 months ago
- ☆337Updated 3 years ago
- Read and write Tensorflow TFRecord data from Apache Spark.☆290Updated 6 months ago
- A deep ranking personalization framework☆132Updated last year
- Easy to use library to bring Tensorflow on Apache Spark☆298Updated last year
- Distributed XGBoost on Ray☆143Updated 4 months ago
- Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data☆475Updated 3 years ago
- Universal model exchange and serialization format for decision tree forests☆738Updated this week
- A Tensorflow 2.0 implementation of TabNet.☆239Updated last year
- Factorization Machines for Recommendation and Ranking Problems with Implicit Feedback Data☆171Updated 2 months ago
- OpenML AutoML Benchmarking Framework☆405Updated this week
- High performance model preprocessing library on PyTorch☆648Updated 7 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,047Updated 2 months ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆229Updated 2 weeks ago
- Coarse-grained lineage and tracing for machine learning pipelines.☆466Updated last year
- Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)☆325Updated last year
- A collection of Machine Learning examples to get started with deploying RAPIDS in the Cloud☆138Updated last week
- ThunderGBM: Fast GBDTs and Random Forests on GPUs☆693Updated 9 months ago
- Drift Detection for your PyTorch Models☆312Updated 2 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,798Updated 11 months ago
- Pytorch Lightning Distributed Accelerators using Ray☆211Updated last year
- Library for exploring and validating machine learning data☆763Updated last week
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆625Updated 2 weeks ago
- Solutions to Recommender Systems competitions☆199Updated 2 years ago
- A scalable nearest neighbor search library in Apache Spark☆262Updated 5 years ago
- MLOps Platform☆271Updated last week
- Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries☆706Updated 3 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated last year