uber / petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
☆1,810Updated last year
Alternatives and similar repositories for petastorm:
Users that are interested in petastorm are comparing it to the libraries listed below
- Automated Machine Learning on Kubernetes☆1,531Updated this week
- MLeap: Deploy ML Pipelines to Production☆1,504Updated last month
- TFX is an end-to-end platform for deploying production ML pipelines☆2,122Updated 3 weeks ago
- A low-latency prediction-serving system☆1,407Updated 3 years ago
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,710Updated 5 months ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆633Updated 2 months ago
- Scalable Machine Learning with Dask☆912Updated last month
- Library for exploring and validating machine learning data☆768Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,430Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,591Updated this week
- Distributed Computing for AI Made Simple☆1,041Updated last year
- PyTorch elastic training☆730Updated 2 years ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,063Updated 4 months ago
- Universal model exchange and serialization format for decision tree forests☆750Updated this week
- Experiment tracking, ML developer tools☆874Updated last year
- Extended pickling support for Python objects☆1,688Updated this week
- High performance model preprocessing library on PyTorch☆651Updated 9 months ago
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,372Updated last week
- A uniform interface to run deep learning models from multiple frameworks☆937Updated last year
- A model-agnostic visual debugging tool for machine learning☆1,650Updated last year
- Adaptive Experimentation Platform☆2,406Updated this week
- Integration of TensorFlow with other open-source frameworks☆1,373Updated 3 months ago
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop.☆706Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,763Updated 2 years ago
- Model analysis tools for TensorFlow☆1,259Updated last week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,188Updated 2 months ago
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆715Updated this week
- Input pipeline framework☆985Updated this week