uber / petastorm
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
☆1,875 Updated 3 weeks ago
Alternatives and similar repositories for petastorm
Users who are interested in petastorm are comparing it to the libraries listed below.
- A low-latency prediction-serving system ☆1,422 Updated 4 years ago
- Open Source ML Model Versioning, Metadata, and Experiment Management ☆1,745 Updated last year
- Library for exploring and validating machine learning data ☆779 Updated 7 months ago
- MLeap: Deploy ML Pipelines to Production ☆1,530 Updated 2 weeks ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows. ☆670 Updated last week
- TFX is an end-to-end platform for deploying production ML pipelines ☆2,171 Updated 2 weeks ago
- Hopsworks - Data-Intensive AI platform with a Feature Store ☆1,280 Updated 11 months ago
- Automated Machine Learning on Kubernetes ☆1,654 Updated last week
- Distributed Computing for AI Made Simple ☆1,047 Updated 2 years ago
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO ☆735 Updated 2 months ago
- Input pipeline framework ☆989 Updated 5 months ago
- Model analysis tools for TensorFlow ☆1,267 Updated 5 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da… ☆1,135 Updated 3 months ago
- A uniform interface to run deep learning models from multiple frameworks ☆941 Updated 2 years ago
- Scalable Machine Learning with Dask ☆944 Updated 4 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,713 Updated this week
- PyTorch elastic training ☆728 Updated 3 years ago
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop. ☆710 Updated 2 years ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,692 Updated this week
- Integration of TensorFlow with other open-source frameworks ☆1,374 Updated last year
- A model-agnostic visual debugging tool for machine learning ☆1,672 Updated 11 months ago
- Kubeflow’s superfood for Data Scientists ☆660 Updated last week
- Jupyter magics and kernels for working with remote Spark clusters ☆1,364 Updated 4 months ago
- Universal model exchange and serialization format for decision tree forests ☆803 Updated this week
- High performance model preprocessing library on PyTorch ☆647 Updated last year
- Train and run Pytorch models on Apache Spark. ☆342 Updated 2 years ago
- Experiment tracking, ML developer tools ☆898 Updated 9 months ago
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,624 Updated 2 years ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows… ☆2,271 Updated 2 years ago
- The Open Source Feature Store for AI/ML ☆6,661 Updated this week