uber / petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
☆1,811Updated last year
Alternatives and similar repositories for petastorm:
Users that are interested in petastorm are comparing it to the libraries listed below
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,714Updated 6 months ago
- A low-latency prediction-serving system☆1,407Updated 3 years ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,064Updated 4 months ago
- Automated Machine Learning on Kubernetes☆1,535Updated this week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,188Updated 2 months ago
- TFX is an end-to-end platform for deploying production ML pipelines☆2,127Updated last month
- MLeap: Deploy ML Pipelines to Production☆1,508Updated 2 months ago
- Library for exploring and validating machine learning data☆768Updated last week
- High performance model preprocessing library on PyTorch☆651Updated 10 months ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆633Updated 3 months ago
- Distributed Computing for AI Made Simple☆1,041Updated last year
- A model-agnostic visual debugging tool for machine learning☆1,650Updated last year
- Scalable Machine Learning with Dask☆916Updated 2 months ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,597Updated this week
- cuML - RAPIDS Machine Learning Library☆4,376Updated this week
- The Open Source Feature Store for Machine Learning☆5,762Updated this week
- Extended pickling support for Python objects☆1,692Updated 2 weeks ago
- PyTorch elastic training☆730Updated 2 years ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,437Updated this week
- A uniform interface to run deep learning models from multiple frameworks☆936Updated last year
- Pytorch domain library for recommendation systems☆2,021Updated this week
- Kubeflow’s superfood for Data Scientists☆631Updated 2 years ago
- Universal model exchange and serialization format for decision tree forests☆750Updated 2 weeks ago
- Algorithms for explaining machine learning models☆2,434Updated last month
- Input pipeline framework☆984Updated last week
- Multi Model Server is a tool for serving neural net models for inference☆1,002Updated 8 months ago
- Algorithms for outlier, adversarial and drift detection☆2,288Updated last week
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,763Updated 2 years ago
- A system for quickly generating training data with weak supervision☆5,824Updated 8 months ago
- Model analysis tools for TensorFlow☆1,259Updated 3 weeks ago