uber / petastorm
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
☆1,858 Updated last week
Alternatives and similar repositories for petastorm
Users who are interested in petastorm are comparing it to the libraries listed below.
- A low-latency prediction-serving system ☆1,420 Updated 4 years ago
- Open Source ML Model Versioning, Metadata, and Experiment Management ☆1,740 Updated last year
- TFX is an end-to-end platform for deploying production ML pipelines ☆2,163 Updated 3 months ago
- MLeap: Deploy ML Pipelines to Production ☆1,523 Updated 9 months ago
- Library for exploring and validating machine learning data ☆773 Updated 3 months ago
- Distributed Computing for AI Made Simple ☆1,047 Updated 2 years ago
- Hopsworks - Data-Intensive AI platform with a Feature Store ☆1,252 Updated 7 months ago
- Automated Machine Learning on Kubernetes ☆1,628 Updated this week
- For recording and retrieving metadata associated with ML developer and data scientist workflows. ☆659 Updated 5 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da… ☆1,100 Updated last week
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO ☆731 Updated 3 weeks ago
- A model-agnostic visual debugging tool for machine learning ☆1,670 Updated 7 months ago
- Scalable Machine Learning with Dask ☆942 Updated 4 months ago
- Train and run PyTorch models on Apache Spark. ☆340 Updated 2 years ago
- Input pipeline framework ☆988 Updated last month
- High performance model preprocessing library on PyTorch ☆645 Updated last year
- Universal model exchange and serialization format for decision tree forests ☆785 Updated this week
- Hummingbird compiles trained ML models into tensor computation for faster inference. ☆3,481 Updated 2 months ago
- Adaptive Experimentation Platform ☆2,573 Updated last week
- PyTorch elastic training ☆730 Updated 3 years ago
- Experiment tracking, ML developer tools ☆888 Updated 4 months ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,676 Updated 2 weeks ago
- A uniform interface to run deep learning models from multiple frameworks ☆940 Updated last year
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,634 Updated this week
- Multi Model Server is a tool for serving neural net models for inference ☆1,018 Updated last year
- A Redis module for serving tensors and executing deep learning graphs ☆839 Updated last month
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows… ☆2,270 Updated last year
- Model analysis tools for TensorFlow ☆1,269 Updated last month
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop. ☆708 Updated last year
- Gin provides a lightweight configuration framework for Python ☆2,125 Updated this week