uber / petastormLinks
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
☆1,867Updated last week
Alternatives and similar repositories for petastorm
Users that are interested in petastorm are comparing it to the libraries listed below
Sorting:
- A low-latency prediction-serving system☆1,420Updated 4 years ago
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,740Updated last year
- MLeap: Deploy ML Pipelines to Production☆1,528Updated 11 months ago
- Distributed Computing for AI Made Simple☆1,047Updated 2 years ago
- Library for exploring and validating machine learning data☆778Updated 4 months ago
- Automated Machine Learning on Kubernetes☆1,637Updated this week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,258Updated 8 months ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆663Updated 7 months ago
- TFX is an end-to-end platform for deploying production ML pipelines☆2,169Updated last week
- Scalable Machine Learning with Dask☆940Updated last month
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,668Updated this week
- A model-agnostic visual debugging tool for machine learning☆1,670Updated 9 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,107Updated 2 weeks ago
- Train and run Pytorch models on Apache Spark.☆340Updated 2 years ago
- Universal model exchange and serialization format for decision tree forests☆794Updated this week
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop.☆709Updated 2 years ago
- A uniform interface to run deep learning models from multiple frameworks☆939Updated last year
- PyTorch elastic training☆729Updated 3 years ago
- Input pipeline framework☆988Updated 3 months ago
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆733Updated 2 months ago
- Kubeflow’s superfood for Data Scientists☆641Updated last week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,684Updated 2 weeks ago
- Model analysis tools for TensorFlow☆1,267Updated 3 months ago
- Adaptive Experimentation Platform☆2,597Updated last week
- High performance model preprocessing library on PyTorch☆644Updated last year
- An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more☆853Updated 2 weeks ago
- Multi Model Server is a tool for serving neural net models for inference☆1,024Updated last year
- Experiment tracking, ML developer tools☆888Updated 6 months ago
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,496Updated 3 months ago
- MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integra…☆1,607Updated this week