uber / petastorm
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
☆1,867 · Updated 3 weeks ago
Alternatives and similar repositories for petastorm
Users who are interested in petastorm are comparing it to the libraries listed below.
- A low-latency prediction-serving system ☆1,419 · Updated 4 years ago
- MLeap: Deploy ML Pipelines to Production ☆1,529 · Updated last year
- Open Source ML Model Versioning, Metadata, and Experiment Management ☆1,741 · Updated last year
- TFX is an end-to-end platform for deploying production ML pipelines ☆2,170 · Updated last month
- Library for exploring and validating machine learning data ☆778 · Updated 5 months ago
- Automated Machine Learning on Kubernetes ☆1,642 · Updated this week
- Hopsworks - Data-Intensive AI platform with a Feature Store ☆1,258 · Updated 9 months ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows. ☆666 · Updated 7 months ago
- Distributed Computing for AI Made Simple ☆1,047 · Updated 2 years ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da… ☆1,130 · Updated last month
- Scalable Machine Learning with Dask ☆942 · Updated 2 months ago
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop. ☆709 · Updated 2 years ago
- Universal model exchange and serialization format for decision tree forests ☆795 · Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,683 · Updated this week
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO ☆734 · Updated this week
- A model-agnostic visual debugging tool for machine learning ☆1,669 · Updated 9 months ago
- High performance model preprocessing library on PyTorch ☆646 · Updated last year
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,684 · Updated 2 weeks ago
- Model analysis tools for TensorFlow ☆1,268 · Updated 3 months ago
- Adaptive Experimentation Platform ☆2,625 · Updated last week
- Kubeflow’s superfood for Data Scientists ☆647 · Updated last week
- PyTorch elastic training ☆729 · Updated 3 years ago
- A uniform interface to run deep learning models from multiple frameworks ☆941 · Updated last year
- Train and run Pytorch models on Apache Spark. ☆341 · Updated 2 years ago
- Hummingbird compiles trained ML models into tensor computation for faster inference. ☆3,499 · Updated 4 months ago
- Extended pickling support for Python objects ☆1,860 · Updated 3 weeks ago
- Input pipeline framework ☆989 · Updated 3 months ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,362 · Updated 2 months ago
- Integration of TensorFlow with other open-source frameworks ☆1,373 · Updated last year
- Experiment tracking, ML developer tools ☆888 · Updated 7 months ago