uber / petastorm
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
☆1,839 Updated last year
Alternatives and similar repositories for petastorm
Users who are interested in petastorm are comparing it to the libraries listed below.
- Open Source ML Model Versioning, Metadata, and Experiment Management ☆1,723 Updated 10 months ago
- A low-latency prediction-serving system ☆1,416 Updated 4 years ago
- TFX is an end-to-end platform for deploying production ML pipelines ☆2,141 Updated last week
- MLeap: Deploy ML Pipelines to Production ☆1,516 Updated 6 months ago
- Hopsworks - Data-Intensive AI platform with a Feature Store ☆1,225 Updated 3 months ago
- Distributed Computing for AI Made Simple ☆1,043 Updated 2 years ago
- Library for exploring and validating machine learning data ☆771 Updated last week
- PyTorch elastic training ☆729 Updated 2 years ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows. ☆649 Updated last month
- Automated Machine Learning on Kubernetes ☆1,580 Updated 2 weeks ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,531 Updated this week
- The Open Source Feature Store for AI/ML ☆6,108 Updated this week
- Hummingbird compiles trained ML models into tensor computation for faster inference. ☆3,440 Updated last month
- Scalable Machine Learning with Dask ☆938 Updated 3 weeks ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,641 Updated last month
- A model-agnostic visual debugging tool for machine learning ☆1,666 Updated 3 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da… ☆1,083 Updated 8 months ago
- Universal model exchange and serialization format for decision tree forests ☆772 Updated this week
- A uniform interface to run deep learning models from multiple frameworks ☆934 Updated last year
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF. ☆1,971 Updated 2 years ago
- Model analysis tools for TensorFlow ☆1,265 Updated last month
- High performance model preprocessing library on PyTorch ☆650 Updated last year
- A multi-model machine learning feature embedding database ☆638 Updated 5 years ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows… ☆2,259 Updated last year
- Multi Model Server is a tool for serving neural net models for inference ☆1,010 Updated last year
- PyTorch extensions for high performance and large scale training. ☆3,322 Updated last month
- cuML - RAPIDS Machine Learning Library ☆4,727 Updated this week
- Adaptive Experimentation Platform ☆2,492 Updated this week
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop. ☆706 Updated last year
- Input pipeline framework ☆986 Updated last month