uber / petastormLinks
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
☆1,847Updated last year
Alternatives and similar repositories for petastorm
Users that are interested in petastorm are comparing it to the libraries listed below
Sorting:
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,729Updated 11 months ago
- A low-latency prediction-serving system☆1,416Updated 4 years ago
- Automated Machine Learning on Kubernetes☆1,609Updated this week
- TFX is an end-to-end platform for deploying production ML pipelines☆2,150Updated last month
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆651Updated 3 months ago
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,237Updated 5 months ago
- Library for exploring and validating machine learning data☆772Updated 3 weeks ago
- MLeap: Deploy ML Pipelines to Production☆1,515Updated 7 months ago
- Distributed Computing for AI Made Simple☆1,045Updated 2 years ago
- A model-agnostic visual debugging tool for machine learning☆1,668Updated 5 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,094Updated 10 months ago
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,446Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,655Updated 2 weeks ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,582Updated this week
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆731Updated last month
- Scalable Machine Learning with Dask☆940Updated 2 months ago
- A uniform interface to run deep learning models from multiple frameworks☆935Updated last year
- Universal model exchange and serialization format for decision tree forests☆778Updated this week
- Kubeflow’s superfood for Data Scientists☆639Updated this week
- PyTorch elastic training☆728Updated 3 years ago
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop.☆706Updated last year
- Adaptive Experimentation Platform☆2,531Updated this week
- Train and run Pytorch models on Apache Spark.☆339Updated 2 years ago
- MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integra…☆1,558Updated this week
- Model analysis tools for TensorFlow☆1,268Updated this week
- Elyra extends JupyterLab with an AI centric approach.☆1,940Updated last month
- High performance model preprocessing library on PyTorch☆649Updated last year
- The Open Source Feature Store for AI/ML☆6,220Updated this week
- Experiment tracking, ML developer tools☆883Updated 2 months ago
- Multi Model Server is a tool for serving neural net models for inference☆1,012Updated last year