uber / petastormLinks
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
☆1,840Updated last year
Alternatives and similar repositories for petastorm
Users that are interested in petastorm are comparing it to the libraries listed below
Sorting:
- A low-latency prediction-serving system☆1,416Updated 4 years ago
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,723Updated 10 months ago
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,443Updated 2 months ago
- Automated Machine Learning on Kubernetes☆1,585Updated last week
- Library for exploring and validating machine learning data☆771Updated last week
- MLeap: Deploy ML Pipelines to Production☆1,515Updated 6 months ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,645Updated last week
- Scalable Machine Learning with Dask☆938Updated last month
- A model-agnostic visual debugging tool for machine learning☆1,667Updated 4 months ago
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,228Updated 4 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,554Updated this week
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆651Updated 2 months ago
- Distributed Computing for AI Made Simple☆1,043Updated 2 years ago
- TFX is an end-to-end platform for deploying production ML pipelines☆2,148Updated this week
- A system for quickly generating training data with weak supervision☆5,870Updated last year
- Algorithms for explaining machine learning models☆2,521Updated last week
- A uniform interface to run deep learning models from multiple frameworks☆934Updated last year
- The Open Source Feature Store for AI/ML☆6,144Updated this week
- Algorithms for outlier, adversarial and drift detection☆2,386Updated 2 weeks ago
- Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.☆4,313Updated 6 months ago
- Input pipeline framework☆985Updated last week
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,771Updated 2 months ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,354Updated 3 weeks ago
- Source code/webpage/demos for the What-If Tool☆957Updated 9 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,089Updated 9 months ago
- Model analysis tools for TensorFlow☆1,265Updated this week
- High performance model preprocessing library on PyTorch☆650Updated last year
- Kubeflow’s superfood for Data Scientists☆635Updated 2 weeks ago
- 📚 Parameterize, execute, and analyze notebooks☆6,197Updated 2 months ago
- PyTorch elastic training☆728Updated 3 years ago