uber / petastormLinks
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
☆1,871Updated last week
Alternatives and similar repositories for petastorm
Users that are interested in petastorm are comparing it to the libraries listed below
Sorting:
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,744Updated last year
- A low-latency prediction-serving system☆1,421Updated 4 years ago
- MLeap: Deploy ML Pipelines to Production☆1,530Updated 3 weeks ago
- TFX is an end-to-end platform for deploying production ML pipelines☆2,168Updated 2 weeks ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆667Updated 9 months ago
- Library for exploring and validating machine learning data☆780Updated 6 months ago
- Distributed Computing for AI Made Simple☆1,047Updated 2 years ago
- Automated Machine Learning on Kubernetes☆1,648Updated last week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,272Updated 10 months ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,135Updated 2 months ago
- Scalable Machine Learning with Dask☆944Updated 3 months ago
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆735Updated last month
- A model-agnostic visual debugging tool for machine learning☆1,672Updated 11 months ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,688Updated last week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,704Updated this week
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,519Updated 5 months ago
- Universal model exchange and serialization format for decision tree forests☆800Updated 2 weeks ago
- A uniform interface to run deep learning models from multiple frameworks☆941Updated 2 years ago
- High performance model preprocessing library on PyTorch☆647Updated last year
- Model analysis tools for TensorFlow☆1,268Updated 5 months ago
- Experiment tracking, ML developer tools☆894Updated 8 months ago
- Kubeflow’s superfood for Data Scientists☆657Updated 3 weeks ago
- PyTorch elastic training☆728Updated 3 years ago
- Input pipeline framework☆990Updated 5 months ago
- Multi Model Server is a tool for serving neural net models for inference☆1,025Updated last year
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop.☆710Updated 2 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,132Updated last week
- Adaptive Experimentation Platform☆2,679Updated last week
- Elyra extends JupyterLab with an AI centric approach.☆1,974Updated last month
- Integration of TensorFlow with other open-source frameworks☆1,373Updated last year