uber / petastormView on GitHub
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
1,879Jan 2, 2026Updated last month

Alternatives and similar repositories for petastorm

Users that are interested in petastorm are comparing it to the libraries listed below

Sorting:

Are these results useful?