NVIDIA / multi-storage-clientLinks
Unified high-performance Python client for object and file stores.
☆53Updated 2 weeks ago
Alternatives and similar repositories for multi-storage-client
Users that are interested in multi-storage-client are comparing it to the libraries listed below
Sorting:
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆462Updated last week
- A tool to configure, launch and manage your machine learning experiments.☆212Updated this week
- Container plugin for Slurm Workload Manager☆405Updated 2 weeks ago
- Scalable and Performant Data Loading☆359Updated this week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆276Updated last month
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆409Updated this week
- Load compute kernels from the Hub☆354Updated 2 weeks ago
- Speed up model training by fixing data loading.☆566Updated 2 weeks ago
- Megatron's multi-modal data loader☆299Updated 2 weeks ago
- Where GPUs get cooked 👩🍳🔥☆343Updated 3 months ago
- The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.☆195Updated last week
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆217Updated 3 weeks ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆241Updated last week
- Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support☆223Updated this week
- Pipeline Parallelism for PyTorch☆783Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆162Updated last week
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA…☆159Updated last week
- Provide Python access to the NVML library for GPU diagnostics☆253Updated 3 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Updated 2 weeks ago
- PyTorch Single Controller☆937Updated this week
- TPU inference for vLLM, with unified JAX and PyTorch support.☆205Updated this week
- Module, Model, and Tensor Serialization/Deserialization☆282Updated 4 months ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆103Updated this week
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆543Updated 2 weeks ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆397Updated 6 months ago
- A library to analyze PyTorch traces.☆452Updated 2 weeks ago
- ☆274Updated 2 weeks ago
- ☆149Updated last month
- JAX-Toolbox☆370Updated this week
- This repository contains the experimental PyTorch native float8 training UX☆227Updated last year