coreweave / tensorizerLinks
Module, Model, and Tensor Serialization/Deserialization
☆232Updated last week
Alternatives and similar repositories for tensorizer
Users that are interested in tensorizer are comparing it to the libraries listed below
Sorting:
- ☆214Updated this week
- CUDA checkpoint and restore utility☆339Updated 4 months ago
- ☆308Updated 9 months ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆169Updated this week
- A library to analyze PyTorch traces.☆379Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆60Updated 3 weeks ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆366Updated last week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆249Updated this week
- NVIDIA NCCL Tests for Distributed Training☆91Updated last week
- OpenAI compatible API for TensorRT LLM triton backend☆207Updated 10 months ago
- Distributed Model Serving Framework☆168Updated 3 weeks ago
- Controller for ModelMesh☆230Updated 3 weeks ago
- ☆118Updated last year
- High-performance safetensors model loader☆34Updated this week
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆41Updated 3 months ago
- This repository contains the experimental PyTorch native float8 training UX☆222Updated 10 months ago
- ☆34Updated last week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆95Updated 2 weeks ago
- ☆260Updated 2 weeks ago
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆202Updated last month
- The Triton backend for the PyTorch TorchScript models.☆149Updated 2 weeks ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆334Updated this week
- GPUd automates monitoring, diagnostics, and issue identification for GPUs☆362Updated this week
- The Triton backend for the ONNX Runtime.☆148Updated 2 weeks ago
- PyTorch per step fault tolerance (actively under development)☆302Updated this week
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆196Updated this week
- Google TPU optimizations for transformers models☆112Updated 4 months ago
- ☆193Updated 3 weeks ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆186Updated this week