coreweave / tensorizer
Module, Model, and Tensor Serialization/Deserialization
☆217Updated 3 weeks ago
Alternatives and similar repositories for tensorizer:
Users that are interested in tensorizer are comparing it to the libraries listed below
- ☆169Updated this week
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆350Updated this week
- CUDA checkpoint and restore utility☆305Updated last month
- ☆30Updated this week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆290Updated this week
- ☆296Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆261Updated 5 months ago
- ☆136Updated 2 weeks ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆175Updated this week
- A library to analyze PyTorch traces.☆342Updated this week
- ☆235Updated this week
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆289Updated last month
- Pipeline Parallelism for PyTorch☆757Updated 6 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆154Updated 3 months ago
- FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.☆755Updated 6 months ago
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆106Updated this week
- PyTorch per step fault tolerance (actively under development)☆262Updated this week
- ☆199Updated last year
- A top-like tool for monitoring GPUs in a cluster☆85Updated last year
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆85Updated last year
- Home for OctoML PyTorch Profiler☆107Updated last year
- Benchmark suite for LLMs from Fireworks.ai☆69Updated last month
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆96Updated 2 weeks ago
- NVIDIA NCCL Tests for Distributed Training☆82Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆120Updated 2 weeks ago
- OpenAI compatible API for TensorRT LLM triton backend☆200Updated 7 months ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆90Updated this week
- ☆54Updated 5 months ago
- The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.☆147Updated this week
- Distributed Model Serving Framework☆158Updated 2 weeks ago