coreweave / tensorizerLinks

Module, Model, and Tensor Serialization/Deserialization

☆250

Alternatives and similar repositories for tensorizer

Users that are interested in tensorizer are comparing it to the libraries listed below

Sorting:

run-ai / runai-model-streamer
☆231Updated this week
imbue-ai / cluster-health
☆313Updated 11 months ago
pytorch / torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆372Updated this week
NVIDIA / cuda-checkpoint
CUDA checkpoint and restore utility
☆353Updated 6 months ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆378Updated this week
AI-Hypercomputer / JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…
☆364Updated last month
NVIDIA / nvidia-resiliency-ext
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …
☆194Updated last week
facebookresearch / HolisticTraceAnalysis
A library to analyze PyTorch traces.
☆400Updated this week
neuralmagic / nm-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆265Updated 9 months ago
leptonai / gpud
GPUd automates monitoring, diagnostics, and issue identification for GPUs
☆401Updated this week
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago
foundation-model-stack / fastsafetensors
High-performance safetensors model loader
☆52Updated 2 weeks ago
anyscale / llm-continuous-batching-benchmarks
☆120Updated last year
huggingface / optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…
☆307Updated 2 months ago
huggingface / kernels
Load compute kernels from the Hub
☆214Updated last week
neuralmagic / compressed-tensors
A safetensors extension to efficiently store sparse quantized tensors on disk
☆141Updated this week
intel / llm-on-ray
Pretrain, finetune and serve LLMs on Intel platforms with Ray
☆128Updated 3 weeks ago
pytorch-labs / float8_experimental
This repository contains the experimental PyTorch native float8 training UX
☆224Updated last year
fw-ai / benchmark
Benchmark suite for LLMs from Fireworks.ai
☆76Updated 3 weeks ago
foundation-model-stack / fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…
☆258Updated last week
pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆158Updated last month
neuralmagic / AutoFP8
☆195Updated 2 months ago
huggingface / optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
☆191Updated this week
coreweave / ml-containers
☆38Updated this week
npuichigo / openai_trtllm
OpenAI compatible API for TensorRT LLM triton backend
☆209Updated last year
microsoft / varuna
☆251Updated last year
google / saxml
☆142Updated 2 weeks ago
AI-Hypercomputer / maxdiffusion
☆237Updated this week
foundation-model-stack / foundation-model-stack
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
☆206Updated this week
IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆61Updated 2 months ago