Simple, safe way to store and distribute tensors
β3,671Mar 26, 2026Updated this week
Alternatives and similar repositories for safetensors
Users that are interested in safetensors are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,580Updated this week
- Accessible large language models via k-bit quantization for PyTorch.β8,078Updated this week
- Minimalist ML framework for Rustβ19,833Updated this week
- Large Language Model Text Generation Inferenceβ10,815Mar 21, 2026Updated last week
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,341Mar 13, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,388Mar 18, 2026Updated last week
- Development repository for the Triton language and compilerβ18,781Updated this week
- π₯ Fast State-of-the-Art Tokenizers optimized for Research and Productionβ10,554Mar 20, 2026Updated last week
- Fast and memory-efficient exact attentionβ22,938Updated this week
- Tensor library for machine learningβ14,294Mar 16, 2026Updated last week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β20,841Mar 18, 2026Updated last week
- Flax is a neural network library for JAX that is designed for flexibility.β7,136Updated this week
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β33,157Updated this week
- Rust bindings for the C++ api of PyTorch.β5,331Updated this week
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β41,925Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ74,135Updated this week
- PyTorch extensions for high performance and large scale training.β3,404Apr 26, 2025Updated 11 months ago
- Train transformer language models with reinforcement learning.β17,781Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ35,190Updated this week
- PyTorch native quantization and sparsity for training and inferenceβ2,746Updated this week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)β9,438Feb 20, 2026Updated last month
- Minimalistic large language model 3D-parallelism trainingβ2,626Feb 19, 2026Updated last month
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.β17,683Feb 8, 2026Updated last month
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- π€ Evaluate: A library for easily evaluating machine learning models and datasets.β2,434Mar 10, 2026Updated 2 weeks ago
- SGLang is a high-performance serving framework for large language models and multimodal models.β25,041Updated this week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizatβ¦β13,169Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.β30,974Updated this week
- A blazing fast inference solution for text embeddings modelsβ4,625Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (Nβ¦β4,711Mar 16, 2026Updated last week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.β1,078Apr 17, 2024Updated last year
- Transformer related optimization, including BERT, GPTβ6,400Mar 27, 2024Updated 2 years ago
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,151Updated this week
- NordVPN Threat Protection Proβ’ β’ AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Ongoing research training transformer models at scaleβ15,827Updated this week
- Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.β14,739Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hβ¦β3,246Updated this week
- LLM inference in C/C++β99,811Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.β10,472Updated this week
- PyTorch native post-training libraryβ5,713Updated this week
- A pytorch quantization backend for optimumβ1,034Nov 21, 2025Updated 4 months ago