huggingface / safetensorsView external linksLinks
Simple, safe way to store and distribute tensors
β3,619Feb 4, 2026Updated last week
Alternatives and similar repositories for safetensors
Users that are interested in safetensors are comparing it to the libraries listed below
Sorting:
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,491Feb 6, 2026Updated last week
- Accessible large language models via k-bit quantization for PyTorch.β7,952Updated this week
- Minimalist ML framework for Rustβ19,395Updated this week
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,291Feb 9, 2026Updated last week
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,336Feb 5, 2026Updated last week
- Large Language Model Text Generation Inferenceβ10,769Jan 8, 2026Updated last month
- Development repository for the Triton language and compilerβ18,429Updated this week
- π₯ Fast State-of-the-Art Tokenizers optimized for Research and Productionβ10,465Feb 9, 2026Updated last week
- Fast and memory-efficient exact attentionβ22,231Updated this week
- Tensor library for machine learningβ13,946Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β20,619Feb 9, 2026Updated last week
- Flax is a neural network library for JAX that is designed for flexibility.β7,077Updated this week
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β32,768Updated this week
- Rust bindings for the C++ api of PyTorch.β5,282Jan 22, 2026Updated 3 weeks ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β41,627Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ34,848Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ70,205Updated this week
- Train transformer language models with reinforcement learning.β17,360Updated this week
- PyTorch native quantization and sparsity for training and inferenceβ2,691Updated this week
- PyTorch extensions for high performance and large scale training.β3,397Apr 26, 2025Updated 9 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)β9,395Jan 26, 2026Updated 3 weeks ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.β17,258Feb 8, 2026Updated last week
- SGLang is a high-performance serving framework for large language models and multimodal models.β23,547Updated this week
- Minimalistic large language model 3D-parallelism trainingβ2,559Updated this week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizatβ¦β12,867Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (Nβ¦β4,704Jan 12, 2026Updated last month
- π€ Evaluate: A library for easily evaluating machine learning models and datasets.β2,413Jan 20, 2026Updated 3 weeks ago
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.β30,823Feb 4, 2026Updated last week
- Transformer related optimization, including BERT, GPTβ6,392Mar 27, 2024Updated last year
- A blazing fast inference solution for text embeddings modelsβ4,495Updated this week
- Ongoing research training transformer models at scaleβ15,213Updated this week
- Build and share delightful machine learning apps, all in Python. π Star to support our work!