Simple, safe way to store and distribute tensors
β3,645Feb 27, 2026Updated last week
Alternatives and similar repositories for safetensors
Users that are interested in safetensors are comparing it to the libraries listed below
Sorting:
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,528Updated this week
- Accessible large language models via k-bit quantization for PyTorch.β8,019Updated this week
- Minimalist ML framework for Rustβ19,600Updated this week
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,310Feb 9, 2026Updated last month
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,356Feb 20, 2026Updated 2 weeks ago
- Large Language Model Text Generation Inferenceβ10,795Jan 8, 2026Updated 2 months ago
- Development repository for the Triton language and compilerβ18,573Updated this week
- π₯ Fast State-of-the-Art Tokenizers optimized for Research and Productionβ10,497Feb 28, 2026Updated last week
- Fast and memory-efficient exact attentionβ22,460Updated this week
- Tensor library for machine learningβ14,195Feb 27, 2026Updated last week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β20,717Updated this week
- Flax is a neural network library for JAX that is designed for flexibility.β7,105Updated this week
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β32,923Updated this week
- Rust bindings for the C++ api of PyTorch.β5,306Jan 22, 2026Updated last month
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β41,759Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ34,987Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ71,883Updated this week
- Train transformer language models with reinforcement learning.β17,523Updated this week
- PyTorch native quantization and sparsity for training and inferenceβ2,722Updated this week
- PyTorch extensions for high performance and large scale training.β3,400Apr 26, 2025Updated 10 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)β9,415Feb 20, 2026Updated 2 weeks ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.β17,473Feb 8, 2026Updated last month
- SGLang is a high-performance serving framework for large language models and multimodal models.β24,216Updated this week
- Minimalistic large language model 3D-parallelism trainingβ2,588Feb 19, 2026Updated 2 weeks ago
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizatβ¦β12,993Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (Nβ¦β4,706Feb 27, 2026Updated last week
- π€ Evaluate: A library for easily evaluating machine learning models and datasets.β2,422Jan 20, 2026Updated last month
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.β30,906Updated this week
- Transformer related optimization, including BERT, GPTβ6,398Mar 27, 2024Updated last year
- Ongoing research training transformer models at scaleβ15,535Updated this week
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β41,921Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.β10,406Updated this week
- A blazing fast inference solution for text embeddings modelsβ4,553Feb 25, 2026Updated last week
- PyTorch native post-training libraryβ5,697Updated this week
- Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.β14,535Updated this week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.β1,075Apr 17, 2024Updated last year
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Autoβ¦β16,843Updated this week
- LLM inference in C/C++β96,322Mar 2, 2026Updated last week
- Extremely fast Query Engine for DataFrames, written in Rustβ37,654Updated this week