Low-bit optimizers for PyTorch
β139Oct 9, 2023Updated 2 years ago
Alternatives and similar repositories for low-bit-optimizers
Users that are interested in low-bit-optimizers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-trainingβ40May 4, 2026Updated last month
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. π The official implementation of https://arxβ¦β29Feb 17, 2025Updated last year
- β157Jun 22, 2023Updated 3 years ago
- β63Jul 21, 2024Updated last year
- A Tight-fisted Optimizerβ52Mar 7, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [Neurips 2022] β Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropogationβ, Ziyu Jiang*, Xuxi Chen*, Xueqin Huanβ¦β19Mar 14, 2023Updated 3 years ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".β48Jul 12, 2024Updated last year
- code for Scaling Laws of RoPE-based Extrapolationβ73Oct 16, 2023Updated 2 years ago
- [ICML 2026] Elastic Diffusion Transformer: Accelerating SOTA generation models (e.g., Qwen-Image, Hunyuan3d ) through adaptive computatioβ¦β45May 1, 2026Updated 2 months ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".β279Nov 3, 2023Updated 2 years ago
- β233Jun 11, 2024Updated 2 years ago
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".β11Feb 5, 2024Updated 2 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)β81Aug 30, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOEβ12Mar 18, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projectionβ1,698Oct 28, 2024Updated last year
- Microsoft Automatic Mixed Precision Libraryβ637Dec 1, 2025Updated 7 months ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Trainingβ264Aug 9, 2025Updated 10 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limitβ63Jun 21, 2023Updated 3 years ago
- β125Mar 18, 2026Updated 3 months ago
- LOMO: LOw-Memory Optimizationβ994Jul 2, 2024Updated last year
- The official implementation of the EMNLP 2023 paper LLM-FP4β225Dec 15, 2023Updated 2 years ago
- Linear Attention Sequence Parallelism (LASP)β87Jun 4, 2024Updated 2 years ago
- See https://github.com/cuda-mode/triton-index/ instead!β11May 8, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQβ101May 30, 2023Updated 3 years ago
- Collaborative Training of Large Language Models in an Efficient Wayβ421Aug 28, 2024Updated last year
- Deep neural network framework for multiple GPUsβ34Jun 20, 2015Updated 11 years ago
- Decensoring Hentaiβ13Sep 19, 2022Updated 3 years ago
- Implements the SM3-II adaptive optimization algorithm for PyTorch.β33Sep 3, 2024Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learningβ185Nov 11, 2025Updated 7 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)β58Nov 8, 2024Updated last year
- β10Apr 24, 2023Updated 3 years ago
- FLOPS counter for all your GPU benchmarking needsβ13Aug 8, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β32Dec 17, 2025Updated 6 months ago
- AudioSR-Upsampling (any -> 48kHz)β42Feb 13, 2024Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β23Jun 7, 2025Updated last year
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantizationβ722Aug 13, 2024Updated last year
- Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.β517Nov 26, 2024Updated last year
- β54Jul 18, 2024Updated last year
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793β458May 13, 2025Updated last year