Low-bit optimizers for PyTorch
β138Oct 9, 2023Updated 2 years ago
Alternatives and similar repositories for low-bit-optimizers
Users that are interested in low-bit-optimizers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-trainingβ39Jun 20, 2025Updated 10 months ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. π The official implementation of https://arxβ¦β28Feb 17, 2025Updated last year
- β157Jun 22, 2023Updated 2 years ago
- β63Jul 21, 2024Updated last year
- A Tight-fisted Optimizerβ52Mar 7, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [Neurips 2022] β Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropogationβ, Ziyu Jiang*, Xuxi Chen*, Xueqin Huanβ¦β19Mar 14, 2023Updated 3 years ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".β48Jul 12, 2024Updated last year
- code for Scaling Laws of RoPE-based Extrapolationβ73Oct 16, 2023Updated 2 years ago
- β14Aug 1, 2025Updated 9 months ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".β280Nov 3, 2023Updated 2 years ago
- β235Jun 11, 2024Updated last year
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".β11Feb 5, 2024Updated 2 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)β81Aug 30, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOEβ12Mar 18, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Quartet II Official Codeβ70Updated this week
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projectionβ1,689Oct 28, 2024Updated last year
- Microsoft Automatic Mixed Precision Libraryβ636Dec 1, 2025Updated 5 months ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Trainingβ262Aug 9, 2025Updated 8 months ago
- β122Mar 18, 2026Updated last month
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limitβ63Jun 21, 2023Updated 2 years ago
- LOMO: LOw-Memory Optimizationβ991Jul 2, 2024Updated last year
- The official implementation of the EMNLP 2023 paper LLM-FP4β224Dec 15, 2023Updated 2 years ago
- Linear Attention Sequence Parallelism (LASP)β88Jun 4, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- See https://github.com/cuda-mode/triton-index/ instead!β11May 8, 2024Updated last year
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQβ101May 30, 2023Updated 2 years ago
- Collaborative Training of Large Language Models in an Efficient Wayβ420Aug 28, 2024Updated last year
- Deep neural network framework for multiple GPUsβ34Jun 20, 2015Updated 10 years ago
- Decensoring Hentaiβ14Sep 19, 2022Updated 3 years ago
- Implements the SM3-II adaptive optimization algorithm for PyTorch.β33Sep 3, 2024Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learningβ179Nov 11, 2025Updated 5 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)β58Nov 8, 2024Updated last year
- β10Apr 24, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FLOPS counter for all your GPU benchmarking needsβ13Aug 8, 2024Updated last year
- β32Dec 17, 2025Updated 4 months ago
- AudioSR-Upsampling (any -> 48kHz)β42Feb 13, 2024Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β22Jun 7, 2025Updated 10 months ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantizationβ718Aug 13, 2024Updated last year
- Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.β506Nov 26, 2024Updated last year
- β53Jul 18, 2024Updated last year