Explore training for quantized models
β26Jul 12, 2025Updated 9 months ago
Alternatives and similar repositories for quantized-training
Users that are interested in quantized-training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. π The official implementation of https://arxβ¦β28Feb 17, 2025Updated last year
- High-performance tokenized language data-loader for Python C++ extensionβ14Jul 22, 2024Updated last year
- Weakly Supervised Object Localization via Class RE-Activation Mappingβ12Sep 19, 2022Updated 3 years ago
- β14Sep 30, 2023Updated 2 years ago
- simple grpoβ12May 28, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- BPE modification that implements removing of the intermediate tokens during tokenizer training.β27Nov 25, 2024Updated last year
- β33Nov 19, 2025Updated 4 months ago
- β16Dec 29, 2024Updated last year
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspaceβ18Oct 21, 2024Updated last year
- Train Llama Loras Easilyβ31Aug 3, 2023Updated 2 years ago
- β38Jul 16, 2025Updated 8 months ago
- β13Apr 1, 2026Updated 2 weeks ago
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.comβ90Updated this week
- β16Nov 10, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- minimal C implementation of speculative decoding based on llama2.cβ29Jul 15, 2024Updated last year
- β13Updated this week
- β91Feb 29, 2024Updated 2 years ago
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'β16Apr 24, 2025Updated 11 months ago
- β42Mar 11, 2026Updated last month
- β18Mar 25, 2026Updated 2 weeks ago
- A framework for fast exploration of the depth-first scheduling space for DNN acceleratorsβ43Feb 8, 2023Updated 3 years ago
- Collection of scripts to build PyTorch and the domain libraries from source.β14Apr 1, 2026Updated last week
- a minimal cache manager for PagedAttention, on top of llama3.β142Aug 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β11Aug 2, 2024Updated last year
- Will write CUDA for 100 daysβ39May 25, 2025Updated 10 months ago
- β19Mar 12, 2026Updated last month
- Community maintained hardware plugin for vLLM on AWS Neuronβ28Mar 20, 2026Updated 3 weeks ago
- Model Predictive Path Integral Control (MPPI) with PyTorchβ18Jan 26, 2024Updated 2 years ago
- See https://youtube-dl.org/β10Oct 24, 2020Updated 5 years ago
- Diagram tensors from torch, jax, tensorflow, numpy, etc., for understanding and debuggingβ59Nov 28, 2025Updated 4 months ago
- SPAA'21: Efficient Stepping Algorithms and Implementations for Parallel Shortest Pathsβ21Aug 10, 2024Updated last year
- All the useful tools I have been using while working in data science for remote sensingβ11Nov 27, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python scripts to facilitate easy workingβ11Mar 23, 2026Updated 3 weeks ago
- Documentation on using the built-in Python debugger, PDB.β23Dec 8, 2022Updated 3 years ago
- β11Apr 2, 2026Updated last week
- β157Jun 22, 2023Updated 2 years ago
- β19Jul 30, 2024Updated last year
- Torch Frontend for IREEβ26Dec 21, 2023Updated 2 years ago
- β146Apr 4, 2026Updated last week