Explore training for quantized models
β26Jul 12, 2025Updated 9 months ago
Alternatives and similar repositories for quantized-training
Users that are interested in quantized-training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. π The official implementation of https://arxβ¦β28Feb 17, 2025Updated last year
- Inline PTX Assembly in CUDA exampleβ14May 7, 2022Updated 3 years ago
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and minβ¦β10Aug 13, 2024Updated last year
- The source code of the experimental evaluation of Deprez et al. (nd)β12Oct 8, 2025Updated 6 months ago
- β14Sep 30, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- simple grpoβ12May 28, 2025Updated 11 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.β27Nov 25, 2024Updated last year
- β16Dec 29, 2024Updated last year
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspaceβ18Oct 21, 2024Updated last year
- β39Jul 16, 2025Updated 9 months ago
- β13Apr 27, 2026Updated last week
- Python implementation of Efficient Graph-Based Image Segmentationβ25Sep 26, 2020Updated 5 years ago
- minimal C implementation of speculative decoding based on llama2.cβ29Jul 15, 2024Updated last year
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.comβ92Apr 24, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β13Apr 13, 2026Updated 3 weeks ago
- β91Feb 29, 2024Updated 2 years ago
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'β17Apr 24, 2025Updated last year
- β18Apr 22, 2026Updated last week
- A framework for fast exploration of the depth-first scheduling space for DNN acceleratorsβ43Feb 8, 2023Updated 3 years ago
- The vLLM XPU kernels for Intel GPUβ37Apr 28, 2026Updated last week
- β18Jan 4, 2024Updated 2 years ago
- Collection of scripts to build PyTorch and the domain libraries from source.β14Apr 1, 2026Updated last month
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.β19Feb 9, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- a minimal cache manager for PagedAttention, on top of llama3.β142Aug 26, 2024Updated last year
- β25Dec 12, 2017Updated 8 years ago
- β19Mar 12, 2026Updated last month
- β44Mar 11, 2026Updated last month
- This repository contains the experimental PyTorch native float8 training UXβ226Aug 1, 2024Updated last year
- Model Predictive Path Integral Control (MPPI) with PyTorchβ18Jan 26, 2024Updated 2 years ago
- SPAA'21: Efficient Stepping Algorithms and Implementations for Parallel Shortest Pathsβ21Aug 10, 2024Updated last year
- Python scripts to facilitate easy workingβ11Mar 23, 2026Updated last month
- Documentation on using the built-in Python debugger, PDB.β23Dec 8, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- β11Apr 2, 2026Updated last month
- β157Jun 22, 2023Updated 2 years ago
- β21Jul 30, 2024Updated last year
- β24Apr 7, 2026Updated 3 weeks ago
- Torch Frontend for IREEβ26Dec 21, 2023Updated 2 years ago
- β148Apr 4, 2026Updated last month
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixingβ49Jan 27, 2022Updated 4 years ago