Explore training for quantized models
☆26 · Jul 12, 2025 · Updated 11 months ago
Alternatives and similar repositories for quantized-training
Users that are interested in quantized-training are comparing it to the libraries listed below.
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. The official implementation of https://arx… ☆31 · Feb 17, 2025 · Updated last year
- High-performance tokenized language data-loader for Python C++ extension ☆15 · Jul 22, 2024 · Updated last year
- Inline PTX Assembly in CUDA example ☆14 · May 7, 2022 · Updated 4 years ago
- Weakly Supervised Object Localization via Class RE-Activation Mapping ☆12 · Sep 19, 2022 · Updated 3 years ago
- The source code of the experimental evaluation of Deprez et al. (nd) ☆11 · Oct 8, 2025 · Updated 8 months ago
- ☆15 · Sep 30, 2023 · Updated 2 years ago
- simple grpo ☆12 · May 28, 2025 · Updated last year
- BPE modification that removes intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- ☆40 · Nov 19, 2025 · Updated 7 months ago
- ☆16 · Dec 29, 2024 · Updated last year
- An extension to the GaLore paper, to perform Natural Gradient Descent in low rank subspace ☆19 · Oct 21, 2024 · Updated last year
- ☆19 · Apr 16, 2025 · Updated last year
- ☆43 · Jul 16, 2025 · Updated 11 months ago
- ☆13 · May 4, 2026 · Updated 2 months ago
- minimal C implementation of speculative decoding based on llama2.c ☆30 · Jul 15, 2024 · Updated last year
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com ☆100 · Jun 25, 2026 · Updated last week
- ☆13 · Jun 10, 2026 · Updated 3 weeks ago
- ☆92 · Feb 29, 2024 · Updated 2 years ago
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization' ☆18 · Apr 24, 2025 · Updated last year
- ☆17 · Nov 10, 2025 · Updated 7 months ago
- ☆18 · May 6, 2026 · Updated last month
- Collection of scripts to build PyTorch and the domain libraries from source. ☆14 · Jun 9, 2026 · Updated 3 weeks ago
- Implementation of an RL-based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 Rubik's Cube ☆19 · Mar 12, 2019 · Updated 7 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆19 · Feb 9, 2026 · Updated 4 months ago
- a minimal cache manager for PagedAttention, on top of llama3. ☆146 · Aug 26, 2024 · Updated last year
- ☆11 · Aug 2, 2024 · Updated last year
- Will write CUDA for 100 days ☆39 · May 25, 2025 · Updated last year
- ☆44 · Mar 11, 2026 · Updated 3 months ago
- This repository contains the experimental PyTorch native float8 training UX ☆226 · Aug 1, 2024 · Updated last year
- See https://youtube-dl.org/ ☆10 · Oct 24, 2020 · Updated 5 years ago
- Model Predictive Path Integral Control (MPPI) with PyTorch ☆18 · Jan 26, 2024 · Updated 2 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆28 · May 4, 2025 · Updated last year
- Diagram tensors from torch, jax, tensorflow, numpy, etc., for understanding and debugging ☆59 · Nov 28, 2025 · Updated 7 months ago
- Python scripts to facilitate easy working ☆11 · Mar 23, 2026 · Updated 3 months ago
- Documentation on using the built-in Python debugger, PDB. ☆24 · Dec 8, 2022 · Updated 3 years ago
- ☆11 · Jun 15, 2026 · Updated 2 weeks ago
- ☆157 · Jun 22, 2023 · Updated 3 years ago
- ☆21 · Jul 30, 2024 · Updated last year
- Torch Frontend for IREE ☆26 · Dec 21, 2023 · Updated 2 years ago