[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
☆170Mar 26, 2026Updated this week
Alternatives and similar repositories for paroquant
Users that are interested in paroquant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- QuIP quantization☆63Mar 17, 2024Updated 2 years ago
- ☆34Mar 28, 2025Updated last year
- 踊るうんこのgif画像を生成します。☆17May 7, 2022Updated 3 years ago
- TinyNS: Platform-Aware Neurosymbolic Auto Tiny Machine Learning☆25Jun 2, 2023Updated 2 years ago
- ☆167Jun 22, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion☆18Dec 14, 2025Updated 3 months ago
- Sketch Based Image Retrieval☆10Jul 13, 2018Updated 7 years ago
- ☆73Jun 20, 2025Updated 9 months ago
- Repository containing Anki Flashcards & source code to hopefully learn/revise any language☆11Jan 30, 2026Updated 2 months ago
- ☆15Mar 21, 2025Updated last year
- ☆17Mar 4, 2024Updated 2 years ago
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 8 months ago
- Dice Language Support for VS Code☆10Sep 29, 2020Updated 5 years ago
- A LLM-friendly framework for translating dynamical equations to gymnasium-compatible RL environments.☆33Mar 18, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Probabilistic Circuits in Julia☆10Dec 27, 2023Updated 2 years ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆33Dec 5, 2025Updated 3 months ago
- [ICCV 2025] QuantCache:Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation☆16Sep 26, 2025Updated 6 months ago
- ☆15Apr 26, 2025Updated 11 months ago
- An example model of a Network Processing Unit using the PFPSim framework.☆13Aug 23, 2016Updated 9 years ago
- Helper function for working with the REAL-Colon Dataset☆50Sep 5, 2025Updated 6 months ago
- ☆24Jan 30, 2025Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication☆15Mar 25, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A suite of tools for pretty printing, diffing, and exploring abstract syntax trees.☆15Mar 3, 2026Updated 3 weeks ago
- A NFC card reader for Campus card of NEU ( China )☆12Mar 13, 2021Updated 5 years ago
- [ICLR 2026]QeRL enables RL for 32B LLMs on a single H100 GPU.☆493Nov 27, 2025Updated 4 months ago
- AFPQ code implementation☆23Nov 6, 2023Updated 2 years ago
- Unofficial Implementation of Consistency Models in Pytorch☆15Mar 18, 2023Updated 3 years ago
- FPGA 2025 SAT Accel: A modern SAT Solver on FPGA Repository☆13Mar 13, 2025Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- ☆13Jun 29, 2024Updated last year
- Code repo for the paper "SpinQuant LLM quantization with learned rotations"☆380Feb 14, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The official implementation of "EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models"☆21Jul 8, 2025Updated 8 months ago
- DFlash: Block Diffusion for Flash Speculative Decoding☆671Mar 17, 2026Updated last week
- ☆14Jan 28, 2025Updated last year
- A synthesis flow for hybrid processing-in-RRAM modes☆12Jul 15, 2021Updated 4 years ago
- Sniffer,大二网络编程的课程设计☆10Feb 28, 2022Updated 4 years ago
- ☆30Jan 22, 2026Updated 2 months ago
- Nightly Build for LMDeploy☆11Jan 28, 2025Updated last year