[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
☆280May 15, 2026Updated this week
Alternatives and similar repositories for paroquant
Users who are interested in paroquant are comparing it to the libraries listed below.
- QuIP quantization☆66Mar 17, 2024Updated 2 years ago
- Pure MLX implementations of UMAP, t-SNE, PaCMAP, TriMap, DREAMS, CNE, MMAE, and NNDescent for Apple Silicon. Metal GPU for computation an…☆83Mar 20, 2026Updated last month
- Generates GIF images of dancing poop.☆17May 7, 2022Updated 4 years ago
- Image Gaussian Splatting☆25Jul 21, 2025Updated 9 months ago
- ☆171Jun 22, 2025Updated 10 months ago
- [ICLR 2025] OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…☆92Apr 8, 2025Updated last year
- GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion (Information Fusion 2026)☆18Dec 14, 2025Updated 5 months ago
- Large DNNs training framework for consumer GPUs☆78Updated this week
- Tiny Lab is a small Apple Silicon ML research tool with a real control plane, one shipped MLX training path, and checkpoint evaluation bu…☆95Mar 10, 2026Updated 2 months ago
- ☆77Jun 20, 2025Updated 10 months ago
- Source code for Cu, a Discord bot☆12Apr 12, 2025Updated last year
- SuperHTML support for zed☆12Mar 25, 2026Updated last month
- ☆17Mar 4, 2024Updated 2 years ago
- The Official PyTorch implementation of Shared LoRA Subspaces for almost Strict Continual Learning☆31May 7, 2026Updated last week
- A collection of Git PHP Hooks that you may want to use with GitPHPHooks☆12Feb 19, 2015Updated 11 years ago
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu…☆90Mar 6, 2026Updated 2 months ago
- The StereOS agent management daemon.☆44Updated this week
- The official implementation of BiViT: Extremely Compressed Binary Vision Transformers☆16Jun 18, 2023Updated 2 years ago
- [ICCV 2025] QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation☆17Sep 26, 2025Updated 7 months ago
- PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal ac…☆280May 8, 2026Updated last week
- Xilly Game Mode is a competitive-grade optimization utility designed to instantly reallocate your PC's resources for maximum gaming perfo…☆19Feb 13, 2026Updated 3 months ago
- ☆15Apr 26, 2025Updated last year
- An example model of a Network Processing Unit using the PFPSim framework.☆13Aug 23, 2016Updated 9 years ago
- A MIPS CPU with dual-issue, out-of-order, and 5-stage pipelines☆11Nov 28, 2019Updated 6 years ago
- ☆18Mar 18, 2024Updated 2 years ago
- ☆24Jan 30, 2025Updated last year
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆13Jun 7, 2023Updated 2 years ago
- Artifacts for ATC '22 paper "Faster Software Packet Processing on FPGA NICs with eBPF Program Warping"☆17May 20, 2022Updated 4 years ago
- [ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactiv…☆659Updated this week
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"☆21Mar 25, 2026Updated last month
- My attempt to improve the speed of the Newton-Schulz algorithm, starting from the dion implementation.☆38Apr 30, 2026Updated 2 weeks ago
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication☆15Mar 25, 2024Updated 2 years ago
- A suite of tools for pretty printing, diffing, and exploring abstract syntax trees.☆16Mar 3, 2026Updated 2 months ago
- [ICLR 2026] QeRL enables RL for 32B LLMs on a single H100 GPU.☆503Mar 30, 2026Updated last month
- AFPQ code implementation☆23Nov 6, 2023Updated 2 years ago
- [GSI 2023] Learning Lagrangian Fluid Mechanics with E(3)-Equivariant GNNs☆15Jun 3, 2024Updated last year
- ☆10Oct 24, 2024Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last month
- Minimal pi coding-agent re-implementation in Zig☆83Apr 2, 2026Updated last month