AozhongZhang / MagRView external linksLinks
☆13Jun 22, 2025Updated 7 months ago
Alternatives and similar repositories for MagR
Users that are interested in MagR are comparing it to the libraries listed below
Sorting:
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM☆14Dec 27, 2023Updated 2 years ago
- ☆25Oct 31, 2024Updated last year
- ☆19Nov 6, 2023Updated 2 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆50Oct 21, 2023Updated 2 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆31Mar 12, 2024Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆28Jul 13, 2023Updated 2 years ago
- My fork os allen AI's OLMo for educational purposes.☆28Dec 5, 2024Updated last year
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization☆37Sep 24, 2024Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated last year
- A code base for the third place solution of Ego-Exo4D bodypose challenge for CVPR2024 workshop☆12Jun 16, 2024Updated last year
- ☆52Nov 5, 2024Updated last year
- [ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models☆51Aug 9, 2024Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆41Aug 4, 2023Updated 2 years ago
- Get a mask and goggle for your avatar now! 为预防2020新型冠狀病毒肺炎,请积极佩戴口罩及护目镜。☆10Dec 25, 2024Updated last year
- ☆30Nov 15, 2025Updated 3 months ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated this week
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- ☆14May 21, 2024Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated 11 months ago
- PyTorch code for full quantization of DNN using BCGD☆14Jul 24, 2019Updated 6 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation☆12Jul 14, 2022Updated 3 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆11Jun 20, 2025Updated 7 months ago
- ☆11Apr 3, 2023Updated 2 years ago
- tensorrt部署教程☆11Aug 1, 2025Updated 6 months ago
- ☆15Jan 12, 2026Updated last month
- Conditional DDPM for characterizing radio sources from dirty images. (autumn 2023)☆11Nov 30, 2023Updated 2 years ago
- ☆39Jan 16, 2026Updated 3 weeks ago
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"☆19Jul 11, 2024Updated last year
- Face detection using Multi-scale Block Local Binary Pattern algorithm - optimized with OpenCL/OpenMP - Depreciated - pls use convolutiona…☆11Jul 16, 2017Updated 8 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- ☆16Dec 7, 2025Updated 2 months ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- ☆13Oct 13, 2025Updated 4 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 4 months ago
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimization☆15Sep 17, 2025Updated 4 months ago