ptq4vm official repository
☆28Apr 7, 2025Updated last year
Alternatives and similar repositories for ptq4vm
Users that are interested in ptq4vm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆72Mar 7, 2024Updated 2 years ago
- The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]☆70Jun 19, 2025Updated last year
- It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher [CVPR 2022 Oral]☆29Sep 15, 2022Updated 3 years ago
- ☆18Jul 1, 2023Updated 2 years ago
- Implementation of Microscaling data formats in SystemVerilog.☆33Jul 6, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- BitNet a4.8 Implementation in one file of pytorch☆21Jan 13, 2025Updated last year
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆45Aug 19, 2021Updated 4 years ago
- 简单的未优化的SRT除法器☆12Jun 16, 2024Updated 2 years ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM☆14Dec 27, 2023Updated 2 years ago
- Hardware Division Units☆10Jul 17, 2014Updated 11 years ago
- ☆11Apr 5, 2023Updated 3 years ago
- RADIX-4 SRT division☆12Oct 31, 2019Updated 6 years ago
- PyTorch code for our paper "2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution"☆49Oct 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆29Jun 16, 2025Updated last year
- Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”☆194Apr 21, 2026Updated 2 months ago
- Adversarial Attack on Graph Neural Networks as An Influence Maximization Problem☆20Oct 27, 2021Updated 4 years ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆12Jul 9, 2025Updated 11 months ago
- [CVPR 2025] QuartDepth☆18Mar 24, 2025Updated last year
- ☆15Apr 6, 2026Updated 2 months ago
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction.☆10Jan 19, 2025Updated last year
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized DeepNeural Networks via Random Precision Training and Inferen…☆16Feb 13, 2022Updated 4 years ago
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆16Dec 9, 2023Updated 2 years ago
- ☆14Jul 14, 2025Updated 11 months ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆16Nov 25, 2025Updated 6 months ago
- AAAI2025☆13Apr 18, 2025Updated last year
- ☆15Apr 11, 2024Updated 2 years ago
- ☆12Jun 4, 2024Updated 2 years ago
- An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more☆12May 29, 2026Updated 3 weeks ago
- [CVPR 2026] MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent☆33Apr 30, 2026Updated last month
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆16Jun 26, 2025Updated 11 months ago
- ☆15Jul 25, 2024Updated last year
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- ☆23Dec 16, 2025Updated 6 months ago
- ☆16Aug 19, 2024Updated last year
- ☆12Jul 30, 2025Updated 10 months ago
- ☆17Oct 20, 2025Updated 8 months ago