YoungHyun197 / ptq4vmView external linksLinks
ptq4vm official repository
☆25Apr 7, 2025Updated 10 months ago
Alternatives and similar repositories for ptq4vm
Users that are interested in ptq4vm are comparing it to the libraries listed below
Sorting:
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 7 months ago
- The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]☆67Jun 19, 2025Updated 7 months ago
- It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher [CVPR 2022 Oral]☆29Sep 15, 2022Updated 3 years ago
- Implementation of Microscaling data formats in SystemVerilog.☆29Jul 6, 2025Updated 7 months ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆69Mar 7, 2024Updated last year
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆13Apr 29, 2025Updated 9 months ago
- Parse command line arguments by defining dataclasses☆13Oct 13, 2024Updated last year
- ☆10Apr 24, 2024Updated last year
- AAAI2025☆11Apr 18, 2025Updated 9 months ago
- RADIX-4 SRT division☆12Oct 31, 2019Updated 6 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆25Jun 16, 2025Updated 8 months ago
- An up-to-date list of progress made in next-generation AI.☆11Apr 2, 2023Updated 2 years ago
- a fast and customizable CUDA int4 tensor core gemm☆15Aug 2, 2024Updated last year
- [CVPR 2025] QuartDepth☆16Mar 24, 2025Updated 10 months ago
- 简单的未优化的SRT除法器☆12Jun 16, 2024Updated last year
- ☆13Jul 14, 2025Updated 7 months ago
- Official Implementation of Robustifying and Boosting Training-Free Neural Architecture Search☆10Mar 12, 2024Updated last year
- Hardware Division Units☆10Jul 17, 2014Updated 11 years ago
- [ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design☆22Jul 4, 2025Updated 7 months ago
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆21May 23, 2025Updated 8 months ago
- ☆12Aug 26, 2025Updated 5 months ago
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- Implementation of Contrastive Predictive Coding for Natural Language☆10Sep 16, 2020Updated 5 years ago
- ☆10Dec 25, 2025Updated last month
- 本项目提供了面向中文的XLNet预训练模型,旨在丰富中文自然语言处理资源,提供多元化的中文预训练模型选择。 我们欢迎各位专家学者下载使用,并共同促进和发展中文资源建设。☆11May 30, 2023Updated 2 years ago
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 2 years ago
- ☆18Jan 30, 2026Updated 2 weeks ago
- Task Aware Downscaling for efficient storing and accurate reconstruction in image and video domain☆12Jul 25, 2024Updated last year
- Basic floating-point components for RISC-V processors☆11Aug 13, 2017Updated 8 years ago
- ☆11Apr 5, 2023Updated 2 years ago
- ☆10Jun 4, 2024Updated last year
- ☆15Jan 12, 2026Updated last month
- ☆11Sep 20, 2024Updated last year
- A PyTorch implementation of [VCT](https://github.com/google-research/google-research/tree/master/vct)☆10Nov 25, 2022Updated 3 years ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 7 months ago
- ☆13Jul 25, 2024Updated last year
- An FL algorithm inspired by FedGMA☆10Oct 21, 2023Updated 2 years ago
- First Latency-Aware Competitive LLM Agent Benchmark☆26Jun 3, 2025Updated 8 months ago