mit-han-lab / fouroversix
Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”
☆93 Updated this week
Alternatives and similar repositories for fouroversix
Users who are interested in fouroversix are comparing it to the libraries listed below.
- ☆83 Updated 11 months ago
- ☆60 Updated last year
- Code for the paper “FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference” (ICLR 2025 Oral)