MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
☆28Feb 12, 2026Updated 3 weeks ago
Alternatives and similar repositories for MicroMix
Users that are interested in MicroMix are comparing it to the libraries listed below
Sorting:
- a mixed-precision gemm with quantize and reorder kernel.☆14Jul 22, 2025Updated 7 months ago
- Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”☆130Updated this week
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆22Nov 11, 2025Updated 3 months ago
- B站-数电的ppt☆11Feb 19, 2024Updated 2 years ago
- 为用户每天推送arxiv的最新发布论文☆15Aug 12, 2025Updated 6 months ago
- Two-Stage ECG Signal Denoising Based Deep Convolutional Network☆13Nov 19, 2021Updated 4 years ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs☆12Dec 17, 2024Updated last year
- 南开大学操作系统课程实验(UCore)☆11Oct 16, 2022Updated 3 years ago
- 南京大学ICS2019 PA实验, 实验手册https://nju-projectn.github.io/ics-pa-gitbook/ics2019/☆10Aug 22, 2020Updated 5 years ago
- 南京大学 计算机科学与技术系2019 计算机系统基础PA☆14Sep 18, 2020Updated 5 years ago
- ☆12Aug 31, 2023Updated 2 years ago
- Source code and model weights for the PGGAN model utilised for the paper: Evaluating the Clinical Realism of Synthetic Chest X-Rays Gener…☆12Mar 2, 2021Updated 5 years ago
- ☆16Dec 9, 2023Updated 2 years ago
- BEHRT: Transformer for Electronic Health Recrods☆12May 11, 2020Updated 5 years ago
- ☆15Apr 18, 2025Updated 10 months ago
- SICP Online Judge, consisting of a server, a react web interface and a modified Ok client.☆12Dec 5, 2022Updated 3 years ago
- Awesome Blockchain Articles☆14Jul 10, 2022Updated 3 years ago
- 北京交通大学-校名校徽-矢量图☆15May 10, 2021Updated 4 years ago
- 昇腾开发笔记☆15Jan 5, 2024Updated 2 years ago
- ☆32Oct 21, 2025Updated 4 months ago
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated last year
- ☆22Aug 14, 2024Updated last year
- [WWW'2024] "Simple Multigraph Convolution Networks"☆13May 21, 2024Updated last year
- Quartet II Official Code☆51Updated this week
- ☆18Nov 26, 2024Updated last year
- ☆15Feb 8, 2023Updated 3 years ago
- 北京交通大学 Beamer 主题(非官方)|Beamer Theme for Beijing Jiaotong University (unofficial)☆18Sep 16, 2022Updated 3 years ago
- Computer Vision: Algorithms and Applications, 2nd ed,翻译☆15Apr 25, 2021Updated 4 years ago
- Code repository for ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"☆25Mar 2, 2025Updated last year
- ☆23Jul 7, 2023Updated 2 years ago
- The course of Parallel programming in Nankai university(南开大学《并行程序设计》课程 by 王刚老师)☆11Oct 5, 2022Updated 3 years ago
- 中南大学C++的历年试卷和实验代码☆18Apr 17, 2023Updated 2 years ago
- [ACM MM2025]: MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization☆37Aug 13, 2025Updated 6 months ago
- ☆20Apr 12, 2023Updated 2 years ago
- ☆49Feb 5, 2026Updated last month
- This was my thesis topic in undergrad when I tried to track the heart beats from raw ECG signals. The intention was to build a model that…☆22Oct 14, 2024Updated last year
- 简单的Mac中文指南☆23Nov 9, 2022Updated 3 years ago
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆168Nov 11, 2025Updated 3 months ago
- hardware & software prefetcher☆30Dec 21, 2023Updated 2 years ago