MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
☆29Feb 12, 2026Updated last month
Alternatives and similar repositories for MicroMix
Users that are interested in MicroMix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a mixed-precision gemm with quantize and reorder kernel.☆14Jul 22, 2025Updated 8 months ago
- Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”☆140Mar 7, 2026Updated 2 weeks ago
- ☆16Dec 9, 2023Updated 2 years ago
- 为用户每天推送arxiv的最新发布论文☆15Aug 12, 2025Updated 7 months ago
- SICP Online Judge, consisting of a server, a react web interface and a modified Ok client.☆12Dec 5, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 十字街的python客户端☆10Sep 25, 2021Updated 4 years ago
- Quartet II Official Code☆61Mar 19, 2026Updated last week
- Two-Stage ECG Signal Denoising Based Deep Convolutional Network☆13Nov 19, 2021Updated 4 years ago
- [WWW'2024] "Simple Multigraph Convolution Networks"☆13May 21, 2024Updated last year
- B站-数电的ppt☆11Feb 19, 2024Updated 2 years ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs☆13Dec 17, 2024Updated last year
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated last year
- 南开大学操作系统课程实验(UCore)☆11Oct 16, 2022Updated 3 years ago
- 南京大学ICS2019 PA实验, 实验手册https://nju-projectn.github.io/ics-pa-gitbook/ics2019/☆10Aug 22, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 南京大学 计算机科学与技术系2019 计算机系统基础PA☆14Sep 18, 2020Updated 5 years ago
- BEHRT: Transformer for Electronic Health Recrods☆12May 11, 2020Updated 5 years ago
- Source code and model weights for the PGGAN model utilised for the paper: Evaluating the Clinical Realism of Synthetic Chest X-Rays Gener…☆12Mar 2, 2021Updated 5 years ago
- 昇腾开发笔记☆14Jan 5, 2024Updated 2 years ago
- Computer Vision: Algorithms and Applications, 2nd ed,翻译☆14Apr 25, 2021Updated 4 years ago
- ☆34Oct 21, 2025Updated 5 months ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆27Nov 11, 2025Updated 4 months ago
- Awesome Blockchain Articles☆14Jul 10, 2022Updated 3 years ago
- ☆20Apr 12, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆34Mar 28, 2025Updated 11 months ago
- 北京交通大学 Beamer 主题(非官方)|Beamer Theme for Beijing Jiaotong University (unofficial)☆18Sep 16, 2022Updated 3 years ago
- ☆23Aug 14, 2024Updated last year
- 北京交通大学-校名校徽-矢量图☆15May 10, 2021Updated 4 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- Quantize transformers to any learned arbitrary 4-bit numeric format☆53Jan 25, 2026Updated 2 months ago
- ☆12Aug 31, 2023Updated 2 years ago
- [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.☆179Oct 3, 2024Updated last year
- ☆55Feb 5, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆19Nov 26, 2024Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆171Nov 11, 2025Updated 4 months ago
- ☆15Apr 18, 2025Updated 11 months ago
- 中南大学C++的历年试卷和实验代码☆18Apr 17, 2023Updated 2 years ago
- ☆44Feb 27, 2026Updated 3 weeks ago
- ☆25Jul 7, 2023Updated 2 years ago
- This is a series of quick start guide of Vitis HLS tool in Chinese. It explains the basic concepts and the most important optimize techni…☆26Nov 9, 2022Updated 3 years ago