MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
☆28Apr 2, 2026Updated last month
Alternatives and similar repositories for MicroMix
Users that are interested in MicroMix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a mixed-precision gemm with quantize and reorder kernel.☆13Jul 22, 2025Updated 9 months ago
- Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”☆173Apr 21, 2026Updated 2 weeks ago
- ☆16Dec 9, 2023Updated 2 years ago
- 为用户每天推送arxiv的最新发布论文☆17Aug 12, 2025Updated 8 months ago
- SICP Online Judge, consisting of a server, a react web interface and a modified Ok client.☆12Dec 5, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 十字街的python客户端☆10Sep 25, 2021Updated 4 years ago
- Two-Stage ECG Signal Denoising Based Deep Convolutional Network☆13Nov 19, 2021Updated 4 years ago
- [WWW'2024] "Simple Multigraph Convolution Networks"☆13May 21, 2024Updated last year
- Quartet II Official Code☆70Updated this week
- B站-数电的ppt☆11Feb 19, 2024Updated 2 years ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs☆13Dec 17, 2024Updated last year
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated last year
- 南京大学ICS2019 PA实验, 实验手册https://nju-projectn.github.io/ics-pa-gitbook/ics2019/☆10Aug 22, 2020Updated 5 years ago
- 南开大学操作系统课程实验(UCore)☆11Oct 16, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 南京大学 计算机科学与技术系2019 计算机系统基础PA☆14Sep 18, 2020Updated 5 years ago
- BEHRT: Transformer for Electronic Health Recrods☆12May 11, 2020Updated 5 years ago
- Source code and model weights for the PGGAN model utilised for the paper: Evaluating the Clinical Realism of Synthetic Chest X-Rays Gener…☆11Mar 2, 2021Updated 5 years ago
- 昇腾开发笔记☆16Jan 5, 2024Updated 2 years ago
- Computer Vision: Algorithms and Applications, 2nd ed,翻译☆14Apr 25, 2021Updated 5 years ago
- ☆36Oct 21, 2025Updated 6 months ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆28Nov 11, 2025Updated 5 months ago
- Awesome Blockchain Articles☆14Jul 10, 2022Updated 3 years ago
- ☆20Apr 12, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆34Mar 28, 2025Updated last year
- 北京交通大学 Beamer 主题(非官方)|Beamer Theme for Beijing Jiaotong University (unofficial)☆18Sep 16, 2022Updated 3 years ago
- ☆23Aug 14, 2024Updated last year
- 北京交通大学-校名校徽-矢量图☆16May 10, 2021Updated 4 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- ☆80Apr 29, 2026Updated last week
- ☆12Aug 31, 2023Updated 2 years ago
- Quantize transformers to any learned arbitrary 4-bit numeric format☆55Apr 13, 2026Updated 3 weeks ago
- [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.☆180Apr 24, 2026Updated last week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆20Nov 26, 2024Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆179Nov 11, 2025Updated 5 months ago
- ☆65Feb 5, 2026Updated 3 months ago
- ☆15Apr 18, 2025Updated last year
- 中南大学C++的历年试卷和实验代码☆18Apr 17, 2023Updated 3 years ago
- ☆46Apr 21, 2026Updated 2 weeks ago
- ☆26Jul 7, 2023Updated 2 years ago