MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
☆28Apr 2, 2026Updated 2 months ago
Alternatives and similar repositories for MicroMix
Users that are interested in MicroMix are comparing it to the libraries listed below.
- A mixed-precision GEMM with quantize and reorder kernels.☆13Jul 22, 2025Updated 10 months ago
- Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”☆193Apr 21, 2026Updated last month
- ☆16Dec 9, 2023Updated 2 years ago
- Pushes the latest arXiv papers to users every day☆18Aug 12, 2025Updated 10 months ago
- SICP Online Judge, consisting of a server, a react web interface and a modified Ok client.☆12Dec 5, 2022Updated 3 years ago
- Python client for 十字街☆10Sep 25, 2021Updated 4 years ago
- Two-Stage ECG Signal Denoising Based Deep Convolutional Network☆13Nov 19, 2021Updated 4 years ago
- [WWW'2024] "Simple Multigraph Convolution Networks"☆13May 21, 2024Updated 2 years ago
- Slides (PPT) for a digital electronics course on Bilibili☆11Feb 19, 2024Updated 2 years ago
- Quartet II Official Code☆74May 1, 2026Updated last month
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated 2 years ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs☆15Dec 17, 2024Updated last year
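- Nankai University operating systems course labs (UCore)☆11Oct 16, 2022Updated 3 years ago
- Nanjing University, Department of Computer Science and Technology, 2019 Fundamentals of Computer Systems PA☆14Sep 18, 2020Updated 5 years ago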
- BEHRT: Transformer for Electronic Health Records☆12May 11, 2020Updated 6 years ago
- Source code and model weights for the PGGAN model utilised for the paper: Evaluating the Clinical Realism of Synthetic Chest X-Rays Gener…☆11Mar 2, 2021Updated 5 years ago
- Ascend development notes☆17Jan 5, 2024Updated 2 years ago
- Computer Vision: Algorithms and Applications, 2nd ed. (translation)☆14Apr 25, 2021Updated 5 years ago
- ☆40Oct 21, 2025Updated 7 months ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆29Nov 11, 2025Updated 7 months ago
- Awesome Blockchain Articles☆14Jul 10, 2022Updated 3 years ago
- ☆20Apr 12, 2023Updated 3 years ago
- ☆34Mar 28, 2025Updated last year
- Beamer Theme for Beijing Jiaotong University (unofficial)☆18Sep 16, 2022Updated 3 years ago
- ☆22Aug 14, 2024Updated last year
- Beijing Jiaotong University name and emblem, vector graphics☆16May 10, 2021Updated 5 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- ☆13Aug 31, 2023Updated 2 years ago
- ☆135Updated this week
- Quantize transformers to any learned arbitrary 4-bit numeric format☆58Apr 13, 2026Updated 2 months ago
- [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.☆181Apr 24, 2026Updated last month
- ☆20Nov 26, 2024Updated last year
- ☆17Updated this week
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆185Nov 11, 2025Updated 7 months ago
- ☆80Feb 5, 2026Updated 4 months ago
- Central South University C++ past exam papers and lab code☆18Apr 17, 2023Updated 3 years ago
- ☆46Apr 21, 2026Updated last month
- ☆27Jul 7, 2023Updated 2 years ago
- Parallel Programming course at Nankai University (taught by 王刚)☆12Oct 5, 2022Updated 3 years ago