☆120 · Jan 8, 2026 · Updated last month
Alternatives and similar repositories for Quartet
Users interested in Quartet are comparing it to the libraries listed below:
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆17 · Feb 9, 2026 · Updated 2 weeks ago
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning ☆168 · Nov 11, 2025 · Updated 3 months ago
- Official implementation for Training LLMs with MXFP4 ☆119 · Apr 25, 2025 · Updated 10 months ago
- (no description) ☆46 · May 20, 2025 · Updated 9 months ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data. ☆30 · Jan 21, 2026 · Updated last month
- (no description) ☆15 · Sep 22, 2024 · Updated last year
- DeeperGEMM: crazy optimized version ☆74 · May 5, 2025 · Updated 9 months ago
- documentation used in my projects ☆16 · Updated this week
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation ☆51 · Aug 24, 2025 · Updated 6 months ago
- Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” ☆128 · Updated this week
- My submission for the GPUMODE/AMD fp8 mm challenge ☆29 · Jun 4, 2025 · Updated 8 months ago
- ClearText is an AI-powered text detection and enhancement tool that helps make text in images more readable and clearer. Perfect for impr… ☆31 · Apr 29, 2025 · Updated 10 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization ☆39 · Feb 7, 2026 · Updated 3 weeks ago
- (no description) ☆17 · Aug 5, 2025 · Updated 6 months ago
- An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5). ☆277 · Jul 16, 2025 · Updated 7 months ago
- (no description) ☆40 · Sep 24, 2025 · Updated 5 months ago
- A central registry and HTTP interface for coordinating Model Context Protocol (MCP) servers. ☆34 · Dec 29, 2024 · Updated last year
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23 ☆23 · Apr 25, 2023 · Updated 2 years ago
- AFPQ code implementation ☆23 · Nov 6, 2023 · Updated 2 years ago
- Quantized Attention on GPU ☆44 · Nov 22, 2024 · Updated last year
- A wrapper around libssh2 for .NET ☆29 · Jan 21, 2026 · Updated last month
- (no description) ☆15 · Jan 12, 2026 · Updated last month
- SING: SDE Inference via Natural Gradients ☆36 · Dec 9, 2025 · Updated 2 months ago
- Personal Finance Expense Tracker ☆19 · Nov 14, 2025 · Updated 3 months ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It's now corresponding to Pytorch.fx. ☆13 · Apr 7, 2023 · Updated 2 years ago
- Tutorial for TikZ ☆11 · Apr 3, 2025 · Updated 10 months ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 6 months ago
- 🕷️ n8n Community Node for Scrappey API – Automate web scraping and data extraction with advanced anti-bot blocking technology, seamlessl… ☆16 · Feb 2, 2026 · Updated 3 weeks ago
- (no description) ☆16 · Jul 1, 2025 · Updated 8 months ago
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs" ☆12 · Jun 11, 2025 · Updated 8 months ago
- (no description) ☆18 · Dec 9, 2025 · Updated 2 months ago
- JAX Scalify: end-to-end scaled arithmetics ☆18 · Oct 30, 2024 · Updated last year
- A simple, interactive web tool to compare pricing and performance metrics of various AI models. ☆16 · Feb 20, 2026 · Updated last week
- Fast Hadamard transform in CUDA, with a PyTorch interface ☆285 · Oct 19, 2025 · Updated 4 months ago
- (no description) ☆12 · Sep 1, 2023 · Updated 2 years ago
- This repo documents my workflows and stack to run comfy ui GenANI assist under windows ☆30 · Feb 14, 2026 · Updated 2 weeks ago
- A powerful, interactive Python CLI for converting, manipulating, and inspecting media files using FFmpeg 🎬 ☆17 · Feb 10, 2026 · Updated 2 weeks ago
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f… ☆17 · Feb 11, 2026 · Updated 2 weeks ago
- (no description) ☆12 · Jan 4, 2024 · Updated 2 years ago