HectorHHZ / Sparse_Matrix_TuningLinks
Github repo for ICLR-2025 paper, Fine-tuning Large Language Models with Sparse Matrices
☆23Updated 8 months ago
Alternatives and similar repositories for Sparse_Matrix_Tuning
Users that are interested in Sparse_Matrix_Tuning are comparing it to the libraries listed below
Sorting:
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆61Updated 10 months ago
- ☆65Updated 9 months ago
- ☆128Updated 5 months ago
- Implement Flash Attention using Cute.☆100Updated last year
- ☆84Updated last year
- A practical way of learning Swizzle☆36Updated 11 months ago
- [HPCA 2026] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.☆79Updated last month
- [DAC'25] Official implement of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"☆97Updated last month
- ☆38Updated 5 months ago
- DeeperGEMM: crazy optimized version☆73Updated 8 months ago
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆78Updated last year
- NVIDIA cuTile learn☆154Updated last month
- FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels☆60Updated last week
- QAQ: Quality Adaptive Quantization for LLM KV Cache☆55Updated last year
- ☆52Updated 8 months ago
- Triton adapter for Ascend. Mirror of https://gitee.com/ascend/triton-ascend☆105Updated this week
- Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding☆82Updated last month
- Quantized Attention on GPU☆44Updated last year
- ☆77Updated last year
- ☆20Updated last year
- Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning☆59Updated last year
- Framework to reduce autotune overhead to zero for well known deployments.☆94Updated 4 months ago
- ☆35Updated 10 months ago
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang☆43Updated 2 months ago
- PyTorch bindings for CUTLASS grouped GEMM.☆141Updated 8 months ago
- Learning TileLang with 10 puzzles!☆56Updated this week
- ☆40Updated last year
- ☆41Updated 3 months ago
- Tile-based language built for AI computation across all scales☆117Updated 2 weeks ago
- ☆84Updated 9 months ago