☆31Apr 2, 2025Updated 10 months ago
Alternatives and similar repositories for SpInfer_EuroSys25
Users that are interested in SpInfer_EuroSys25 are comparing it to the libraries listed below
Sorting:
- A intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding☆15Oct 20, 2021Updated 4 years ago
- [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.☆134May 16, 2024Updated last year
- ☆20Nov 7, 2019Updated 6 years ago
- ☆88May 31, 2025Updated 9 months ago
- A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs☆28Nov 29, 2023Updated 2 years ago
- FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…☆39Oct 5, 2025Updated 4 months ago
- Optimize GEMM with tensorcore step by step☆36Dec 17, 2023Updated 2 years ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Dec 6, 2023Updated 2 years ago
- ☆38Mar 14, 2024Updated last year
- ☆11Jan 21, 2021Updated 5 years ago
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems☆45Aug 2, 2025Updated 6 months ago
- Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…☆282Mar 6, 2025Updated 11 months ago
- ☆11Aug 4, 2020Updated 5 years ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- Here is the repo for public scripts.☆11Jul 16, 2022Updated 3 years ago
- Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision☆12Jan 19, 2026Updated last month
- Code and Data for ACL 2025 Paper "Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework".☆23Oct 3, 2025Updated 4 months ago
- TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models, optimized for edge deployment on Xi…☆26Mar 24, 2025Updated 11 months ago
- Are gradient information useful for pruning of LLMs?☆47Aug 23, 2025Updated 6 months ago
- A Easy-to-understand TensorOp Matmul Tutorial☆410Feb 11, 2026Updated 2 weeks ago
- Pie: Programmable LLM Serving☆126Feb 18, 2026Updated last week
- ☆11Mar 15, 2023Updated 2 years ago
- ☆12Jan 7, 2025Updated last year
- ☆13Sep 19, 2024Updated last year
- The official implementation of dLLM-Var☆30Nov 6, 2025Updated 3 months ago
- ☆11Sep 20, 2024Updated last year
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆12Aug 12, 2022Updated 3 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- ☆13Mar 9, 2024Updated last year
- Multi Layer Perceptron by Vivado HLS for Xilinx FPGA implementation☆12Dec 26, 2016Updated 9 years ago
- ☆11Mar 9, 2022Updated 3 years ago
- Example of Matrix Multiplication using Map Reduce paradigm in python☆10Oct 25, 2016Updated 9 years ago
- FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion (NeurIPS 2024 Spotlight)☆14Mar 31, 2025Updated 11 months ago
- Chinese Guide for Alveo Getting Started☆12May 18, 2020Updated 5 years ago
- 一些有趣的页面,使用 Github Pages 和 Vercel 部署☆13Feb 8, 2024Updated 2 years ago
- ☆11May 4, 2025Updated 9 months ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆14Feb 14, 2020Updated 6 years ago