uchuhimo / amanda
☆16Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for amanda
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆27Updated 2 years ago
- ☆82Updated 5 months ago
- ☆27Updated 3 months ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆37Updated 7 months ago
- Compiler for Dynamic Neural Networks☆43Updated 11 months ago
- ☆72Updated last year
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆102Updated 2 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆43Updated 5 months ago
- ☆23Updated 4 months ago
- LLM serving cluster simulator☆74Updated 6 months ago
- ☆22Updated 2 years ago
- LLM Inference analyzer for different hardware platforms☆41Updated last week
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆81Updated last year
- DietCode Code Release☆61Updated 2 years ago
- ☆23Updated last year
- ☆31Updated 2 years ago
- ☆25Updated 4 years ago
- ☆32Updated last year
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆75Updated 4 months ago
- ☆81Updated 4 months ago
- ☆24Updated 7 months ago
- ☆18Updated 3 months ago
- ☆37Updated 3 years ago
- A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs☆11Updated 11 months ago
- Horizontal Fusion☆21Updated 2 years ago
- Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli…☆17Updated 4 months ago
- ☆9Updated 2 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆23Updated last year
- ☆89Updated 2 years ago
- ☆41Updated 6 months ago