fuvty / DeSCo
The official implementation of WSDM'24 paper <DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting>
☆15Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for DeSCo
- TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆21Updated last month
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity☆99Updated this week
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆23Updated last year
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆37Updated 8 months ago
- The official code for DATE'23 paper <CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory>☆20Updated last month
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆17Updated last week
- Code Repository of Evaluating Quantized Large Language Models☆103Updated 2 months ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆60Updated 2 years ago
- ☆131Updated 3 months ago
- ☆101Updated 3 years ago
- ☆15Updated last year
- [Mlsys'22] Understanding gnn computational graph: A coordinated computation, io, and memory perspective☆17Updated last year
- 16-fold memory access reduction with nearly no loss☆59Updated last week
- SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models☆24Updated 3 months ago
- Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs☆24Updated 4 months ago
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.☆45Updated last year
- ☆80Updated last year
- ☆41Updated 2 years ago
- Official PyTorch implementation of FlatQuant: Flatness Matters for LLM Quantization☆63Updated last week
- ☆23Updated 4 months ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆44Updated 5 months ago
- [ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Y…☆31Updated last year
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆83Updated 3 months ago
- Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity☆181Updated last year
- MagicPIG: LSH Sampling for Efficient LLM Generation☆59Updated 3 weeks ago
- ☆72Updated 3 years ago
- Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models☆36Updated 2 weeks ago
- Fast Hadamard transform in CUDA, with a PyTorch interface☆111Updated 5 months ago
- A sparse attention kernel supporting mix sparse patterns☆58Updated last month
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆131Updated last year