PKUZHOU / PetS-ATC-2022
☆9 Updated last year
Related projects
Alternatives and complementary repositories for PetS-ATC-2022
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization ☆25 Updated 9 months ago
- ☆22 Updated last year
- ☆13 Updated 3 years ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24) ☆19 Updated last week
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering ☆29 Updated 3 months ago
- A Fast Graph Update Library for FPGA-based Dynamic Graph Processing ☆8 Updated 2 years ago
- STONNE Simulator integrated into SST Simulator ☆17 Updated 7 months ago
- Domain-Specific Architecture Generator 2 ☆20 Updated 2 years ago
- Heterogeneous ML accelerator ☆16 Updated last month
- The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow ☆29 Updated last year
- GNNear: Accelerating Full-Batch Training of Graph Neural Networks with Near-Memory Processing ☆13 Updated 2 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c… ☆23 Updated last year
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult… ☆36 Updated 8 months ago
- ☆25 Updated 3 years ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20… ☆21 Updated last year
- ☆23 Updated 2 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24) ☆46 Updated 5 months ago
- mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023) ☆38 Updated 6 months ago
- ☆25 Updated 4 years ago
- ☆33 Updated last year
- HW/SW co-designed end-host RPC stack ☆19 Updated 3 years ago
- ☆16 Updated 2 years ago
- ☆24 Updated last year
- ☆15 Updated 3 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems" ☆19 Updated 3 years ago
- Pin-based tool for simulation of rack-scale disaggregated memory systems ☆15 Updated 3 months ago
- ☆31 Updated 3 years ago
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs. ☆45 Updated last year
- NeuPIMs Simulator ☆54 Updated 5 months ago
- ☆14 Updated 2 years ago