harvard-acc / RecPipe
☆10Updated 2 years ago
Related projects:
- http://vlsiarch.eecs.harvard.edu/research/recommendation/ ☆127Updated 2 years ago
- Set of datasets for the deep learning recommendation model (DLRM).☆39Updated last year
- Enabling pure data parallel training of DLRM via caching and prefetching☆18Updated 2 years ago
- ☆12Updated last year
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022☆29Updated last week
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun…☆45Updated last year
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions☆13Updated 3 years ago
- Thinking is hard - automate it☆18Updated 2 years ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆16Updated last year
- Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching☆34Updated 2 months ago
- Repo for the IISWC 2018 submission☆9Updated 2 years ago
- ☆29Updated 7 months ago
- ☆12Updated 2 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆34Updated 6 months ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆58Updated last year
- ☆19Updated last year
- ☆31Updated last year
- one-shot-tuner☆8Updated last year
- Distributed Multi-GPU GNN Framework☆35Updated 4 years ago
- GNNear: Accelerating Full-Batch Training of Graph Neural Networks with Near-Memory Processing☆11Updated 2 years ago
- [HPCA 2022] GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design☆32Updated 2 years ago
- ☆17Updated last year
- ☆9Updated 6 months ago
- ☆71Updated 3 years ago
- ☆45Updated last year
- ☆20Updated last year
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 …☆19Updated 3 years ago
- PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl…☆15Updated last year
- The source code for GPGPUSim+Ramulator simulator. In this version, GPGPUSim uses Ramulator to simulate the DRAM. This simulator is used t…☆45Updated 4 years ago
- ☆14Updated 2 years ago