harvard-acc / DeepRecSys
http://vlsiarch.eecs.harvard.edu/research/recommendation/
☆135Updated 2 years ago
Alternatives and similar repositories for DeepRecSys:
Users that are interested in DeepRecSys are comparing it to the libraries listed below
- Set of datasets for the deep learning recommendation model (DLRM).☆45Updated 2 years ago
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022☆30Updated 7 months ago
- ☆10Updated 3 years ago
- Enabling pure data parallel training of DLRM via caching and prefetching☆17Updated 3 years ago
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆44Updated last year
- ☆71Updated 3 years ago
- ☆30Updated last year
- ☆79Updated 2 years ago
- Research and development for optimizing transformers☆126Updated 4 years ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆76Updated 4 years ago
- This repo is to collect the state-of-the-art GNN hardware acceleration paper☆54Updated 3 years ago
- A schedule language for large model training☆146Updated 10 months ago
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …☆194Updated 2 years ago
- ☆47Updated 2 years ago
- Model-less Inference Serving☆88Updated last year
- A tool for examining GPU scheduling behavior.☆81Updated 8 months ago
- ☆40Updated 4 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆131Updated 3 years ago
- ☆106Updated 3 years ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆144Updated this week
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆18Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆138Updated this week
- Distributed Multi-GPU GNN Framework☆37Updated 4 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆40Updated last year
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆113Updated 2 months ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆81Updated last year
- ☆51Updated 5 years ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Updated 2 years ago
- ☆18Updated 4 years ago