harvard-acc / DeepRecSys
http://vlsiarch.eecs.harvard.edu/research/recommendation/
☆134Updated 2 years ago
Alternatives and similar repositories for DeepRecSys:
Users that are interested in DeepRecSys are comparing it to the libraries listed below
- Set of datasets for the deep learning recommendation model (DLRM).☆45Updated 2 years ago
- Enabling pure data parallel training of DLRM via caching and prefetching☆17Updated 3 years ago
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022☆30Updated 7 months ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- ☆10Updated 3 years ago
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆44Updated last year
- Model-less Inference Serving☆88Updated last year
- ☆31Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆136Updated this week
- This is the (evolving) reading list for the seminar.☆57Updated 4 years ago
- This repo is to collect the state-of-the-art GNN hardware acceleration paper☆54Updated 3 years ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆60Updated 2 years ago
- ☆71Updated 3 years ago
- ☆43Updated last year
- ☆78Updated 2 years ago
- Research and development for optimizing transformers☆125Updated 4 years ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆105Updated last month
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …☆194Updated 2 years ago
- A schedule language for large model training☆145Updated 10 months ago
- Fine-grained GPU sharing primitives☆141Updated 5 years ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆142Updated 2 weeks ago
- ☆47Updated 2 years ago
- parser script to process pytorch autograd profiler result, convert json file to excel.☆14Updated 5 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆81Updated last year
- A tool for examining GPU scheduling behavior.☆81Updated 8 months ago
- ☆22Updated 5 years ago
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆38Updated 2 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆126Updated 2 years ago
- ☆40Updated 4 years ago
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆75Updated 4 years ago