daochenzha / neuroshard
[MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
☆16Updated last year
Related projects ⓘ
Alternatives and complementary repositories for neuroshard
- [KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems☆21Updated last year
- [NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems☆28Updated last year
- ☆72Updated 3 years ago
- Efficient Retrieval with Learned Similarities☆13Updated 3 months ago
- Largest realworld open-source graph dataset - Worked done under IBM-Illinois Discovery Accelerator Institute and Amazon Research Awards a…☆75Updated 2 months ago
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022☆29Updated last month
- ☆42Updated 4 months ago
- Set of datasets for the deep learning recommendation model (DLRM).☆40Updated last year
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆60Updated 2 years ago
- Distributed Deep Graph Learning Framework for Dynamic Graphs☆11Updated 7 months ago
- TensorRT LLM Benchmark Configuration☆11Updated 3 months ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆34Updated 8 months ago
- Modular and structured prompt caching for low-latency LLM inference☆65Updated this week
- ☆18Updated 2 years ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆14Updated 4 months ago
- ☆12Updated last year
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆14Updated last year
- A resilient distributed training framework☆85Updated 7 months ago
- PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".☆74Updated last year
- LLM Serving Performance Evaluation Harness☆54Updated 2 months ago
- Fast Parallel Probabilistic Graphical Model Learning and Inference [IPDPS'22, PPoPP'23, USENIX ATC'24]☆41Updated this week
- A GPU-accelerated graph learning library for PyTorch, facilitating the scaling of GNN training and inference.☆118Updated last week
- Modyn is a research-platform for training ML models on growing datasets.☆29Updated 3 weeks ago
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun…☆45Updated last year
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆105Updated 10 months ago
- WholeGraph - large scale Graph Neural Networks☆99Updated this week
- ☆44Updated 2 years ago
- ☆55Updated 5 months ago
- The official SALIENT system described in the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and P…☆38Updated last year
- ☆15Updated 2 years ago