daochenzha / dreamshard
[NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems
☆29 Updated 2 years ago
Alternatives and similar repositories for dreamshard
Users who are interested in dreamshard are comparing it to the libraries listed below.
- [KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems ☆22 Updated 2 years ago
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models ☆16 Updated 2 years ago
- Official implementation of our VQ-GNN paper (NeurIPS 2021) ☆38 Updated 3 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024) ☆33 Updated last year
- Hyperparameter tuning via uncertainty modeling ☆47 Updated last year
- PyTorch open-source library for the paper "AdaTT: Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations" ☆51 Updated 9 months ago
- [ICLR 2023] MLPInit: Embarrassingly Simple GNN Training Acceleration with MLP Initialization ☆77 Updated 2 years ago
- ☆23 Updated last year
- ☆21 Updated 3 years ago
- Repository for "GIST: Distributed training for large-scale graph convolutional networks" ☆15 Updated 2 years ago
- [NeurIPS '22] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders of magnitude less data. ☆23 Updated last year
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization" ☆16 Updated last year
- [IPDPS 2024] Adaptive neighbor sampling for temporal GNN ☆12 Updated 3 months ago
- ☆22 Updated 2 years ago
- Can GPT-4 Perform Neural Architecture Search? ☆87 Updated last year
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers ☆46 Updated 2 years ago
- NASRec: Weight Sharing Neural Architecture Search for Recommender Systems ☆30 Updated last year
- ☆18 Updated 3 years ago
- ICLR 2021 ☆48 Updated 4 years ago
- Using FlexAttention to compute attention with different masking patterns ☆43 Updated 8 months ago
- AutoLossGen: Automatic Loss Function Generation for Recommender Systems ☆22 Updated 3 years ago
- ☆29 Updated 2 years ago
- Contrastive Learning with Model Augmentation ☆17 Updated 2 years ago
- Linear Attention Sequence Parallelism (LASP) ☆83 Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated 11 months ago
- Repository for CPU Kernel Generation for LLM Inference ☆26 Updated last year
- ☆31 Updated 7 months ago
- Distributed Deep Graph Learning Framework for Dynamic Graphs ☆13 Updated last year
- The implementation of HyperND from the Nonlinear Feature Diffusion on Hypergraphs paper ☆13 Updated 3 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆80 Updated last year