daochenzha / dreamshardLinks
[NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems
☆29Updated 2 years ago
Alternatives and similar repositories for dreamshard
Users that are interested in dreamshard are comparing it to the libraries listed below
Sorting:
- [KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems☆22Updated 2 years ago
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models☆16Updated 2 years ago
- AutoLossGen: Automatic Loss Function Generation for Recommender Systems☆22Updated 3 years ago
- [ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data.☆23Updated 2 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆47Updated 2 years ago
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"☆16Updated last year
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆34Updated last year
- Hyperparameter tuning via uncertainty modeling☆47Updated last year
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Updated 2 years ago
- Can GPT-4 Perform Neural Architecture Search?☆87Updated last year
- Official implementation of our VQ-GNN paper (NeurIPS2021)☆38Updated 3 years ago
- ☆13Updated 2 years ago
- Using FlexAttention to compute attention with different masking patterns☆44Updated 9 months ago
- pytorch open-source library for the paper "AdaTT Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations"☆52Updated 10 months ago
- ☆12Updated last year
- Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral)☆43Updated 2 months ago
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆15Updated 2 years ago
- ☆36Updated last week
- ICLR 2021☆48Updated 4 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- Graph Transformers for Large Graphs☆21Updated last year
- [WSDM 2024] Official PyTorch Implementation of Linear Recurrent Units for Sequential Recommendation (LRURec)☆59Updated 4 months ago
- ☆31Updated 8 months ago
- ☆21Updated 3 years ago
- ☆18Updated last week
- ThinK: Thinner Key Cache by Query-Driven Pruning☆20Updated 4 months ago
- ☆15Updated 3 years ago
- [KDD 2021, Research Track] DiffMG: Differentiable Meta Graph Search for Heterogeneous Graph Neural Networks☆30Updated 3 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 9 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month