daochenzha / dreamshard
[NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems
☆29Updated 2 years ago
Alternatives and similar repositories for dreamshard:
Users that are interested in dreamshard are comparing it to the libraries listed below
- [KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems☆21Updated 2 years ago
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models☆16Updated 2 years ago
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"☆16Updated last year
- [ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data.☆23Updated last year
- ☆31Updated 6 months ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆33Updated last year
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆30Updated 10 months ago
- Can GPT-4 Perform Neural Architecture Search?☆87Updated last year
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆14Updated 2 years ago
- AutoLossGen: Automatic Loss Function Generation for Recommender Systems☆22Updated 3 years ago
- Using FlexAttention to compute attention with different masking patterns☆43Updated 7 months ago
- ☆13Updated 2 years ago
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Updated 2 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆46Updated 2 years ago
- ☆20Updated 2 months ago
- ☆29Updated 2 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆23Updated 2 months ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Updated 4 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 7 months ago
- Implementation of Hyena Hierarchy in JAX☆10Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- Efficient Scaling laws and collaborative pretraining.☆16Updated 3 months ago
- ☆23Updated 7 months ago
- Code for the PAPA paper☆27Updated 2 years ago
- Official implementation of our VQ-GNN paper (NeurIPS2021)☆38Updated 3 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆80Updated last year
- ☆20Updated last year
- ☆53Updated 9 months ago