daochenzha / dreamshard
[NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems
☆29Updated last year
Alternatives and similar repositories for dreamshard:
Users that are interested in dreamshard are comparing it to the libraries listed below
- [KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems☆21Updated last year
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models☆16Updated last year
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"☆15Updated 10 months ago
- [ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data.☆22Updated last year
- Hyperparameter tuning via uncertainty modeling☆46Updated 8 months ago
- Can GPT-4 Perform Neural Architecture Search?☆85Updated last year
- pytorch open-source library for the paper "AdaTT Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations"☆42Updated 5 months ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆32Updated 9 months ago
- Official implementation of our VQ-GNN paper (NeurIPS2021)☆37Updated 3 years ago
- ☆34Updated 2 months ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆42Updated 2 years ago
- ICLR 2021☆46Updated 3 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"☆17Updated 3 weeks ago
- ☆28Updated 2 years ago
- ☆25Updated last year
- Distributed Deep Graph Learning Framework for Dynamic Graphs☆11Updated 10 months ago
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆26Updated last month
- ☆19Updated 2 years ago
- ☆13Updated 7 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Linear Attention Sequence Parallelism (LASP)☆76Updated 7 months ago
- NASRec Weight Sharing Neural Architecture Search for Recommender Systems☆29Updated last year
- Towards LLM Empowered Recommendation via Tool Learning☆15Updated 8 months ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Updated last year
- ☆18Updated 2 years ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆28Updated 7 months ago
- The implementation for MLSys 2023 paper: "Cuttlefish: Low-rank Model Training without All The Tuning"☆43Updated last year
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆14Updated 2 years ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆77Updated 6 months ago
- ☆15Updated 2 years ago