apd10 / universal_memory_allocation
☆15Updated 3 years ago
Alternatives and similar repositories for universal_memory_allocation
Users who are interested in universal_memory_allocation are comparing it to the libraries listed below
- ☆14Updated 3 years ago
- A compressed alternative to matrix multiplication using state-of-the-art ROBE-Z compression☆9Updated last year
- Light-weight GPU kernel interface for graph operations☆15Updated 5 years ago
- [NeurIPS '22] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders of magnitude less data.☆23Updated 2 years ago
- Differentiable Product Quantization for End-to-End Embedding Compression.☆62Updated 2 years ago
- [KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems☆22Updated 2 years ago
- A study of the downstream instability of word embeddings☆12Updated 2 years ago
- ☆20Updated last year
- A Learnable LSH Framework for Efficient NN Training☆31Updated 3 years ago
- ☆14Updated 3 years ago
- Set of datasets for the deep learning recommendation model (DLRM).☆47Updated 2 years ago
- ☆12Updated 4 years ago
- Time-based Sequence Model for Personalization and Recommendation Systems☆49Updated 3 years ago
- ☆18Updated last week
- PyTorch implementation of HashedNets☆36Updated 2 years ago
- A "gym" style toolkit for building lightweight NAS systems.☆13Updated 3 years ago
- MLPruning, PyTorch, NLP, BERT, Structured Pruning☆20Updated 3 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- [ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models" (https://arxiv.org/abs/2205.10036)☆22Updated 2 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆17Updated 4 years ago
- ☆27Updated 5 years ago
- ☆27Updated last year
- [ICLR 2021] "UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems" by Jiayi Shen, Haotao Wang*, Shupeng Gui…☆39Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- [NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems☆29Updated 2 years ago
- Efficient LDA solution on GPUs.☆24Updated 6 years ago
- sigma-MoE layer☆19Updated last year
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 3 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆56Updated 3 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Updated 3 years ago