facebookresearch / FBTT-EmbeddingLinks
This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation …
☆194Updated 3 years ago
Alternatives and similar repositories for FBTT-Embedding
Users that are interested in FBTT-Embedding are comparing it to the libraries listed below
Sorting:
- Simple Distributed Deep Learning on TensorFlow☆134Updated last week
- Research and development for optimizing transformers☆131Updated 4 years ago
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆410Updated 7 months ago
- A tensor-aware point-to-point communication primitive for machine learning☆283Updated last month
- Slicing a PyTorch Tensor Into Parallel Shards☆300Updated 8 months ago
- Block-sparse primitives for PyTorch☆158Updated 4 years ago
- http://vlsiarch.eecs.harvard.edu/research/recommendation/☆134Updated 3 years ago
- PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.☆206Updated 7 years ago
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆57Updated 2 years ago
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆46Updated 2 years ago
- A library for syntactically rewriting Python programs, pronounced (sinner).☆67Updated 3 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Updated 3 years ago
- Distributed ML Optimizer☆35Updated 4 years ago
- PyTorch implementation of L2L execution algorithm☆109Updated 3 years ago
- Fast Block Sparse Matrices for Pytorch☆550Updated 5 years ago
- Stride visualizations☆38Updated 7 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆132Updated 3 years ago
- Torch Distributed Experimental☆117Updated last year
- Differentiable Product Quantization for End-to-End Embedding Compression.☆64Updated 3 years ago
- A deep ranking personalization framework☆133Updated last month
- Training material for IPU users: tutorials, feature examples, simple applications☆88Updated 2 years ago
- DLPack for Tensorflow☆35Updated 5 years ago
- ☆252Updated last year
- A GPU performance profiling tool for PyTorch models☆510Updated 4 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆29Updated 4 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆164Updated last month
- Time-based Sequence Model for Personalization and Recommendation Systems☆49Updated 4 years ago
- This is the (evolving) reading list for the seminar.☆61Updated 5 years ago
- A library for building and serving multi-node distributed faiss indices.☆276Updated 2 years ago
- Fast sparse deep learning on CPUs☆56Updated 3 years ago