VictorZuanazzi / AdaptBatch
Basic code for adaptive batch in pytorch
☆18Updated 5 years ago
Alternatives and similar repositories for AdaptBatch:
Users that are interested in AdaptBatch are comparing it to the libraries listed below
- Structured matrices for compressing neural networks☆66Updated last year
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆101Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47Updated last year
- Notes from NeurIPS 2019☆29Updated 5 years ago
- High performance pytorch modules☆18Updated 2 years ago
- PyTorch DataLoader processed in multiple remote computation machines for heavy data processings☆67Updated 5 years ago
- ☆24Updated 10 months ago
- Code for SegTree Transformer (ICLR-RLGM 2019).☆27Updated 5 years ago
- Python way to Read/Write TFRecords☆64Updated 6 years ago
- [JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion☆40Updated 4 years ago
- Compression of NMT transformer model with tensor methods☆48Updated 5 years ago
- [ICLR 2019] Learning Representations of Sets through Optimized Permutations☆36Updated 5 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆73Updated 2 years ago
- ☆14Updated 5 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆59Updated 4 years ago
- SparseMax activation function implementation (ICML 2016) (PyTorch)☆27Updated 7 years ago
- Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"☆88Updated 4 years ago
- ☆64Updated 4 years ago
- ☆47Updated 4 years ago
- Code for the paper PermuteFormer☆42Updated 3 years ago
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆32Updated 9 months ago
- PyTorch implementation of HashedNets☆36Updated last year
- Repo for the work on hierarchical state space models for disentanglement☆21Updated 4 years ago
- Implementation of deep implicit attention in PyTorch☆65Updated 3 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆48Updated 6 years ago
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- A discrete sequential VAE☆39Updated 4 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆45Updated 5 years ago
- MTAdam: Automatic Balancing of Multiple Training Loss Terms☆36Updated 4 years ago
- Implements the SM3-II adaptive optimization algorithm for PyTorch.☆33Updated 6 months ago