prajjwal1 / adaptive_transformerView external linksLinks
Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)
☆43Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for adaptive_transformer
Users that are interested in adaptive_transformer are comparing it to the libraries listed below
Sorting:
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Nov 30, 2021Updated 4 years ago
- ☆23Oct 20, 2020Updated 5 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- [AAAI 2026 Oral] Rethinking Irregular Time Series Forecasting: A Simple yet Effective Baseline☆25Feb 5, 2026Updated last week
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Apr 10, 2021Updated 4 years ago
- Sparse Transformer with limited attention span in PyTorch☆15Apr 4, 2021Updated 4 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Jul 9, 2020Updated 5 years ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 4 years ago
- Code for the paper PermuteFormer☆41Oct 10, 2021Updated 4 years ago
- ☆17Oct 7, 2022Updated 3 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- Hash-routed Networks☆20Nov 20, 2020Updated 5 years ago
- ☆19Apr 7, 2020Updated 5 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 2 years ago
- 📒Record some paper read notes☆20Jan 1, 2022Updated 4 years ago
- ☆24Jan 20, 2021Updated 5 years ago
- Paper and code for Gradient Descent: The Ultimate Optimizer☆24Oct 3, 2023Updated 2 years ago
- INSET: Sentence Infilling with Inter-sentential Transformer☆30Nov 21, 2020Updated 5 years ago
- An implementation of drophead regularization for pytorch transformers☆19Aug 24, 2021Updated 4 years ago
- Implementation of RealFormer using pytorch☆101Dec 27, 2020Updated 5 years ago
- ☆29Oct 18, 2022Updated 3 years ago
- Code repo for "Transformer on a Diet" paper☆31Jun 22, 2020Updated 5 years ago
- TBC☆28Nov 2, 2022Updated 3 years ago
- Multivariate time-series forecasting with LSTNET and soft-DTW loss☆30Jun 3, 2020Updated 5 years ago
- ☆32Oct 30, 2023Updated 2 years ago
- A Monte Carlo Tree Search AI for the game 2048☆30May 24, 2020Updated 5 years ago
- FLASHQuad_pytorch☆68Apr 1, 2022Updated 3 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆31Apr 9, 2019Updated 6 years ago
- Code for KGLM paper☆122Jul 25, 2024Updated last year
- Code for: "Neural Controlled Differential Equations for Online Prediction Tasks"☆39Oct 19, 2022Updated 3 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Jun 20, 2021Updated 4 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Dec 16, 2020Updated 5 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆126Nov 13, 2020Updated 5 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Jun 19, 2021Updated 4 years ago
- Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"☆32Apr 17, 2021Updated 4 years ago
- ☆32Mar 31, 2020Updated 5 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 5 years ago
- Apache Spark based framework for analysis A/B experiments☆15Nov 3, 2024Updated last year