lukemelas / simple-bert
A simple PyTorch implementation of BERT, complete with pretrained models and training scripts.
☆41Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for simple-bert
- An open source implementation of CLIP.☆32Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆59Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 2 years ago
- A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering☆40Updated 4 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆35Updated 3 years ago
- ☆24Updated 3 years ago
- Implementation of Kronecker Attention in Pytorch☆17Updated 4 years ago
- ☆15Updated 2 years ago
- Official PyTorch implementation of RIO☆18Updated 3 years ago
- ☆24Updated 3 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆14Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 2 years ago
- A simple implementation of a deep linear Pytorch module☆18Updated 4 years ago
- [ICLR 2021] Beyond Categorical Label Representations for Image Classification☆25Updated 2 years ago
- SimCLR pytorch implementation using DistributedDataParallel.☆24Updated last year
- A GPT, made only of MLPs, in Jax☆55Updated 3 years ago
- Re-implementation of local descriptor HardNet training in fasta2+kornia☆21Updated 4 years ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.☆48Updated 4 months ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- Large dataset storage format for Pytorch☆45Updated 3 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆33Updated 4 years ago
- ☆21Updated last year
- ☆13Updated 5 years ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 3 years ago
- GPT, but made only out of MLPs☆86Updated 3 years ago
- Website for TextVQA dataset.☆28Updated last year
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆73Updated last year
- ☆32Updated 2 years ago
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks☆59Updated last year