SauravMaheshkar / MLP-Mixer
☆10 · Updated this week
Related projects:
- A Python library for highly configurable transformers, easing model architecture search and experimentation. ☆50 · Updated 2 years ago
- Unofficial PyTorch implementation of https://arxiv.org/abs/2112.05682, giving attention a linear memory cost. ☆12 · Updated 2 years ago
- JAX implementation of Graph Attention Networks