lucidrains / AGI-pytorch
☆24, updated this week
Related projects:
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper (☆78, updated 2 years ago)
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI (☆74, updated 2 years ago)
- Another attempt at a long-context / efficient transformer by me (☆37, updated 2 years ago)
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch (☆94, updated last year)
- (☆40, updated this week)
- Implementation of Feedback Transformer in Pytorch (☆103, updated 3 years ago)
- (☆64, updated 2 years ago)
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch (☆90, updated 2 years ago)
- Refactoring dalle-pytorch and taming-transformers for TPU VM (☆60, updated 3 years ago)
- Implementation of LogAvgExp for Pytorch (☆32, updated 2 years ago)
- JAX implementation ViT-VQGAN (☆77, updated 2 years ago)
- Contrastive Language-Audio Pretraining (☆87, updated 2 years ago)
- (☆38, updated this week)
- HomebrewNLP in JAX flavour for maintainable TPU-Training (☆46, updated 8 months ago)
- Simple python template (☆40, updated 4 months ago)
- gpu tester detects broken and slow gpus in a cluster (☆63, updated last year)
- CLOOB training (JAX) and inference (JAX and PyTorch) (☆70, updated 2 years ago)
- Pedagogical codebase for a simplified score-based generative model design, with training loop (☆39, updated 3 years ago)
- GPT, but made only out of MLPs (☆86, updated 3 years ago)
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️ (☆52, updated last year)
- Contrastive Language-Image Pretraining (☆140, updated 2 years ago)
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s… (☆66, updated last year)
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… (☆50, updated 2 years ago)
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch (☆64, updated 2 years ago)
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch (☆86, updated last year)
- (☆33, updated last year)
- (☆85, updated this week)
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation (☆76, updated last year)
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload (☆123, updated 2 years ago)