madcato / forward-forward-pytorch

☆16

Related projects:

lucidrains / compositional-attention-pytorch
Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…
☆50 Updated 2 years ago
ColinQiyangLi / AdaCat
AdaCat
☆49 Updated 2 years ago
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆37 Updated 2 years ago
lucidrains / hyena-pytorch
☆23 Updated this week
lucidrains / poolformer
☆40 Updated this week
lucidrains / hourglass-transformer-pytorch
Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI
☆74 Updated 2 years ago
lucidrains / einops-exts
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
☆52 Updated last year
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆34 Updated last year
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆47 Updated 2 years ago
TomFrederik / grokking
Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'
☆38 Updated 2 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆55 Updated 3 years ago
lucidrains / cogvideo-pytorch
☆38 Updated this week
tmabraham / Trans-CycleGAN
A convolution-free, transformer-only version of the CycleGAN framework
☆32 Updated 2 years ago
ahennequ / pytorch-custom-mma
☆29 Updated last year
gregorbachmann / scaling_mlps
☆48 Updated 3 months ago
taskswithcode / sota_researchers_with_published_code
Researchers who published code, models (in some cases), and demo apps (in a few cases) along with their SOTA papers
☆10 Updated 11 months ago
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆32 Updated last year
lucidrains / discrete-key-value-bottleneck-pytorch
Implementation of Discrete Key / Value Bottleneck, in Pytorch
☆87 Updated last year
Newbeeer / Anytime-Auto-Regressive-Model
Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding"
☆23 Updated last year
lucidrains / rvq-vae-gpt
My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation
☆76 Updated last year
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆18 Updated last year
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆23 Updated 3 years ago
lucidrains / ponder-transformer
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆78 Updated 2 years ago
lucidrains / metaformer-gpt
Implementation of Metaformer, but in an autoregressive manner
☆22 Updated 2 years ago
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆43 Updated last year
codekansas / rwkv
RWKV model implementation
☆38 Updated last year
antofuller / configaformers
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆50 Updated 2 years ago
google-deepmind / brave
A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.
☆48 Updated 3 months ago
lucidrains / logavgexp-torch
Implementation of LogAvgExp for Pytorch
☆32 Updated 2 years ago