lucidrains/local-attention-flax

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/local-attention-flax)

lucidrains / local-attention-flax

Local Attention - Flax module for Jax

☆22

Alternatives and similar repositories for local-attention-flax

Users that are interested in local-attention-flax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / cross-transformers-pytorch
View on GitHub
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
☆54Mar 30, 2021Updated 5 years ago
lucidrains / metaformer-gpt
View on GitHub
Implementation of Metaformer, but in an autoregressive manner
☆26Jun 21, 2022Updated 4 years ago
d-li14 / dot-product-attention
View on GitHub
A collection of self-attention modules and pre-trained backbones
☆13Nov 28, 2020Updated 5 years ago
lucidrains / kronecker-attention-pytorch
View on GitHub
Implementation of Kronecker Attention in Pytorch
☆20Sep 12, 2020Updated 5 years ago
lucidrains / tranception-pytorch
View on GitHub
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆32Jun 19, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
lucidrains / all-normalization-transformer
View on GitHub
A simple Transformer where the softmax has been replaced with normalization
☆20Sep 11, 2020Updated 5 years ago
lucidrains / multistream-transformers
View on GitHub
Implementation of Multistream Transformers in Pytorch
☆54Jul 31, 2021Updated 4 years ago
conceptofmind / vit-flax
View on GitHub
Implementation of numerous Vision Transformers in Google's JAX and Flax.
☆22Aug 30, 2022Updated 3 years ago
lucidrains / rela-transformer
View on GitHub
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Apr 6, 2022Updated 4 years ago
lucidrains / coco-lm-pytorch
View on GitHub
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46Mar 3, 2021Updated 5 years ago
songys / 2021Langcon
View on GitHub
☆11Oct 3, 2021Updated 4 years ago
lucidrains / compositional-attention-pytorch
View on GitHub
Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…
☆51May 10, 2022Updated 4 years ago
lucidrains / evolutionary-design-molecules
View on GitHub
Implementation of the algorithm detailed in paper "Evolutionary design of molecules based on deep learning and a genetic algorithm"
☆24Dec 15, 2023Updated 2 years ago
JoungheeKim / kor-spacing
View on GitHub
This is project for korean auto spacing
☆12Aug 3, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
baikalai / baikal-bert
View on GitHub
baikal.ai's pre-trained BERT models: descriptions and sample codes
☆12Jun 24, 2021Updated 5 years ago
lucidrains / NWT-pytorch
View on GitHub
Implementation of NWT, audio-to-video generation, in Pytorch
☆92Mar 17, 2022Updated 4 years ago
lucidrains / HTM-pytorch
View on GitHub
Implementation of Hierarchical Transformer Memory (HTM) for Pytorch
☆74Sep 15, 2021Updated 4 years ago
detail-novelist / novelist-triton-server
View on GitHub
Deploy KoGPT with Triton Inference Server
☆14Nov 18, 2022Updated 3 years ago
n2cholas / progan-flax
View on GitHub
Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation
☆12May 24, 2021Updated 5 years ago
lucidrains / quartic-transformer
View on GitHub
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆56Mar 25, 2025Updated last year
lucidrains / holodeck-pytorch
View on GitHub
Implementation of a holodeck, written in Pytorch
☆19Nov 1, 2023Updated 2 years ago
lucidrains / transframer-pytorch
View on GitHub
Implementation of Transframer, Deepmind's U-net + Transformer architecture for up to 30 seconds video generation, in Pytorch
☆72Aug 23, 2022Updated 3 years ago
lucidrains / blackbox-gradient-sensing
View on GitHub
Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…
☆20Apr 17, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
lucidrains / einops-exts
View on GitHub
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
☆57Jan 5, 2023Updated 3 years ago
eslambakr / EMCA
View on GitHub
MSCA: Multi-Scale Channel Attention Module
☆16Nov 24, 2021Updated 4 years ago
jason9693 / FROZEN
View on GitHub
☆14May 3, 2022Updated 4 years ago
lucidrains / geometric-vector-perceptron
View on GitHub
Implementation of Geometric Vector Perceptron, a simple circuit for 3d rotation equivariance for learning over large biomolecules, in Pyt…
☆77Jun 8, 2021Updated 5 years ago
lucidrains / axial-positional-embedding
View on GitHub
Axial Positional Embedding for Pytorch
☆84Feb 25, 2025Updated last year
lucidrains / jax2torch
View on GitHub
Use Jax functions in Pytorch
☆263Jul 1, 2023Updated 3 years ago
lucidrains / tableformer-pytorch
View on GitHub
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39Mar 29, 2022Updated 4 years ago
lucidrains / discrete-key-value-bottleneck-pytorch
View on GitHub
Implementation of Discrete Key / Value Bottleneck, in Pytorch
☆88Jul 9, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
lucidrains / insertion-deletion-ddpm
View on GitHub
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30May 31, 2022Updated 4 years ago
lucidrains / transganformer
View on GitHub
Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper
☆155Apr 27, 2021Updated 5 years ago
lucidrains / fast-transformer-pytorch
View on GitHub
Implementation of Fast Transformer in Pytorch
☆176Aug 26, 2021Updated 4 years ago
lucidrains / molecule-attention-transformer
View on GitHub
Pytorch reimplementation of Molecule Attention Transformer, which uses a transformer to tackle the graph-like structure of molecules
☆58Dec 2, 2020Updated 5 years ago
phillip0726 / NaverBlog-Twitter-Youtube-crawling
View on GitHub
We can crawl NaverBlog, Twitter, Youtube!!
☆13Sep 13, 2019Updated 6 years ago
leichenNUSJ / AAMandDCM
View on GitHub
This project is to implement “Attention-Adaptive and Deformable Convolutional Modules for Dynamic Scene Deblurring(with ERCNN)” . To…
☆16Jul 20, 2020Updated 6 years ago
lucidrains / nystrom-attention
View on GitHub
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆145Mar 24, 2025Updated last year