lucidrains/kronecker-attention-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/kronecker-attention-pytorch)

lucidrains / kronecker-attention-pytorch

Implementation of Kronecker Attention in Pytorch

☆20

Alternatives and similar repositories for kronecker-attention-pytorch

Users that are interested in kronecker-attention-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / local-attention-flax
View on GitHub
Local Attention - Flax module for Jax
☆22May 26, 2021Updated 5 years ago
inikishev / torchzero
View on GitHub
Modular optimization library for PyTorch (work-in-progress).
☆13Feb 4, 2026Updated 5 months ago
lucidrains / esbn-transformer
View on GitHub
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16Aug 3, 2021Updated 4 years ago
lucidrains / remixer-pytorch
View on GitHub
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Sep 27, 2021Updated 4 years ago
lucidrains / cross-transformers-pytorch
View on GitHub
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
☆54Mar 30, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
lucidrains / g-mlp-gpt
View on GitHub
GPT, but made only out of MLPs
☆89May 25, 2021Updated 5 years ago
pingxue-hfut / sd-bnn
View on GitHub
Self-Distribution BNN
☆10Mar 8, 2022Updated 4 years ago
lucidrains / holodeck-pytorch
View on GitHub
Implementation of a holodeck, written in Pytorch
☆19Nov 1, 2023Updated 2 years ago
lucidrains / blackbox-gradient-sensing
View on GitHub
Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…
☆20Apr 17, 2026Updated 3 months ago
lucidrains / metaformer-gpt
View on GitHub
Implementation of Metaformer, but in an autoregressive manner
☆26Jun 21, 2022Updated 4 years ago
lucidrains / transganformer
View on GitHub
Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper
☆155Apr 27, 2021Updated 5 years ago
lucidrains / frame-averaging-pytorch
View on GitHub
Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network
☆52Jul 26, 2024Updated 2 years ago
lucidrains / all-normalization-transformer
View on GitHub
A simple Transformer where the softmax has been replaced with normalization
☆20Sep 11, 2020Updated 5 years ago
lucidrains / coco-lm-pytorch
View on GitHub
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46Mar 3, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kkahatapitiya / LinearConv
View on GitHub
Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"
☆11Jan 6, 2021Updated 5 years ago
lucidrains / insertion-deletion-ddpm
View on GitHub
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30May 31, 2022Updated 4 years ago
lucidrains / distilled-retriever-pytorch
View on GitHub
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32Dec 16, 2020Updated 5 years ago
EEEGUI / Mapillary-vistas-semseg
View on GitHub
Train ICNet on Mapillary-vistas-dataset
☆18Apr 22, 2019Updated 7 years ago
lucidrains / mogrifier
View on GitHub
Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind
☆21Jun 9, 2024Updated 2 years ago
mariushobbhahn / LB_for_BNNs_official
View on GitHub
Official repository for the paper "Fast Predictive Uncertainty for Classification with Bayesian Deep Networks". Accepted at UAI 2022. htt…
☆13May 25, 2022Updated 4 years ago
lucidrains / STAM-pytorch
View on GitHub
Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification
☆133Apr 1, 2021Updated 5 years ago
lucidrains / isab-pytorch
View on GitHub
An implementation of (Induced) Set Attention Block, from the Set Transformers paper
☆70Jun 8, 2026Updated last month
sdmhans / arxiv_dataset_extraction
View on GitHub
A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv
☆15Dec 7, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lucidrains / triangle-multiplicative-module
View on GitHub
Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …
☆39Aug 3, 2021Updated 4 years ago
yu-changqian / CondNet
View on GitHub
☆31Dec 20, 2022Updated 3 years ago
romebert / RomeBERT
View on GitHub
☆16May 6, 2021Updated 5 years ago
yu-changqian / RepGraph
View on GitHub
Representative Graph Neural Network
☆35Aug 12, 2020Updated 5 years ago
yukang2017 / NAS-quantization
View on GitHub
The code for Joint Neural Architecture Search and Quantization
☆14Apr 10, 2019Updated 7 years ago
ppwwyyxx / FRN-on-common-ImageNet-baseline
View on GitHub
Filter Response Normalization tested on better ImageNet baselines.
☆35Mar 28, 2020Updated 6 years ago
lucidrains / logavgexp-torch
View on GitHub
Implementation of LogAvgExp for Pytorch
☆37Apr 10, 2025Updated last year
lucidrains / nystrom-attention
View on GitHub
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆145Mar 24, 2025Updated last year
lucidrains / transformer-lm-gan
View on GitHub
Explorations into adversarial losses on top of autoregressive loss for language modeling
☆41Dec 21, 2025Updated 7 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
BayesWatch / pytorch-blockswap
View on GitHub
Code for BlockSwap (ICLR 2020).
☆33Mar 25, 2021Updated 5 years ago
lucidrains / adjacent-attention-network
View on GitHub
Graph neural network message passing reframed as a Transformer with local attention
☆70Dec 24, 2022Updated 3 years ago
openseg-group / RankSeg
View on GitHub
[ECCV2022] This is an official implementation of paper "RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentati…
☆78Feb 12, 2023Updated 3 years ago
tgisaturday / dalle-lightning
View on GitHub
Refactoring dalle-pytorch and taming-transformers for TPU VM
☆60Aug 30, 2021Updated 4 years ago
shyam671 / Twin_Auxiliary_Classifier_GAN
View on GitHub
Twin Auxiliary Classifiers GAN (NeurIPS 2019) [Spotlight]
☆15Sep 19, 2019Updated 6 years ago
universome / firelab
View on GitHub
Experimental framework for running pytorch experiments
☆14Mar 6, 2023Updated 3 years ago
lonePatient / EvoNorms_PyTorch
View on GitHub
Evolving Normalization-Activation Layers
☆19Apr 10, 2020Updated 6 years ago