lucidrains/hamburger-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/hamburger-pytorch)

lucidrains / hamburger-pytorch

Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"

☆99

Alternatives and similar repositories for hamburger-pytorch

Users that are interested in hamburger-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

halcy / tpuddim
View on GitHub
☆22May 3, 2022Updated 4 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
lucidrains / rela-transformer
View on GitHub
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Apr 6, 2022Updated 4 years ago
lucidrains / ESBN-pytorch
View on GitHub
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25Jan 6, 2021Updated 5 years ago
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
lucidrains / lie-transformer-pytorch
View on GitHub
Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch
☆98Feb 19, 2021Updated 5 years ago
lucidrains / distilled-retriever-pytorch
View on GitHub
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32Dec 16, 2020Updated 5 years ago
lucidrains / AoA-pytorch
View on GitHub
A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering
☆43Nov 8, 2020Updated 5 years ago
lucidrains / remixer-pytorch
View on GitHub
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Sep 27, 2021Updated 4 years ago
antofuller / configaformers
View on GitHub
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆48Nov 30, 2021Updated 4 years ago
ZichenMiao / L3Net
View on GitHub
ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters
☆16Jan 14, 2021Updated 5 years ago
yechengxi / deconvolution
View on GitHub
☆182Feb 23, 2023Updated 3 years ago
calclavia / Performer-Pytorch
View on GitHub
Pytorch implementation of Performer from the paper "Rethinking Attention with Performers".
☆24Oct 5, 2020Updated 5 years ago
lucidrains / n-grammer-pytorch
View on GitHub
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81Dec 4, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rjbruin / flexconv
View on GitHub
Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…
☆116Nov 30, 2022Updated 3 years ago
Lifelong-ML / LASEM
View on GitHub
Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"
☆12Aug 17, 2021Updated 4 years ago
sayakpaul / NALU
View on GitHub
Neural Arithmetic Logic Units by Trask et al.
☆12Apr 10, 2019Updated 7 years ago
lucidrains / nystrom-attention
View on GitHub
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆145Mar 24, 2025Updated last year
inspire-group / robustness-via-transport
View on GitHub
☆12Sep 26, 2019Updated 6 years ago
lxtGH / BSSeg
View on GitHub
BoundarySqueeze: Image Segmentation as Boundary Squeezing
☆56Apr 9, 2022Updated 4 years ago
researchmm / SariGAN
View on GitHub
[NeurIPS'20] Learning Semantic-aware Normalization for Generative Adversarial Networks
☆52May 14, 2021Updated 5 years ago
EkdeepSLubana / BeyondBatchNorm
View on GitHub
Codebase for the paper "Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning"
☆17Jul 12, 2021Updated 5 years ago
lucidrains / hourglass-transformer-pytorch
View on GitHub
Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI
☆99Dec 31, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lucidrains / isab-pytorch
View on GitHub
An implementation of (Induced) Set Attention Block, from the Set Transformers paper
☆70Jun 8, 2026Updated last month
lucidrains / lambda-networks
View on GitHub
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
☆1,528Nov 18, 2020Updated 5 years ago
amazon-science / unified-ept
View on GitHub
A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021
☆31Oct 11, 2021Updated 4 years ago
kemaloksuz / aLRPLoss
View on GitHub
Official PyTorch Implementation of aLRP Loss [NeurIPS2020]
☆137Dec 17, 2020Updated 5 years ago
JunnYu / ChineseBert_pytorch
View on GitHub
huggingface ChineseBert Tokenizer
☆16Apr 16, 2022Updated 4 years ago
lovit / flask_api_tutorial
View on GitHub
Flask 로 API 를 만들기 위한 튜토리얼
☆10Jun 22, 2020Updated 6 years ago
taoyang1122 / MutualNet
View on GitHub
[ECCV'20 Oral] MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution
☆159Oct 4, 2022Updated 3 years ago
dmbernaal / Daedalus
View on GitHub
Deep Learning Research
☆16Nov 13, 2019Updated 6 years ago
bohanzhuang / Group-Net-semantic-segmentation
View on GitHub
Structured Binary Neural Networks for Image Recognition
☆16Oct 12, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sacmehta / delight
View on GitHub
DeLighT: Very Deep and Light-Weight Transformers
☆469Oct 16, 2020Updated 5 years ago
neuralchen / Bivolution
View on GitHub
Accepted by AAAI2022
☆21Apr 10, 2022Updated 4 years ago
lucidrains / omninet-pytorch
View on GitHub
Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch
☆59Mar 19, 2021Updated 5 years ago
JianqiangWan / Super-BPD
View on GitHub
Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation (CVPR 2020)
☆202Sep 25, 2020Updated 5 years ago
htoyryla / DALLE-pytorch
View on GitHub
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆59Jan 13, 2021Updated 5 years ago
rmlin / CoMHE
View on GitHub
Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.
☆24Jun 23, 2020Updated 6 years ago
timy90022 / DropLoss
View on GitHub
Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch
☆44Apr 14, 2021Updated 5 years ago