Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition?"
☆99Jan 13, 2021Updated 5 years ago
Alternatives and similar repositories for hamburger-pytorch
Users that are interested in hamburger-pytorch are comparing it to the libraries listed below.
- ☆22May 3, 2022Updated 4 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Apr 6, 2022Updated 4 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Jan 6, 2021Updated 5 years ago
- Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch☆97Feb 19, 2021Updated 5 years ago
- A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering☆43Nov 8, 2020Updated 5 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Sep 27, 2021Updated 4 years ago
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆16Jan 14, 2021Updated 5 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆48Nov 30, 2021Updated 4 years ago
- ☆182Feb 23, 2023Updated 3 years ago
- Pytorch implementation of Performer from the paper "Rethinking Attention with Performers".☆24Oct 5, 2020Updated 5 years ago
- Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- ☆31Dec 20, 2022Updated 3 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Dec 4, 2022Updated 3 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- Neural Arithmetic Logic Units by Trask et al.☆12Apr 10, 2019Updated 7 years ago
- ☆12Sep 26, 2019Updated 6 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆145Mar 24, 2025Updated last year
- BoundarySqueeze: Image Segmentation as Boundary Squeezing☆56Apr 9, 2022Updated 4 years ago
- [NeurIPS'20] Learning Semantic-aware Normalization for Generative Adversarial Networks☆53May 14, 2021Updated 4 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Oct 30, 2021Updated 4 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆68Jan 10, 2023Updated 3 years ago
- Aims to be a next-generation, DL-based tool for phenotype prediction from genome mutations.☆19May 17, 2021Updated 4 years ago
- Codebase for the paper "Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning"☆17Jul 12, 2021Updated 4 years ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,526Nov 18, 2020Updated 5 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆99Dec 31, 2021Updated 4 years ago
- Official PyTorch Implementation of aLRP Loss [NeurIPS2020]☆137Dec 17, 2020Updated 5 years ago
- huggingface ChineseBert Tokenizer☆17Apr 16, 2022Updated 4 years ago
- Deep Learning Research☆16Nov 13, 2019Updated 6 years ago
- A tutorial for building an API with Flask☆10Jun 22, 2020Updated 5 years ago
- Structured Binary Neural Networks for Image Recognition☆16Oct 12, 2022Updated 3 years ago
- [ECCV'20 Oral] MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution☆159Oct 4, 2022Updated 3 years ago
- Accepted by AAAI2022☆21Apr 10, 2022Updated 4 years ago
- Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation (CVPR 2020)☆201Sep 25, 2020Updated 5 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆59Mar 19, 2021Updated 5 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆44Apr 14, 2021Updated 5 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Jun 23, 2020Updated 5 years ago
- Learning Saliency Propagation for Semi-supervised Instance Segmentation☆67Aug 18, 2020Updated 5 years ago