Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.
☆246 · Feb 16, 2026 · Updated last month
Alternatives and similar repositories for aft-pytorch
Users that are interested in aft-pytorch are comparing it to the libraries listed below.
- ☆30 · Jan 17, 2022 · Updated 4 years ago
- GAN models implemented with Pytorch Lightning and Hydra configuration ☆33 · Jun 5, 2022 · Updated 3 years ago
- Trains Transformer model variants. Data isn't shuffled between batches. ☆143 · Oct 5, 2022 · Updated 3 years ago
- Expressive Power of Invariant and Equivariant Graph Neural Networks (ICLR 2021) ☆41 · Aug 25, 2023 · Updated 2 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021). ☆228 · Apr 18, 2022 · Updated 3 years ago
- Official code Cross-Covariance Image Transformer (XCiT) ☆674 · Sep 28, 2021 · Updated 4 years ago
- Code for the CVPR 2020 [ORAL] paper "SAM: The Sensitivity of Attribution Methods to Hyperparameters" ☆27 · Dec 8, 2022 · Updated 3 years ago
- ☆13 · Jun 18, 2021 · Updated 4 years ago
- Neural Ensemble Search for Uncertainty Estimation and Dataset Shift ☆33 · Jan 10, 2026 · Updated 2 months ago
- My implementation of DeepMind's Perceiver ☆63 · Apr 23, 2021 · Updated 4 years ago
- ☆389 · Oct 18, 2023 · Updated 2 years ago
- bumble bee transformer ☆14 · Apr 19, 2021 · Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor… ☆47 · May 29, 2023 · Updated 2 years ago
- Estimating Example Difficulty using Variance of Gradients ☆64 · Jan 10, 2023 · Updated 3 years ago
- ☆42 · May 20, 2020 · Updated 5 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities ☆205 · Apr 24, 2024 · Updated last year
- http://nlp.seas.harvard.edu/2018/04/03/attention.html ☆63 · May 20, 2021 · Updated 4 years ago
- Implementation of Fast Transformer in Pytorch ☆176 · Aug 26, 2021 · Updated 4 years ago
- Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning" ☆415 · Mar 21, 2024 · Updated 2 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch ☆42 · Apr 14, 2021 · Updated 4 years ago
- Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥 … ☆479 · Feb 23, 2026 · Updated last month
- Fast Block Sparse Matrices for Pytorch ☆550 · Jan 21, 2021 · Updated 5 years ago
- ☆100 · Dec 8, 2021 · Updated 4 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients' ☆17 · Jan 12, 2022 · Updated 4 years ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks". ☆16 · Dec 7, 2021 · Updated 4 years ago
- Collection of machine learning research paper references ☆26 · Feb 23, 2025 · Updated last year
- [ICML 2021] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training (official implementation) ☆106 · Dec 19, 2022 · Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 · Apr 21, 2023 · Updated 2 years ago
- Implements the SM3-II adaptive optimization algorithm for PyTorch. ☆33 · Sep 3, 2024 · Updated last year
- Simple dataset to dataloader library for pytorch ☆32 · Jan 3, 2025 · Updated last year
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes ☆242 · May 12, 2023 · Updated 2 years ago
- ☆46 · Apr 13, 2022 · Updated 3 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p… ☆433 · Aug 17, 2022 · Updated 3 years ago
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition ☆25 · Jul 18, 2023 · Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 · Jul 26, 2021 · Updated 4 years ago
- There and Back Again: Revisiting Backpropagation Saliency Methods (CVPR 2020) ☆53 · Apr 7, 2020 · Updated 5 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet ☆1,194 · Oct 27, 2023 · Updated 2 years ago
- GPT-style network for phonemization with durations of text ☆68 · Mar 21, 2024 · Updated 2 years ago
- Hopfield Networks is All You Need ☆1,907 · Apr 23, 2023 · Updated 2 years ago