rish-16/aft-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rish-16/aft-pytorch)

rish-16 / aft-pytorch

Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

☆246

Alternatives and similar repositories for aft-pytorch

Users that are interested in aft-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PAL-ML / PEARL_v1
View on GitHub
☆30Jan 17, 2022Updated 4 years ago
mlelarge / graph_neural_net
View on GitHub
Expressive Power of Invariant and Equivariant Graph Neural Networks (ICLR 2021)
☆43Aug 25, 2023Updated 2 years ago
facebookresearch / transformer-sequential
View on GitHub
Trains Transformer model variants. Data isn't shuffled between batches.
☆147Oct 5, 2022Updated 3 years ago
NVIDIA / transformer-ls
View on GitHub
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
☆228Apr 18, 2022Updated 4 years ago
facebookresearch / xcit
View on GitHub
Official code Cross-Covariance Image Transformer (XCiT)
☆681Sep 28, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
anguyen8 / sam
View on GitHub
Code for the CVPR 2020 [ORAL] paper "SAM: The Sensitivity of Attribution Methods to Hyperparameters"
☆29Dec 8, 2022Updated 3 years ago
nttcslab / Generalized-Domain-Adaptation
View on GitHub
☆12Jun 18, 2021Updated 5 years ago
supsi-dacd-isaac / mbtr
View on GitHub
Multivariate Boosted TRee
☆62Oct 3, 2022Updated 3 years ago
automl / nes
View on GitHub
Neural Ensemble Search for Uncertainty Estimation and Dataset Shift
☆35Jan 10, 2026Updated 6 months ago
mlpen / Nystromformer
View on GitHub
☆391Oct 18, 2023Updated 2 years ago
quinte22 / bumblebee
View on GitHub
bumble bee transformer
☆14Apr 19, 2021Updated 5 years ago
iKernels / transformers-lightning
View on GitHub
A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…
☆47May 29, 2023Updated 3 years ago
chirag-agarwall / VOG
View on GitHub
Estimating Example Difficulty using Variance of Gradients
☆66Jan 10, 2023Updated 3 years ago
michaelsdr / momentumnet
View on GitHub
Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities
☆206Apr 24, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lucidrains / mlp-gpt-jax
View on GitHub
A GPT, made only of MLPs, in Jax
☆59Jun 23, 2021Updated 5 years ago
hadarser / ProvablyPowerfulGraphNetworks_torch
View on GitHub
☆42May 20, 2020Updated 6 years ago
bhavsarpratik / semantic-search
View on GitHub
[WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…
☆15Apr 21, 2023Updated 3 years ago
s-nlp / annotated-transformer
View on GitHub
http://nlp.seas.harvard.edu/2018/04/03/attention.html
☆63May 20, 2021Updated 5 years ago
lucidrains / fast-transformer-pytorch
View on GitHub
Implementation of Fast Transformer in Pytorch
☆176Aug 26, 2021Updated 4 years ago
pykale / pykale
View on GitHub
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥…
☆486Updated this week
OATML / non-parametric-transformers
View on GitHub
Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"
☆418Mar 21, 2024Updated 2 years ago
timy90022 / DropLoss
View on GitHub
Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch
☆44Apr 14, 2021Updated 5 years ago
revsic / torch-diffusion-wavegan
View on GitHub
Parallel waveform generation with DiffusionGAN
☆17Mar 26, 2022Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
huggingface / pytorch_block_sparse
View on GitHub
Fast Block Sparse Matrices for Pytorch
☆551Jan 21, 2021Updated 5 years ago
rish-16 / involution_pytorch
View on GitHub
Unofficial PyTorch implementation of the Involution layer from CVPR 2021
☆45Dec 8, 2025Updated 7 months ago
Enealor / PyTorch-SM3
View on GitHub
Implements the SM3-II adaptive optimization algorithm for PyTorch.
☆33Sep 3, 2024Updated last year
g-benton / loss-surface-simplexes
View on GitHub
☆100Dec 8, 2021Updated 4 years ago
LIJUNYI95 / SuperAdam
View on GitHub
Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'
☆17Jan 12, 2022Updated 4 years ago
lsj2408 / GraphNorm
View on GitHub
[ICML 2021] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training (official implementation)
☆106Dec 19, 2022Updated 3 years ago
donutloop / machine-learning-research-papers
View on GitHub
Collection of machine learning research paper references
☆26Updated this week
linusericsson / ssl-invariances
View on GitHub
Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".
☆16Dec 7, 2021Updated 4 years ago
lucidrains / feedback-transformer-pytorch
View on GitHub
Implementation of Feedback Transformer in Pytorch
☆108Mar 2, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
nextml-code / pytorch-datastream
View on GitHub
Simple dataset to dataloader library for pytorch
☆32Jul 7, 2026Updated 2 weeks ago
kingoflolz / swarm-jax
View on GitHub
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆241May 12, 2023Updated 3 years ago
microsoft / fastseq
View on GitHub
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆433Aug 17, 2022Updated 3 years ago
ignaciohrdz / FLAT-O
View on GitHub
👁️ Facial Landmark Annotation Tool with OpenCV
☆12Apr 5, 2024Updated 2 years ago
rish-16 / grafog
View on GitHub
Graph Data Augmentation Library for PyTorch Geometric
☆133Aug 17, 2022Updated 3 years ago
lucidrains / long-short-transformer
View on GitHub
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
☆120Aug 4, 2021Updated 4 years ago
lucidrains / routing-transformer
View on GitHub
Fully featured implementation of Routing Transformer
☆300Nov 6, 2021Updated 4 years ago