lucidrains/learning-to-expire-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/learning-to-expire-pytorch)

lucidrains / learning-to-expire-pytorch

An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain

☆34

Alternatives and similar repositories for learning-to-expire-pytorch

Users that are interested in learning-to-expire-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / memory-transformer-xl
View on GitHub
A variant of Transformer-XL where the memory is updated not with a queue, but with attention
☆49Jul 31, 2020Updated 5 years ago
lucidrains / memformer
View on GitHub
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
☆126Nov 13, 2020Updated 5 years ago
lucidrains / compressive-transformer-pytorch
View on GitHub
Pytorch implementation of Compressive Transformers, from Deepmind
☆165Oct 4, 2021Updated 4 years ago
lucidrains / all-normalization-transformer
View on GitHub
A simple Transformer where the softmax has been replaced with normalization
☆20Sep 11, 2020Updated 5 years ago
cyk1337 / Highway-Transformer
View on GitHub
[ACL‘20] Highway Transformer: A Gated Transformer.
☆33Dec 5, 2021Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
lucidrains / rela-transformer
View on GitHub
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Apr 6, 2022Updated 4 years ago
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
lucidrains / insertion-deletion-ddpm
View on GitHub
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30May 31, 2022Updated 4 years ago
lucidrains / deep-linear-network
View on GitHub
A simple implementation of a deep linear Pytorch module
☆21Oct 16, 2020Updated 5 years ago
mcoavoux / mtg
View on GitHub
Statistical discontinuous constituent parsing
☆11Feb 15, 2018Updated 8 years ago
lucidrains / ESBN-pytorch
View on GitHub
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25Jan 6, 2021Updated 5 years ago
lucidrains / panoptic-transformer
View on GitHub
Another attempt at a long-context / efficient transformer by me
☆38Apr 11, 2022Updated 4 years ago
Noahs-ARK / RFA
View on GitHub
☆33Apr 12, 2021Updated 5 years ago
MultiPath / Efficient-Neural-Machine-Translation
View on GitHub
PhD thesis (updating) of Jiatao Gu from HKU
☆19Aug 10, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
timvieira / vocrf
View on GitHub
Variable-order CRFs with structure learning
☆17Aug 1, 2024Updated last year
lucidrains / adjacent-attention-network
View on GitHub
Graph neural network message passing reframed as a Transformer with local attention
☆70Dec 24, 2022Updated 3 years ago
junekihong / beam-span-parser
View on GitHub
A DP beam-search extension of Mitchell Stern's span-based neural constituency parser
☆11Aug 24, 2022Updated 3 years ago
harvardnlp / cascaded-generation
View on GitHub
Cascaded Text Generation with Markov Transformers
☆130Mar 20, 2023Updated 3 years ago
XiangLi1999 / PosteriorControl-NLG
View on GitHub
Posterior Control of Blackbox Generation
☆23May 2, 2020Updated 6 years ago
rishikksh20 / rectified-linear-attention
View on GitHub
Sparse Attention with Linear Units
☆20Apr 21, 2021Updated 5 years ago
LZhengisme / CODA
View on GitHub
Implementation of Cascaded Head-colliding Attention (ACL'2021)
☆11Sep 16, 2021Updated 4 years ago
ChiyuSONG / dynamics-of-instruction-tuning
View on GitHub
☆18Mar 3, 2025Updated last year
lucidrains / cross-transformers-pytorch
View on GitHub
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
☆54Mar 30, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lemon234071 / AdaLabel
View on GitHub
The code for paper "Diversifying Dialog Generation via Adaptive Label Smoothing" in ACL 2021.
☆26Jun 7, 2021Updated 5 years ago
lucidrains / mogrifier
View on GitHub
Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind
☆21Jun 9, 2024Updated 2 years ago
lucidrains / marge-pytorch
View on GitHub
Implementation of Marge, Pre-training via Paraphrasing, in Pytorch
☆76Jan 14, 2021Updated 5 years ago
lucidrains / g-mlp-gpt
View on GitHub
GPT, but made only out of MLPs
☆89May 25, 2021Updated 5 years ago
lucidrains / fast-transformer-pytorch
View on GitHub
Implementation of Fast Transformer in Pytorch
☆176Aug 26, 2021Updated 4 years ago
ClashLuke / PerfTorch
View on GitHub
High performance pytorch modules
☆18Jan 14, 2023Updated 3 years ago
nelson-liu / lexical-semantic-recognition
View on GitHub
☆18Jun 12, 2023Updated 3 years ago
notAI-tech / IndicASR
View on GitHub
Speeech Recognition for Indic languages.
☆13Apr 3, 2021Updated 5 years ago
tonyduan / snn
View on GitHub
Self-normalizing neural network implemented in PyTorch.
☆12Apr 3, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lucidrains / global-self-attention-network
View on GitHub
A Pytorch implementation of Global Self-Attention Network, a fully-attention backbone for vision tasks
☆94Nov 21, 2020Updated 5 years ago
lucidrains / tableformer-pytorch
View on GitHub
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39Mar 29, 2022Updated 4 years ago
xuanqing94 / FLOATER
View on GitHub
Learning to Encode Position for Transformer with Continuous Dynamical Model
☆60Aug 3, 2020Updated 5 years ago
frankxu2004 / knnlm-why
View on GitHub
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59Jan 12, 2023Updated 3 years ago
lucidrains / ResizeRight
View on GitHub
The correct way to resize images or tensors. For Numpy or Pytorch (differentiable).
☆18May 5, 2022Updated 4 years ago
lucidrains / routing-transformer
View on GitHub
Fully featured implementation of Routing Transformer
☆300Nov 6, 2021Updated 4 years ago
lucidrains / product-key-memory
View on GitHub
Standalone Product Key Memory module in Pytorch - for augmenting Transformer models
☆87Nov 1, 2025Updated 8 months ago