lucidrains/RETRO-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/RETRO-pytorch)

lucidrains / RETRO-pytorch

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

☆879

Alternatives and similar repositories for RETRO-pytorch

Users that are interested in RETRO-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Langboat / mengzi-retrieval-lm
View on GitHub
An experimental implementation of the retrieval-enhanced language model
☆74Dec 29, 2022Updated 3 years ago
lucidrains / memorizing-transformers-pytorch
View on GitHub
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …
☆646Jul 17, 2023Updated 3 years ago
lucidrains / PaLM-jax
View on GitHub
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆189Jun 24, 2022Updated 4 years ago
lucidrains / Mega-pytorch
View on GitHub
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
☆207Aug 26, 2023Updated 2 years ago
lucidrains / PaLM-pytorch
View on GitHub
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆825Nov 9, 2022Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
facebookresearch / atlas
View on GitHub
Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…
☆560Jul 2, 2026Updated 3 weeks ago
TobiasNorlund / retro
View on GitHub
Official repo to On the Generalization Ability of Retrieval-Enhanced Transformers
☆47Jun 4, 2024Updated 2 years ago
lucidrains / x-clip
View on GitHub
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆724Oct 16, 2023Updated 2 years ago
lucidrains / x-transformers
View on GitHub
A concise but complete full-attention transformer with a set of promising experimental features from various papers
☆5,928Updated this week
CarperAI / trlx
View on GitHub
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,752Jan 8, 2024Updated 2 years ago
tunib-ai / oslo
View on GitHub
OSLO: Open Source framework for Large-scale model Optimization
☆310Aug 25, 2022Updated 3 years ago
lucidrains / n-grammer-pytorch
View on GitHub
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81Dec 4, 2022Updated 3 years ago
facebookresearch / contriever
View on GitHub
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
☆780Apr 7, 2023Updated 3 years ago
urvashik / knnlm
View on GitHub
☆331Jun 7, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lucidrains / flamingo-pytorch
View on GitHub
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
☆1,267Oct 18, 2022Updated 3 years ago
lucidrains / panoptic-transformer
View on GitHub
Another attempt at a long-context / efficient transformer by me
☆38Apr 11, 2022Updated 4 years ago
EleutherAI / magiCARP
View on GitHub
One stop shop for all things carp
☆58Sep 9, 2022Updated 3 years ago
lucidrains / PaLM-rlhf-pytorch
View on GitHub
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
☆7,865Updated this week
lucidrains / h-transformer-1d
View on GitHub
Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning
☆167Feb 12, 2024Updated 2 years ago
lucidrains / mlp-gpt-jax
View on GitHub
A GPT, made only of MLPs, in Jax
☆59Jun 23, 2021Updated 5 years ago
criteo / autofaiss
View on GitHub
Automatically create Faiss knn indices with the most optimal similarity search parameters.
☆907Nov 4, 2025Updated 8 months ago
google-research / t5x
View on GitHub
☆2,978Jul 9, 2026Updated 2 weeks ago
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / SEAL
View on GitHub
Search Engines with Autoregressive Language models
☆296Apr 4, 2023Updated 3 years ago
allenai / RL4LMs
View on GitHub
A modular RL library to fine-tune language models to human preferences
☆2,393Mar 1, 2024Updated 2 years ago
r-three / t-few
View on GitHub
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
☆460Sep 6, 2023Updated 2 years ago
EleutherAI / gpt-neox
View on GitHub
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
☆7,448Jun 11, 2026Updated last month
lucidrains / nuwa-pytorch
View on GitHub
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
☆548Jan 17, 2023Updated 3 years ago
facebookresearch / FiD
View on GitHub
Fusion-in-Decoder
☆596Oct 4, 2023Updated 2 years ago
lucidrains / flash-cosine-sim-attention
View on GitHub
Implementation of fused cosine similarity attention in the same style as Flash Attention
☆220Feb 13, 2023Updated 3 years ago
castorini / pyserini
View on GitHub
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
☆2,107Jul 16, 2026Updated last week
lucidrains / einops-exts
View on GitHub
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
☆57Jan 5, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lucidrains / simple-hierarchical-transformer
View on GitHub
Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT
☆228Mar 25, 2026Updated 4 months ago
lucidrains / CoLT5-attention
View on GitHub
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
☆230Sep 6, 2024Updated last year
lucidrains / tranception-pytorch
View on GitHub
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆32Jun 19, 2022Updated 4 years ago
lucidrains / memory-editable-transformer
View on GitHub
My explorations into editing the knowledge and memories of an attention network
☆35Dec 8, 2022Updated 3 years ago
facebookresearch / metaseq
View on GitHub
Repo for external large-scale work
☆6,550Apr 27, 2024Updated 2 years ago
lucidrains / reformer-pytorch
View on GitHub
Reformer, the efficient Transformer, in Pytorch
☆2,191Jun 21, 2023Updated 3 years ago
jxhe / efficient-knnlm
View on GitHub
Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)
☆75Jan 20, 2022Updated 4 years ago