lucidrains/g-mlp-gpt

GPT, but made only out of MLPs

☆89

Alternatives and similar repositories for g-mlp-gpt

Users that are interested in g-mlp-gpt are comparing it to the libraries listed below.

lucidrains / esbn-transformer
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16 | Aug 3, 2021 | Updated 4 years ago
lucidrains / long-short-transformer
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
☆120 | Aug 4, 2021 | Updated 4 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49 | Jan 27, 2022 | Updated 4 years ago
lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆279 | Apr 5, 2022 | Updated 4 years ago
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆38 | Apr 11, 2022 | Updated 4 years ago
lucidrains / g-mlp-pytorch
Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch
☆431 | Aug 14, 2021 | Updated 4 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆59 | Jun 23, 2021 | Updated 5 years ago
AranKomat / Metroplex
☆21 | Mar 15, 2023 | Updated 3 years ago
lucidrains / kronecker-attention-pytorch
Implementation of Kronecker Attention in Pytorch
☆20 | Sep 12, 2020 | Updated 5 years ago
lucidrains / multistream-transformers
Implementation of Multistream Transformers in Pytorch
☆54 | Jul 31, 2021 | Updated 4 years ago
EleutherAI / pyfra
Python Research Framework
☆107 | Nov 3, 2022 | Updated 3 years ago
lucidrains / HTM-pytorch
Implementation of Hierarchical Transformer Memory (HTM) for Pytorch
☆74 | Sep 15, 2021 | Updated 4 years ago
lucidrains / ponder-transformer
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆84 | Oct 30, 2021 | Updated 4 years ago
lucidrains / fast-transformer-pytorch
Implementation of Fast Transformer in Pytorch
☆176 | Aug 26, 2021 | Updated 4 years ago
lucidrains / n-grammer-pytorch
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81 | Dec 4, 2022 | Updated 3 years ago
wenkokke / dep2con
several algorithms for converting dependency structures into constituency structures.
☆10 | Feb 7, 2022 | Updated 4 years ago
lucidrains / local-attention-flax
Local Attention - Flax module for Jax
☆22 | May 26, 2021 | Updated 5 years ago
antofuller / configaformers
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆48 | Nov 30, 2021 | Updated 4 years ago
lucidrains / charformer-pytorch
Implementation of the GBST block from the Charformer paper, in Pytorch
☆118 | Jul 15, 2021 | Updated 5 years ago
lucidrains / transganformer
Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper
☆155 | Apr 27, 2021 | Updated 5 years ago
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20 | Oct 9, 2022 | Updated 3 years ago
lucidrains / triangle-multiplicative-module
Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …
☆39 | Aug 3, 2021 | Updated 4 years ago
frahik / BMTME
Bayesian Multi-Trait Multi-Environment for genomic selection [R package] [Dev version]
☆19 | Aug 6, 2023 | Updated 2 years ago
lucidrains / coordinate-descent-attention
Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk
☆47 | Jul 16, 2023 | Updated 3 years ago
lucidrains / metaformer-gpt
Implementation of Metaformer, but in an autoregressive manner
☆26 | Jun 21, 2022 | Updated 4 years ago
lucidrains / En-transformer
Implementation of E(n)-Transformer, which incorporates attention mechanisms into Welling's E(n)-Equivariant Graph Neural Network
☆226 | Jun 2, 2024 | Updated 2 years ago
lucidrains / coco-lm-pytorch
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46 | Mar 3, 2021 | Updated 5 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49 | Apr 6, 2022 | Updated 4 years ago
warnikchow / kosp2e
Korean Speech to English Translation Corpus
☆45 | Sep 3, 2021 | Updated 4 years ago
lucidrains / lie-transformer-pytorch
Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch
☆98 | Feb 19, 2021 | Updated 5 years ago
lucidrains / se3-transformer-pytorch
Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch. This specific repository is geared towards integration wit…
☆331 | Aug 28, 2025 | Updated 10 months ago
tunib-ai / artwork_captions
Machine Generated Captions for Best Artworks
☆22 | Sep 21, 2022 | Updated 3 years ago
lucidrains / axial-positional-embedding
Axial Positional Embedding for Pytorch
☆84 | Feb 25, 2025 | Updated last year
rwightman / imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
☆23 | Jun 13, 2023 | Updated 3 years ago
lucidrains / marge-pytorch
Implementation of Marge, Pre-training via Paraphrasing, in Pytorch
☆76 | Jan 14, 2021 | Updated 5 years ago
htoyryla / DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆59 | Jan 13, 2021 | Updated 5 years ago
lucidrains / h-transformer-1d
Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning
☆167 | Feb 12, 2024 | Updated 2 years ago
lucidrains / holodeck-pytorch
Implementation of a holodeck, written in Pytorch
☆19 | Nov 1, 2023 | Updated 2 years ago
lucidrains / learning-to-expire-pytorch
An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain
☆34 | Oct 30, 2020 | Updated 5 years ago