lucidrains/mlp-gpt-jax

lucidrains / mlp-gpt-jax

A GPT, made only of MLPs, in Jax

☆59

Alternatives and similar repositories for mlp-gpt-jax

Users who are interested in mlp-gpt-jax are comparing it to the libraries listed below.

AranKomat / Metroplex
☆21 · Mar 15, 2023 · Updated 3 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49 · Jan 27, 2022 · Updated 4 years ago
lucidrains / local-attention-flax
Local Attention - Flax module for Jax
☆22 · May 26, 2021 · Updated 5 years ago
sayakpaul / BiT-jax2tf
This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.
☆14 · Dec 21, 2021 · Updated 4 years ago
kingoflolz / CLIP_JAX
Contrastive Language-Image Pretraining
☆147 · Sep 6, 2022 · Updated 3 years ago
lucidrains / deep-linear-network
A simple implementation of a deep linear Pytorch module
☆21 · Oct 16, 2020 · Updated 5 years ago
lucidrains / g-mlp-gpt
GPT, but made only out of MLPs
☆89 · May 25, 2021 · Updated 5 years ago
lucidrains / all-normalization-transformer
A simple Transformer where the softmax has been replaced with normalization
☆20 · Sep 11, 2020 · Updated 5 years ago
lucidrains / triangle-multiplicative-module
Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …
☆39 · Aug 3, 2021 · Updated 4 years ago
lucidrains / geometric-vector-perceptron
Implementation of Geometric Vector Perceptron, a simple circuit for 3d rotation equivariance for learning over large biomolecules, in Pyt…
☆77 · Jun 8, 2021 · Updated 5 years ago
lucidrains / pi-GAN-pytorch
Implementation of π-GAN, for 3d-aware image synthesis, in Pytorch
☆125 · Feb 22, 2021 · Updated 5 years ago
lucidrains / long-short-transformer
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
☆120 · Aug 4, 2021 · Updated 4 years ago
lucidrains / esbn-transformer
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16 · Aug 3, 2021 · Updated 4 years ago
lucidrains / glom-pytorch
An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up proc…
☆196 · Mar 27, 2021 · Updated 5 years ago
lucidrains / CLAP
Contrastive Language-Audio Pretraining
☆15 · May 18, 2021 · Updated 5 years ago
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25 · Jan 6, 2021 · Updated 5 years ago
antofuller / configaformers
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆48 · Nov 30, 2021 · Updated 4 years ago
lucidrains / metaformer-gpt
Implementation of Metaformer, but in an autoregressive manner
☆26 · Jun 21, 2022 · Updated 4 years ago
lucidrains / n-grammer-pytorch
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81 · Dec 4, 2022 · Updated 3 years ago
AeroScripts / HiddenEngrams
Hidden Engrams: Long Term Memory for Transformer Model Inference
☆35 · Jun 26, 2021 · Updated 5 years ago
lucidrains / molecule-attention-transformer
Pytorch reimplementation of Molecule Attention Transformer, which uses a transformer to tackle the graph-like structure of molecules
☆58 · Dec 2, 2020 · Updated 5 years ago
kingoflolz / cc_img_dl
☆26 · Mar 13, 2021 · Updated 5 years ago
j-towns / vdvae-jax
Very deep VAEs in JAX/Flax
☆47 · Jun 16, 2021 · Updated 5 years ago
EleutherAI / pyfra
Python Research Framework
☆107 · Nov 3, 2022 · Updated 3 years ago
lucidrains / marge-pytorch
Implementation of Marge, Pre-training via Paraphrasing, in Pytorch
☆76 · Jan 14, 2021 · Updated 5 years ago
lucidrains / coco-lm-pytorch
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46 · Mar 3, 2021 · Updated 5 years ago
lucidrains / NWT-pytorch
Implementation of NWT, audio-to-video generation, in Pytorch
☆92 · Mar 17, 2022 · Updated 4 years ago
lucidrains / transformer-lm-gan
Explorations into adversarial losses on top of autoregressive loss for language modeling
☆41 · Dec 21, 2025 · Updated 7 months ago
lucidrains / nystrom-attention
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆145 · Mar 24, 2025 · Updated last year
JCBrouwer / maua-style
Neural style transfer
☆21 · Jul 29, 2021 · Updated 5 years ago
lucidrains / isab-pytorch
An implementation of (Induced) Set Attention Block, from the Set Transformers paper
☆70 · Jun 8, 2026 · Updated last month
sayakpaul / robustness-vit
Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).
☆122 · Dec 3, 2022 · Updated 3 years ago
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆38 · Apr 11, 2022 · Updated 4 years ago
lucidrains / discrete-key-value-bottleneck-pytorch
Implementation of Discrete Key / Value Bottleneck, in Pytorch
☆88 · Jul 9, 2023 · Updated 3 years ago
sayakpaul / NALU
Neural Arithmetic Logic Units by Trask et al.
☆12 · Apr 10, 2019 · Updated 7 years ago
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆35 · Dec 8, 2022 · Updated 3 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49 · Apr 6, 2022 · Updated 4 years ago
microsoft / tensorflow-rematerialization
Implementation of a Tensorflow XLA rematerialization pass
☆15 · Dec 20, 2019 · Updated 6 years ago
lucidrains / jax2torch
Use Jax functions in Pytorch
☆263 · Jul 1, 2023 · Updated 3 years ago