lucidrains / metaformer-gptView external linksLinks
Implementation of Metaformer, but in an autoregressive manner
☆26Jun 21, 2022Updated 3 years ago
Alternatives and similar repositories for metaformer-gpt
Users that are interested in metaformer-gpt are comparing it to the libraries listed below
Sorting:
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 4 years ago
- Implementation of Differentiable Sign-Distance Function Rendering - in Pytorch☆70May 9, 2022Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51May 10, 2022Updated 3 years ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85May 28, 2022Updated 3 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Aug 3, 2021Updated 4 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Feb 13, 2023Updated 3 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Mar 29, 2022Updated 3 years ago
- PyTorch implementation of 2D Sharpened Cosine Similarity layer☆17Feb 1, 2022Updated 4 years ago
- Implementation of a U-net complete with efficient attention as well as the latest research findings☆292May 3, 2024Updated last year
- Implementation of E(n)-Transformer, which incorporates attention mechanisms into Welling's E(n)-Equivariant Graph Neural Network☆226Jun 2, 2024Updated last year
- A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model☆641Dec 19, 2025Updated last month
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆104Oct 10, 2023Updated 2 years ago
- This tool allows local LLM usage that can automate tasks without human interventention. The agent can call itself recursively and work on…☆20May 5, 2025Updated 9 months ago
- Aggregating embeddings over time☆32Jan 19, 2023Updated 3 years ago
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …☆39Aug 3, 2021Updated 4 years ago
- My explorations into editing the knowledge and memories of an attention network☆35Dec 8, 2022Updated 3 years ago
- Graph neural network message passing reframed as a Transformer with local attention☆70Dec 24, 2022Updated 3 years ago
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆88Oct 13, 2023Updated 2 years ago
- Implementation of the Belief State Encoder / Decoder in the new breakthrough robotics paper from ETH Zürich☆84Apr 23, 2025Updated 9 months ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Jun 26, 2021Updated 4 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated last year
- U-Net Model for Image Segmentation Problems using PyTorch 0.4☆38Dec 3, 2019Updated 6 years ago
- DECtalk Twitch text-to-speech bot is a bot that reads out chat messages with DECtalk. DECtalk is famously used by Professor Stephen Hawki…☆10Jan 5, 2023Updated 3 years ago
- My personal solutions to some textbook problems☆10Feb 12, 2020Updated 6 years ago
- A Feature based morphing tool which implements Beier–Neely morphing algorithm☆12Jul 18, 2021Updated 4 years ago
- Tool for image-based control RDP (Remote Desktop Protocol). Manipulations, automations and testing via Python and Apache Guacamole☆14Nov 16, 2022Updated 3 years ago
- Anime Character Segmentation☆11Aug 31, 2020Updated 5 years ago
- Personal repo for working on things prior to merging into official crDroid repositories. '-testx' branches are WIP, '-mxx' are milestone …☆10Jan 13, 2026Updated last month
- ☆39May 25, 2021Updated 4 years ago
- ☆15Mar 11, 2025Updated 11 months ago
- ☆11Feb 9, 2026Updated last week
- An Elder Scrolls neural name generator trained using PyTorch☆10Jan 29, 2019Updated 7 years ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- [ICLR 2025] UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP☆15Jun 20, 2025Updated 7 months ago
- Edith Virtual Assistant 🧠☆11Aug 9, 2021Updated 4 years ago
- Python SIR-x model implementation☆10Dec 8, 2022Updated 3 years ago
- The Colour Deconvolution 2 ImageJ plugin implements stain unmixing with Ruifrok and Johnston’s method described in [Ruifrok AC, Johnston …☆12Aug 5, 2022Updated 3 years ago
- Implementation of Agent Attention in Pytorch☆93Jul 10, 2024Updated last year
- Implementation of NWT, audio-to-video generation, in Pytorch☆92Mar 17, 2022Updated 3 years ago