lucidrains/complex-valued-transformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/complex-valued-transformer)

lucidrains / complex-valued-transformer

Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"

☆92

Alternatives and similar repositories for complex-valued-transformer

Users that are interested in complex-valued-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

josiahwsmith10 / complextorch
View on GitHub
☆42Jun 22, 2026Updated 2 weeks ago
vkothapally / Complex-valued-Attention
View on GitHub
Transformer based Self-Attention for Complex Numbers
☆14Oct 19, 2021Updated 4 years ago
lucidrains / autoregressive-linear-attention-cuda
View on GitHub
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆46May 23, 2023Updated 3 years ago
lucidrains / coordinate-descent-attention
View on GitHub
Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk
☆47Jul 16, 2023Updated 2 years ago
lucidrains / memory-editable-transformer
View on GitHub
My explorations into editing the knowledge and memories of an attention network
☆35Dec 8, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lucidrains / kalman-filtering-attention
View on GitHub
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆59Oct 22, 2023Updated 2 years ago
NEGU93 / polsar_cvnn
View on GitHub
PolSAR classification / segmentation using complex-valued neural networks.
☆20Jan 5, 2022Updated 4 years ago
saurjya / EnsembleSep
View on GitHub
This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.
☆12Nov 7, 2024Updated last year
lucidrains / gateloop-transformer
View on GitHub
Implementation of GateLoop Transformer in Pytorch and Jax
☆92Jun 18, 2024Updated 2 years ago
lucidrains / metaformer-gpt
View on GitHub
Implementation of Metaformer, but in an autoregressive manner
☆26Jun 21, 2022Updated 4 years ago
lucidrains / mirasol-pytorch
View on GitHub
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
☆92Dec 22, 2023Updated 2 years ago
lucidrains / TPDNE
View on GitHub
Thispersondoesnotexist went down, so this time, while building it back up, I am going to open source all of it.
☆91Aug 26, 2023Updated 2 years ago
lucidrains / mixture-of-attention
View on GitHub
Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts
☆122Oct 17, 2024Updated last year
becker929 / rave-training
View on GitHub
Utilities and experiments for training RAVE
☆16Oct 23, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
kyegomez / PaLM2-VAdapter
View on GitHub
Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…
☆17Nov 11, 2024Updated last year
lucidrains / quartic-transformer
View on GitHub
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆56Mar 25, 2025Updated last year
morganmcg1 / wandb_spectrogram
View on GitHub
☆15Sep 24, 2022Updated 3 years ago
lucidrains / agent-attention-pytorch
View on GitHub
Implementation of Agent Attention in Pytorch
☆93Jul 10, 2024Updated 2 years ago
CODEJIN / DiffSingerKR
View on GitHub
☆25Aug 31, 2024Updated last year
lucidrains / holodeck-pytorch
View on GitHub
Implementation of a holodeck, written in Pytorch
☆19Nov 1, 2023Updated 2 years ago
NEGU93 / cvnn
View on GitHub
Library to help implement a complex-valued neural network (cvnn) using tensorflow as back-end
☆196Apr 23, 2026Updated 2 months ago
lucidrains / flash-attention-jax
View on GitHub
Implementation of Flash Attention in Jax
☆228Mar 1, 2024Updated 2 years ago
AndyShih12 / LongHorizonTemperatureScaling
View on GitHub
PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023
☆21May 31, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
JianpingWang-TUD / InterferenceMitigation_CFAR
View on GitHub
CFAR-based Interference Mitigation for FMCW Radars
☆21Dec 19, 2022Updated 3 years ago
lucidrains / tranception-pytorch
View on GitHub
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆32Jun 19, 2022Updated 4 years ago
kyegomez / HRTX
View on GitHub
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
☆15Jun 27, 2025Updated last year
lucidrains / pause-transformer
View on GitHub
Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…
☆53Oct 22, 2023Updated 2 years ago
lucidrains / gradnorm-pytorch
View on GitHub
A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch
☆132Jun 1, 2026Updated last month
lucidrains / scaling-vin-pytorch
View on GitHub
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group
☆37Sep 23, 2024Updated last year
Algomancer / VCReg
View on GitHub
Minimal Implimentation of VCRec (2024) for collapse provention.
☆18Jan 28, 2025Updated last year
lucidrains / blackbox-gradient-sensing
View on GitHub
Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…
☆20Apr 17, 2026Updated 2 months ago
kyegomez / Reka-Torch
View on GitHub
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆29Jun 29, 2026Updated last week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
lucidrains / pytorch-custom-utils
View on GitHub
Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…
☆126Jul 26, 2024Updated last year
lucidrains / esbn-transformer
View on GitHub
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16Aug 3, 2021Updated 4 years ago
VITA-Group / ViT-Anti-Oversmoothing
View on GitHub
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…
☆84Jan 6, 2024Updated 2 years ago
kyegomez / AudioMamba
View on GitHub
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch
☆15Jun 22, 2026Updated 2 weeks ago
uqmarlonbran / TCS
View on GitHub
This repository contains the supplementary material for the paper titled: "TRANSFORMER COMPRESSED SENSING VIA GLOBAL IMAGE TOKENS".
☆13Dec 20, 2022Updated 3 years ago
MingjieWang0606 / 2021-Sohu-Text-Matching-TOP2
View on GitHub
☆13Jun 19, 2021Updated 5 years ago
UT-Radar-Interferometry-Group / psps
View on GitHub
Persistent Scatterer selection based on phase cosine similarity
☆17Jul 20, 2023Updated 2 years ago