lucidrains / complex-valued-transformerLinks
Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"
☆79Updated last year
Alternatives and similar repositories for complex-valued-transformer
Users that are interested in complex-valued-transformer are comparing it to the libraries listed below
Sorting:
- ☆31Updated 2 months ago
- Code for the paper: Complex-Valued Autoencoders for Object Discovery☆54Updated 2 years ago
- Complex tensor and complex functions for pytorch.☆49Updated 2 years ago
- ☆56Updated last year
- Implementation of Complex Valued Neural Networks in Pytorch 🧠☆49Updated 3 months ago
- Complex-valued neural networks for pytorch and Variational Dropout for real and complex layers.☆145Updated 3 years ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆51Updated 8 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆70Updated 6 months ago
- [ICLR 2023] "Dilated convolution with learnable spacings" Ismail Khalfaoui Hassani, Thomas Pellegrini and Timothée Masquelier☆68Updated last year
- Implementation of Agent Attention in Pytorch☆90Updated 11 months ago
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆76Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆108Updated 7 months ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆89Updated last year
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆44Updated last year
- Library to help implement a complex-valued neural network (cvnn) using tensorflow as back-end☆175Updated last month
- Deep Learning Model for Signal Data☆87Updated 5 years ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆97Updated last year
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆82Updated last year
- Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)☆46Updated last year
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆64Updated 3 weeks ago
- A State-Space Model with Rational Transfer Function Representation.☆78Updated last year
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆119Updated 8 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆88Updated 4 months ago
- an implementation of FAdam (Fisher Adam) in PyTorch☆44Updated last year
- ☆22Updated 8 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆124Updated last year
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆11Updated 4 months ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆80Updated last year
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆85Updated last week
- Source code for the experiments of Trainable Fractional Fourier Transform paper submitted to IEEE Signal Processing Letters.☆15Updated last year