njmarko / graph-transformer-psiml
Transformer implemented with graph attention network (GAT) layers from PyTorch Geometric
☆18Updated 3 years ago
Alternatives and similar repositories for graph-transformer-psiml
Users who are interested in graph-transformer-psiml are comparing it to the libraries listed below
- HGRN2: Gated Linear RNNs with State Expansion☆55Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated 4 months ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Unofficial implementation of the paper "Exploring the Space of Key-Value-Query Models with Intention"☆12Updated 2 years ago
- Official code for the paper "Attention as a Hypernetwork"☆45Updated last year
- This is the public GitHub repository for our paper "Transformer with a Mixture of Gaussian Keys"☆28Updated 3 years ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆32Updated last year
- Official implementation of Adaptive Feature Transfer (AFT)☆23Updated last year
- ☆28Updated last year
- A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil.☆13Updated 8 months ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆90Updated 2 years ago
- ☆29Updated 2 weeks ago
- Diffusion based transformer, in PyTorch (Experimental).☆24Updated 3 years ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆46Updated last year
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆33Updated 2 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Updated 10 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆99Updated last year
- Awesome Mamba Papers: A Curated Collection of Research Papers, Tutorials & Blogs☆25Updated last year
- ☆38Updated 5 months ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆55Updated last year
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆78Updated 2 years ago
- A weak supervision framework for (partial) labeling functions☆16Updated last year
- Implementation of Agent Attention in PyTorch☆91Updated last year
- ☆19Updated 2 years ago
- More dimensions = More fun☆26Updated last year
- ☆16Updated 5 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆83Updated last year
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in PyTorch☆112Updated 2 months ago
- Deep Learning & Information Bottleneck☆61Updated 2 years ago