njmarko / graph-transformer-psimlLinks
Transformer implemented with graph attention network (GAT) layers from PyTorch Geometric
☆18Updated 3 years ago
Alternatives and similar repositories for graph-transformer-psiml
Users that are interested in graph-transformer-psiml are comparing it to the libraries listed below
Sorting:
- A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil.☆15Updated 10 months ago
- ☆29Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆56Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆46Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆72Updated last year
- ☆29Updated 2 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- ☆24Updated last year
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12Updated 2 years ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆34Updated 2 years ago
- Implementation of Agent Attention in Pytorch☆93Updated last year
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Updated 2 years ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Updated last year
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆86Updated 2 years ago
- Implementation of Infini-Transformer in Pytorch☆112Updated last year
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆59Updated 2 years ago
- Official implementation of Adaptive Feature Transfer (AFT)☆23Updated last year
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆124Updated 4 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated 7 months ago
- Recursive Leasting Squares (RLS) with Neural Network for fast learning☆58Updated 2 years ago
- Code for the paper "Query-Key Normalization for Transformers"☆50Updated 4 years ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated 2 years ago
- ☆35Updated 2 years ago
- ☆52Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- ☆20Updated 2 years ago
- ☆33Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year