Lemon-cmd / energy-transformer-graphLinks
This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classification
☆25Updated last year
Alternatives and similar repositories for energy-transformer-graph
Users that are interested in energy-transformer-graph are comparing it to the libraries listed below
Sorting:
- The Energy Transformer block, in JAX☆62Updated last year
- ☆62Updated last year
- Parallelizing non-linear sequential models over the sequence length☆56Updated 5 months ago
- Scalable and Stable Parallelization of Nonlinear RNNS☆27Updated last month
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated 6 months ago
- ☆33Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆83Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- A MAD laboratory to improve AI architecture designs 🧪☆135Updated 11 months ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆124Updated last year
- Code repository for Trajectory Flow Matching☆93Updated last year
- An annotated implementation of the Hyena Hierarchy paper☆34Updated 2 years ago
- Code for verifying deep neural feature ansatz☆21Updated 2 years ago
- ☆35Updated 8 months ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Updated 11 months ago
- Physics-inspired transformer modules based on mean-field dynamics of vector-spin models in JAX☆45Updated 2 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆91Updated last year
- ☆33Updated last year
- ☆61Updated 2 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆105Updated last month
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆51Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- Omnigrok: Grokking Beyond Algorithmic Data☆62Updated 2 years ago
- nanoGPT using Equinox☆14Updated 2 years ago
- ☆35Updated last year
- Repository for code used in the xVal paper☆145Updated last year
- Code for paper "Compositional Sculpting of Iterative Generative Processes"☆25Updated 2 years ago
- ☆34Updated last year
- Official implementation of Fisher-Flow Matching (NeurIPS 2024).☆34Updated last year
- Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems [ICML'25]☆109Updated 2 months ago