Lemon-cmd / energy-transformer-graphLinks
This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classification
☆24Updated last year
Alternatives and similar repositories for energy-transformer-graph
Users that are interested in energy-transformer-graph are comparing it to the libraries listed below
Sorting:
- Physics-inspired transformer modules based on mean-field dynamics of vector-spin models in JAX☆41Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNS☆17Updated 5 months ago
- The Energy Transformer block, in JAX☆59Updated last year
- ☆9Updated 2 years ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆79Updated last year
- Parallelizing non-linear sequential models over the sequence length☆52Updated 3 weeks ago
- Code repository for Trajectory Flow Matching☆73Updated 8 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆123Updated 7 months ago
- ☆32Updated 9 months ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆89Updated last year
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆110Updated 7 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆84Updated last year
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆84Updated 2 years ago
- ☆53Updated 9 months ago
- ☆35Updated 3 months ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Updated 6 months ago
- An annotated implementation of the Hyena Hierarchy paper☆33Updated 2 years ago
- ☆32Updated last year
- ☆26Updated 2 years ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆50Updated 11 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆95Updated last month
- GflowNets, MCMC, Metropolis-Hasting, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Metho…☆85Updated 2 years ago
- Omnigrok: Grokking Beyond Algorithmic Data☆58Updated 2 years ago
- Generative Flow Networks - GFlowNet☆258Updated 2 weeks ago
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- ☆197Updated 7 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆124Updated last year
- ☆32Updated 8 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆171Updated 2 years ago