Lemon-cmd / energy-transformer-graph
This repository contains the official code for Energy Transformer, an efficient energy-based Transformer variant for graph classification.
☆25 · Updated last year
Alternatives and similar repositories for energy-transformer-graph
Users who are interested in energy-transformer-graph compare it to the libraries listed below.
- Parallelizing non-linear sequential models over the sequence length ☆54 · Updated 3 months ago
- The Energy Transformer block, in JAX ☆58 · Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 · Updated 4 months ago
- Code repository for Trajectory Flow Matching ☆83 · Updated 11 months ago
- ☆33 · Updated 6 months ago
- Physics-inspired transformer modules based on mean-field dynamics of vector-spin models in JAX ☆41 · Updated last year
- ☆33 · Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNs ☆23 · Updated this week
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆84 · Updated 2 years ago
- PyTorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆51 · Updated last year
- ☆58 · Updated last year
- A MAD laboratory to improve AI architecture designs 🧪 ☆129 · Updated 9 months ago
- A State-Space Model with Rational Transfer Function Representation. ☆81 · Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆41 · Updated last year
- An annotated implementation of the Hyena Hierarchy paper ☆34 · Updated 2 years ago
- Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems [ICML'25] ☆101 · Updated 3 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆88 · Updated last year
- Generative Flow Networks - GFlowNet ☆279 · Updated last week
- Omnigrok: Grokking Beyond Algorithmic Data ☆62 · Updated 2 years ago
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster ☆71 · Updated 4 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons ☆102 · Updated last month
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral). ☆82 · Updated last year
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024 ☆122 · Updated 10 months ago
- ☆52 · Updated last year
- A PyTorch implementation of a Generative Flow Network (GFlowNet) proposed by Bengio et al. (2021) ☆43 · Updated 2 years ago
- GFlowNets, MCMC, Metropolis-Hastings, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Metho… ☆86 · Updated 2 years ago
- ☆56 · Updated 4 months ago
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024) ☆67 · Updated 6 months ago
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule ☆63 · Updated 2 years ago
- Official JAX Implementation of MD4 Masked Diffusion Models ☆128 · Updated 7 months ago