bhoov / energy-transformer-jax
The Energy Transformer block, in JAX
☆56 Updated last year
Alternatives and similar repositories for energy-transformer-jax:
Users who are interested in energy-transformer-jax are comparing it to the libraries listed below.
- ☆52 Updated 5 months ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆35 Updated 2 years ago
- ☆49 Updated last year
- Neural Optimal Transport with Lagrangian Costs ☆53 Updated 7 months ago
- Transformers with doubly stochastic attention ☆45 Updated 2 years ago
- Code associated to papers on superposition (in ML interpretability) ☆28 Updated 2 years ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆41 Updated last year
- ☆24 Updated 2 years ago
- This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classificatio… ☆23 Updated last year
- ☆23 Updated 5 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆82 Updated last year
- ☆24 Updated last week
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆58 Updated last year
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆82 Updated 2 years ago
- ☆30 Updated 4 months ago
- ☆9 Updated 2 years ago
- Graphically structured diffusion model. ☆20 Updated last year
- Pytorch-like dataloaders for JAX. ☆75 Updated 4 months ago
- Fine-grained, dynamic control of neural network topology in JAX. ☆21 Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization ☆45 Updated 2 years ago
- Repository for Sparse Universal Transformers ☆17 Updated last year
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (… ☆19 Updated 2 years ago
- A simple hypernetwork implementation in jax using haiku. ☆23 Updated 2 years ago
- Experiments on the impact of depth in transformers and SSMs. ☆23 Updated 3 months ago
- Deep Networks Grok All the Time and Here is Why ☆28 Updated 9 months ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆20 Updated last year
- A Scalable Approximate Method for Probabilistic Neurosymbolic Inference ☆13 Updated last month
- Official Jax Implementation of MD4 Masked Diffusion Models ☆61 Updated this week