akandykeller / FERNNLinks
Official repository for the paper "Flow Equivariant Recurrent Neural Networks"
☆27Updated 5 months ago
Alternatives and similar repositories for FERNN
Users that are interested in FERNN are comparing it to the libraries listed below
Sorting:
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Updated last year
- ☆44Updated last year
- Deep Networks Grok All the Time and Here is Why☆38Updated last year
- ☆35Updated last year
- ☆35Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 11 months ago
- Flow-matching algorithms in JAX☆112Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Updated 2 years ago
- Neural Optimal Transport with Lagrangian Costs☆60Updated 7 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆51Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆56Updated last year
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆82Updated 7 months ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆70Updated last month
- ☆122Updated 6 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆40Updated last year
- ☆52Updated last year
- My take on Flow Matching☆86Updated 11 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆25Updated 9 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆47Updated 2 years ago
- NF-Layers for constructing neural functionals.☆93Updated last year
- Flash Attention Triton kernel with support for second-order derivatives☆125Updated this week
- Official code for the paper "Attention as a Hypernetwork"☆46Updated last year
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆59Updated 2 years ago
- A repo where I play with conditional flow approaches for learning time-varying vector-fields.☆23Updated last year
- Official codebase for the paper "How to build a consistency model: Learning flow maps via self-distillation" (NeurIPS 2025).☆61Updated 2 months ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Experiment with diffusion models that you can run on your local jupyter instances☆63Updated last year
- RS-IMLE☆43Updated last year