Lemon-cmd / energy-transformer-torch
Official Implementation of Energy Transformer in PyTorch for Mask Image Reconstruction
☆23Updated last year
Alternatives and similar repositories for energy-transformer-torch:
Users that are interested in energy-transformer-torch are comparing it to the libraries listed below
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆76Updated 11 months ago
- This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classificatio…☆23Updated last year
- Parallelizing non-linear sequential models over the sequence length☆51Updated 3 months ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆53Updated 5 months ago
- ☆16Updated 7 months ago
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch☆108Updated last year
- Code to simulate energy-based analog systems and equilibrium propagation☆26Updated 2 weeks ago
- ☆289Updated 3 months ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆104Updated 4 months ago
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆14Updated 9 months ago
- Accelerated First Order Parallel Associative Scan☆181Updated 8 months ago
- ☆175Updated 10 months ago
- A State-Space Model with Rational Transfer Function Representation.☆78Updated 11 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆57Updated last month
- ☆10Updated 3 months ago
- Scalable and Stable Parallelization of Nonlinear RNNS☆15Updated 2 months ago
- ☆53Updated last week
- MoMo: Momentum Models for Adaptive Learning Rates☆18Updated 10 months ago
- ☆175Updated 4 months ago
- Unofficial implementation of Linear Recurrent Units, by Deepmind, in Pytorch☆68Updated last year
- Implementations of various linear RNN layers using pytorch and triton☆49Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆100Updated last year
- Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting☆45Updated last year
- ☆25Updated 2 years ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆73Updated 2 months ago
- ☆67Updated 4 months ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆80Updated last year
- Better implementation of Kolmogorov Arnold Network☆24Updated 10 months ago
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆103Updated 5 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆52Updated 3 weeks ago