dtunai / xLSTM-Jax
JAX implementation of xLSTM: Extended Long Short-Term Memory by Beck et al. (2024)
☆16 Updated 5 months ago
Alternatives and similar repositories for xLSTM-Jax:
Users who are interested in xLSTM-Jax are comparing it to the libraries listed below.
- This is a port of the Mistral-7B model in JAX ☆30 Updated 6 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆82 Updated 11 months ago
- Lightning-like training API for JAX with Flax ☆36 Updated last month
- Exploration into the Firefly algorithm in PyTorch ☆33 Updated 4 months ago
- Open source code for EigenGame. ☆29 Updated last year
- Wraps PyTorch code in a JIT-compatible way for JAX. Supports automatically defining gradients for reverse-mode AutoDiff. ☆43 Updated last week
- Einsum-like high-level array sharding API for JAX ☆33 Updated 6 months ago
- PyTorch-like dataloaders in JAX. ☆67 Updated 3 months ago
- The 2D discrete wavelet transform for JAX ☆40 Updated last year
- Flow-matching algorithms in JAX ☆82 Updated 5 months ago
- PyTorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆48 Updated 5 months ago
- ☆106 Updated this week
- A simple hypernetwork implementation in JAX using Haiku. ☆23 Updated 2 years ago
- ☆26 Updated 2 years ago
- ☆31 Updated last month
- ☆14 Updated last month
- Diffusion models in PyTorch ☆89 Updated 3 months ago
- ☆31 Updated 9 months ago
- Fine-grained, dynamic control of neural network topology in JAX. ☆21 Updated last year
- Neural Networks for JAX ☆83 Updated 3 months ago
- Code accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆26 Updated 4 years ago
- Code for the book "The Elements of Differentiable Programming". ☆71 Updated 4 months ago
- Flexible meta-learning in JAX ☆12 Updated last year
- ☆52 Updated 2 months ago
- Implementation of DreamerV3 in PyTorch ☆42 Updated 2 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆36 Updated 3 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data. ☆17 Updated 2 months ago
- Neural Optimal Transport with Lagrangian Costs ☆50 Updated 6 months ago