dtunai / xLSTM-JaxLinks
Jax implementation of x-LSTM: Extended Long Short-Term Memory by Beck et al. (2024)
☆17Updated last year
Alternatives and similar repositories for xLSTM-Jax
Users that are interested in xLSTM-Jax are comparing it to the libraries listed below
Sorting:
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆88Updated last year
- This is a port of Mistral-7B model in JAX☆32Updated last year
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆51Updated last year
- Jax like function transformation engine but micro, microjax☆33Updated 11 months ago
- Lightning-like training API for JAX with Flax☆43Updated 10 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆74Updated 3 months ago
- FID computation in Jax/Flax.☆28Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 8 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Updated last year
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆59Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆39Updated last month
- JAX implementation of the Mistral 7b v0.2 model☆35Updated last year
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆114Updated 3 years ago
- JAX implementation of Learning to learn by gradient descent by gradient descent☆27Updated 2 months ago
- Open source code for EigenGame.☆32Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- ☆115Updated last month
- Running Jax in PyTorch Lightning☆112Updated 10 months ago
- ☆33Updated 10 months ago
- Graph neural networks in JAX.☆68Updated last year
- AdaCat☆49Updated 3 years ago
- ☆33Updated 2 years ago
- ☆26Updated 3 years ago
- ☆192Updated 3 months ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- Tensor Parallelism with JAX + Shard Map☆11Updated 2 years ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Updated 3 years ago
- Train vision models using JAX and 🤗 transformers☆99Updated last month
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated 2 years ago