borjanG / 2023-transformers-rotf
Codes for the paper "A mathematical perspective on Transformers".
☆37Updated 11 months ago
Alternatives and similar repositories for 2023-transformers-rotf
Users who are interested in 2023-transformers-rotf are comparing it to the repositories listed below
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆78Updated last year
- ☆32Updated 8 months ago
- JAX implementation of Kolmogorov Arnold Networks (KANs).☆11Updated last year
- The Energy Transformer block, in JAX☆58Updated last year
- ☆31Updated 7 months ago
- Evaluation of neuro-symbolic engines☆35Updated 10 months ago
- ☆104Updated 2 weeks ago
- Scalable and Stable Parallelization of Nonlinear RNNs☆16Updated 5 months ago
- ☆39Updated 3 years ago
- Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 2 years ago
- ☆51Updated last year
- Code for the book "The Elements of Differentiable Programming".☆88Updated last week
- Open source code for EigenGame.☆30Updated 2 years ago
- Riemannian Optimization Using JAX☆49Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆63Updated last year
- A micro JAX-like function transformation engine, microjax☆32Updated 8 months ago
- A simple hypernetwork implementation in JAX using Haiku.☆23Updated 2 years ago
- Flexible Inference for Predictive Coding Networks in JAX.☆48Updated 3 weeks ago
- Lightning-like training API for JAX with Flax☆41Updated 6 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆72Updated 2 years ago
- Flow-matching algorithms in JAX☆97Updated 10 months ago
- ☆32Updated last year
- Codes for the paper "The emergence of clusters in self-attention dynamics".☆16Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆124Updated last year
- Turn jitted JAX functions back into Python source code☆22Updated 6 months ago
- nanoGPT using Equinox☆13Updated 2 years ago
- Omnigrok: Grokking Beyond Algorithmic Data☆58Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- ☆31Updated last year