borjanG / 2023-transformers-rotfLinks
Codes for the paper "A mathematical perspective on Transformers".
☆36Updated 10 months ago
Alternatives and similar repositories for 2023-transformers-rotf
Users that are interested in 2023-transformers-rotf are comparing it to the libraries listed below
Sorting:
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆81Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆78Updated last year
- ☆53Updated 8 months ago
- ☆29Updated 6 months ago
- Implementation of PSGD optimizer in JAX☆33Updated 5 months ago
- ☆32Updated last year
- Code for the book "The Elements of Differentiable Programming".☆86Updated 2 months ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Pytorch-like dataloaders for JAX.☆83Updated last week
- ☆32Updated 8 months ago
- Open source code for EigenGame.☆30Updated 2 years ago
- Flow-matching algorithms in JAX☆92Updated 9 months ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆62Updated 11 months ago
- Lightning-like training API for JAX with Flax☆38Updated 5 months ago
- Graph neural networks in JAX.☆67Updated 11 months ago
- Wraps PyTorch code in a JIT-compatible way for JAX. Supports automatically defining gradients for reverse-mode AutoDiff.☆53Updated last month
- [ICLR 2024 Spotlight] This is the official code for the paper "SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-t…☆52Updated 7 months ago
- Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)☆68Updated 9 months ago
- Turn jitted jax functions back into python source code☆22Updated 5 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆99Updated 9 months ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆108Updated 6 months ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆124Updated last year
- nanoGPT using Equinox☆13Updated 2 years ago
- A MAD laboratory to improve AI architecture designs 🧪☆116Updated 5 months ago
- Jax like function transformation engine but micro, microjax☆32Updated 7 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆72Updated 2 years ago
- The Energy Transformer block, in JAX☆57Updated last year
- ☆37Updated last year
- JAX implementation of Kolmogorov Arnold Networks (KANs).☆10Updated last year