awf / functional-transformerLinks
A pure-functional implementation of a machine learning transformer model in Python/JAX
☆180Updated 5 months ago
Alternatives and similar repositories for functional-transformer
Users that are interested in functional-transformer are comparing it to the libraries listed below
Sorting:
- A Pytree Module system for Deep Learning in JAX☆214Updated 2 years ago
- Named tensors with first-class dimensions for PyTorch☆331Updated 2 years ago
- ☆106Updated last year
- ☆71Updated last year
- ☆60Updated 3 years ago
- A functional training loops library for JAX☆88Updated last year
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆258Updated last year
- MinT: Minimal Transformer Library and Tutorials☆258Updated 3 years ago
- Framework-agnostic library for checking array/tensor shapes at runtime.☆46Updated 4 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆188Updated last week
- ☆153Updated 5 years ago
- A Python package of computer vision models for the Equinox ecosystem.☆109Updated last year
- JMP is a Mixed Precision library for JAX.☆208Updated 8 months ago
- JAX Synergistic Memory Inspector☆179Updated last year
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆242Updated 2 years ago
- ☆115Updated last month
- ☆158Updated last year
- An interactive exploration of Transformer programming.☆269Updated last year
- An alternative to convolution in neural networks☆257Updated last year
- PIX is an image processing library in JAX, for JAX.☆423Updated 7 months ago
- ☆44Updated 2 months ago
- Silly twitter torch implementations.☆46Updated 3 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆131Updated 3 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆114Updated 3 years ago
- Unofficial JAX implementations of deep learning research papers☆158Updated 3 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆116Updated 2 years ago
- NumPy arrays, ready for human consumption☆71Updated 2 weeks ago
- ☆108Updated 2 years ago
- ☆247Updated 3 months ago
- Automatic gradient descent☆215Updated 2 years ago