awf / functional-transformer
A pure-functional implementation of a machine learning transformer model in Python/JAX
☆175Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for functional-transformer
- A Pytree Module system for Deep Learning in JAX☆214Updated last year
- Named tensors with first-class dimensions for PyTorch☆322Updated last year
- ☆99Updated 4 months ago
- Memory mapped numpy arrays of varying shapes☆285Updated 5 months ago
- MinT: Minimal Transformer Library and Tutorials☆248Updated 2 years ago
- ☆67Updated last year
- JAX Synergistic Memory Inspector☆164Updated 4 months ago
- ☆155Updated 4 years ago
- A functional training loops library for JAX☆85Updated 9 months ago
- Multidimensional indexing for tensors☆113Updated last year
- Silly twitter torch implementations.☆46Updated 2 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆105Updated last year
- PIX is an image processing library in JAX, for JAX.☆389Updated last week
- A Python package of computer vision models for the Equinox ecosystem.☆102Updated 4 months ago
- ☆108Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (affine group preconditioner, low-rank approximation preconditioner …☆127Updated last month
- A library to inspect and extract intermediate layers of PyTorch models.☆470Updated 2 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆124Updated 2 years ago
- An interactive exploration of Transformer programming.☆246Updated last year
- NumPy arrays, ready for human consumption☆63Updated 4 months ago
- ☆58Updated 2 years ago
- Running Jax in PyTorch Lightning☆82Updated 2 weeks ago
- ☆105Updated 2 weeks ago
- Unofficial JAX implementations of deep learning research papers☆151Updated 2 years ago
- Run PyTorch in JAX. 🤝☆200Updated last year
- JAX Arrays for human consumption☆88Updated last year
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆237Updated last year
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- Docs☆143Updated last month