rcrowe-google / Learning-JAXLinks
Slide decks, coding exercises, and quick references for learning the JAX AI Stack
☆65Updated this week
Alternatives and similar repositories for Learning-JAX
Users that are interested in Learning-JAX are comparing it to the libraries listed below
Sorting:
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax☆21Updated 5 months ago
- Diffusion models in PyTorch☆112Updated last week
- Various transformers for FSDP research☆38Updated 2 years ago
- ☆31Updated 4 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆59Updated 3 weeks ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆27Updated last year
- ☆51Updated last week
- Fast, Modern, and Low Precision PyTorch Optimizers☆116Updated 2 months ago
- This is a port of Mistral-7B model in JAX☆32Updated last year
- (EasyDel Former) is a utility library designed to simplify and enhance the development in JAX☆28Updated 2 weeks ago
- Tutorials for Triton, a language for writing gpu kernels☆56Updated 2 years ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆96Updated 3 months ago
- JAX implementation of the Mistral 7b v0.2 model☆34Updated last year
- ☆150Updated last year
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆53Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated 10 months ago
- ML/DL Math and Method notes☆64Updated last year
- ☆21Updated last year
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated last year
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- ☆48Updated last year
- ☆52Updated last year
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Updated 3 years ago
- some common Huggingface transformers in maximal update parametrization (µP)☆86Updated 3 years ago
- Utilities for PyTorch distributed☆25Updated 8 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆39Updated 2 months ago
- ☆81Updated last year
- Neural Networks for JAX☆84Updated last year
- Graph neural networks in JAX.☆68Updated last year
- ☆68Updated 11 months ago