HenryNdubuaku / nanodl
A JAX-based library for building transformers; includes implementations of GPT, Gemma, LLaMA, Mixtral, Whisper, Swin, ViT, and more.
☆293 Updated last year
Alternatives and similar repositories for nanodl
Users who are interested in nanodl are comparing it to the libraries listed below.
- Named Tensors for Legible Deep Learning in JAX ☆205 Updated last week
- Automatic gradient descent ☆211 Updated 2 years ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX. ☆311 Updated this week
- ☆281 Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆667 Updated this week
- For optimization algorithm research and development. ☆539 Updated this week
- JAX Synergistic Memory Inspector ☆180 Updated last year
- Neural Networks for JAX ☆84 Updated last year
- Unofficial JAX implementations of deep learning research papers ☆156 Updated 3 years ago
- ☆115 Updated 3 weeks ago
- 🧱 Modula software package ☆277 Updated last month
- A functional training loops library for JAX ☆88 Updated last year
- Minimal yet performant LLM examples in pure JAX ☆177 Updated last week
- LoRA for arbitrary JAX models and functions ☆142 Updated last year
- Implementation of Flash Attention in Jax ☆218 Updated last year
- JAX implementation of the Llama 2 model ☆218 Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement… ☆398 Updated last week
- ☆188 Updated last week
- This is a port of Mistral-7B model in JAX ☆32 Updated last year
- Run PyTorch in JAX. 🤝 ☆292 Updated 3 weeks ago
- A simple library for scaling up JAX programs ☆143 Updated 11 months ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆424 Updated 3 weeks ago
- ☆247 Updated 3 months ago
- JMP is a Mixed Precision library for JAX. ☆208 Updated 8 months ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism. ☆94 Updated this week
- An interactive exploration of Transformer programming. ☆269 Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆143 Updated 5 months ago
- ☆309 Updated last year
- ☆233 Updated 7 months ago
- Orbax provides common checkpointing and persistence utilities for JAX users ☆428 Updated this week