HenryNdubuaku / nanodlLinks
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.
☆297Updated last year
Alternatives and similar repositories for nanodl
Users that are interested in nanodl are comparing it to the libraries listed below
Sorting:
- Named Tensors for Legible Deep Learning in JAX☆215Updated 2 months ago
- Automatic gradient descent☆216Updated 2 years ago
- ☆287Updated last year
- Neural Networks for JAX☆84Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆688Updated 2 weeks ago
- ☆118Updated last month
- Unofficial JAX implementations of deep learning research papers☆160Updated 3 years ago
- A functional training loops library for JAX☆88Updated last year
- JAX Synergistic Memory Inspector☆183Updated last year
- For optimization algorithm research and development.☆556Updated 3 weeks ago
- A simple library for scaling up JAX programs☆144Updated 2 months ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆328Updated last week
- LoRA for arbitrary JAX models and functions☆143Updated last year
- 🧱 Modula software package☆322Updated 4 months ago
- ☆191Updated 3 weeks ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆406Updated this week
- JAX implementation of the Llama 2 model☆215Updated last year
- Running Jax in PyTorch Lightning☆118Updated last year
- Run PyTorch in JAX. 🤝☆309Updated 2 months ago
- JMP is a Mixed Precision library for JAX.☆210Updated 11 months ago
- ☆314Updated last year
- Implementation of Flash Attention in Jax☆223Updated last year
- Graph neural networks in JAX.☆68Updated last year
- jax-triton contains integrations between JAX and OpenAI Triton☆436Updated last month
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- Minimal yet performant LLM examples in pure JAX☆225Updated last week
- This is a port of Mistral-7B model in JAX☆32Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆300Updated last year
- Orbax provides common checkpointing and persistence utilities for JAX users☆474Updated this week
- Notebooks for the "Deep Learning with JAX" book☆166Updated 7 months ago