HenryNdubuaku / nanodlLinks
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.
☆297Updated last year
Alternatives and similar repositories for nanodl
Users that are interested in nanodl are comparing it to the libraries listed below
Sorting:
- Automatic gradient descent☆215Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆685Updated last week
- Named Tensors for Legible Deep Learning in JAX☆212Updated 3 weeks ago
- ☆285Updated last year
- Neural Networks for JAX☆84Updated last year
- For optimization algorithm research and development.☆547Updated 2 weeks ago
- ☆118Updated 3 weeks ago
- JAX implementation of the Llama 2 model☆216Updated last year
- 🧱 Modula software package☆307Updated 3 months ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆323Updated last week
- Unofficial JAX implementations of deep learning research papers☆159Updated 3 years ago
- JAX Synergistic Memory Inspector☆182Updated last year
- LoRA for arbitrary JAX models and functions☆143Updated last year
- A functional training loops library for JAX☆88Updated last year
- ☆248Updated 5 months ago
- A simple library for scaling up JAX programs☆144Updated 3 weeks ago
- Minimal yet performant LLM examples in pure JAX☆204Updated 2 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆401Updated this week
- Run PyTorch in JAX. 🤝☆307Updated last month
- Implementation of Flash Attention in Jax☆222Updated last year
- Train very large language models in Jax.☆210Updated 2 years ago
- ☆190Updated 2 weeks ago
- jax-triton contains integrations between JAX and OpenAI Triton☆436Updated last week
- JMP is a Mixed Precision library for JAX.☆211Updated 10 months ago
- Inference code for LLaMA models in JAX☆120Updated last year
- Library for reading and processing ML training data.☆603Updated last week
- git extension for {collaborative, communal, continual} model development☆216Updated last year
- An interactive exploration of Transformer programming.☆270Updated 2 years ago
- Modular, scalable library to train ML models☆176Updated this week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆125Updated 2 months ago