PABannier / nanograd
A lightweight deep learning framework
☆34 Updated 4 years ago
Alternatives and similar repositories for nanograd
Users who are interested in nanograd are comparing it to the libraries listed below.
- ☆275 Updated last year
- Neural Networks for JAX ☆84 Updated 11 months ago
- A JAX-based library for building transformers, including implementations of GPT, Gemma, LLaMA, Mixtral, Whisper, Swin, ViT, and more. ☆290 Updated last year
- A port of the Mistral-7B model to JAX ☆32 Updated last year
- Symbolic API for model creation in PyTorch. ☆67 Updated 5 months ago
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆179 Updated this week
- Unofficial JAX implementations of deep learning research papers ☆156 Updated 3 years ago
- ☆150 Updated last year
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments. ☆52 Updated last year
- This repository contains a simple Llama 3 implementation in pure JAX. ☆68 Updated 6 months ago
- A functional training loops library for JAX ☆88 Updated last year
- A really tiny autograd engine ☆95 Updated 3 months ago
- Automatic gradient descent ☆210 Updated 2 years ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX ☆178 Updated 3 months ago
- 🧱 Modula software package ☆231 Updated 2 weeks ago
- ☆53 Updated last year
- JAX implementation of the Llama 2 model ☆219 Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆94 Updated 8 months ago
- The boundary of neural network trainability is fractal ☆215 Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU training ☆50 Updated last year
- Teaching transformers to play chess ☆138 Updated 7 months ago
- Hierarchical Associative Memory User Experience ☆103 Updated last month
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors` ☆45 Updated last year
- JAX Synergistic Memory Inspector ☆179 Updated last year
- ☆88 Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement… ☆391 Updated this week
- Functional local implementations of main model parallelism approaches ☆96 Updated 2 years ago
- Run PyTorch in JAX. 🤝 ☆283 Updated this week
- NanoGPT-speedrunning for the poor T4 enjoyers ☆70 Updated 4 months ago
- ☆115 Updated 2 weeks ago