cgarciae / nanoGPT-jax
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆33Updated last year
Alternatives and similar repositories for nanoGPT-jax:
Users that are interested in nanoGPT-jax are comparing it to the libraries listed below
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆57Updated 2 years ago
- minGPT in JAX☆48Updated 3 years ago
- Scaling scaling laws with board games.☆48Updated last year
- A functional training loops library for JAX☆87Updated last year
- Minimal but scalable implementation of large language models in JAX☆34Updated 6 months ago
- ☆105Updated this week
- LoRA for arbitrary JAX models and functions☆136Updated last year
- JAX implementations of core Deep RL algorithms☆79Updated 3 years ago
- Turn jitted jax functions back into python source code☆22Updated 4 months ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆75Updated 6 months ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆14Updated last year
- A metrics library for the JAX ecosystem☆40Updated 2 years ago
- General Modules for JAX☆64Updated 3 weeks ago
- Lightning-like training API for JAX with Flax☆38Updated 4 months ago
- A simple library for scaling up JAX programs☆134Updated 6 months ago
- Pytorch-like dataloaders for JAX.☆80Updated last week
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Efficiently Composable Data Augmentation on the GPU with Jax☆33Updated 9 months ago
- flexible meta-learning in jax☆13Updated last year
- JAX Arrays for human consumption☆92Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- Reinforcement learning library in JAX.☆100Updated last year
- Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!☆28Updated this week
- A PyTorch implementation of a Generative Flow Network (GFlowNet) proposed by Bengio et al. (2021)☆42Updated last year
- ☆56Updated 2 years ago
- This is a port of Mistral-7B model in JAX☆32Updated 10 months ago