cgarciae / nanoGPT-jax
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆32Updated last year
Alternatives and similar repositories for nanoGPT-jax:
Users that are interested in nanoGPT-jax are comparing it to the libraries listed below
- A collection of meta-learning algorithms in Jax☆22Updated 2 years ago
- Turn jitted jax functions back into python source code☆22Updated 3 months ago
- minGPT in JAX☆47Updated 3 years ago
- Scaling scaling laws with board games.☆48Updated last year
- Jax/Flax rewrite of Karpathy's nanoGPT☆57Updated 2 years ago
- A JAX implementation of stochastic addition.☆14Updated 2 years ago
- ☆87Updated last week
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- A metrics library for the JAX ecosystem☆40Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- A simple library for scaling up JAX programs☆134Updated 4 months ago
- The Energy Transformer block, in JAX☆56Updated last year
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆24Updated 2 years ago
- LoRA for arbitrary JAX models and functions☆135Updated last year
- Pytorch-like dataloaders for JAX.☆76Updated 5 months ago
- ☆38Updated last year
- Visualize, create, and operate on pytrees in the most intuitive way possible.☆44Updated 2 months ago
- ☆28Updated 2 years ago
- JAX Arrays for human consumption☆90Updated last year
- A small library for creating and manipulating custom JAX Pytree classes☆56Updated 2 years ago
- Lightning-like training API for JAX with Flax☆38Updated 3 months ago
- ☆80Updated 3 years ago
- Minimal but scalable implementation of large language models in JAX☆34Updated 4 months ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Einsum-like high-level array sharding API for JAX☆35Updated 8 months ago
- ☆112Updated last month
- Running Jax in PyTorch Lightning☆90Updated 3 months ago
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation☆39Updated 4 years ago
- JMP is a Mixed Precision library for JAX.☆193Updated last month