m-wojnar / reinforced-libLinks
Reinforcement learning library
☆64Updated last month
Alternatives and similar repositories for reinforced-lib
Users that are interested in reinforced-lib are comparing it to the libraries listed below
Sorting:
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆36Updated last year
- ☆115Updated last week
- Minimal yet performant LLM examples in pure JAX☆150Updated last week
- Turn jitted jax functions back into python source code☆22Updated 8 months ago
- Loopy belief propagation for factor graphs on discrete variables in JAX☆154Updated 10 months ago
- Einsum-like high-level array sharding API for JAX☆35Updated last year
- Parallel hyperparameter tuning with JAX☆36Updated last month
- Tidy autoregressive inference in JAX☆14Updated this week
- Minimal but scalable implementation of large language models in JAX☆35Updated last month
- fast + parallel AlphaZero in JAX☆96Updated 8 months ago
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- Named Tensors for Legible Deep Learning in JAX☆201Updated last week
- ☆52Updated last year
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Minimal, lightweight JAX implementations of popular models.☆96Updated last week
- Simple single file implementations of Reinforcement Learning algorithms in Julia☆22Updated 6 months ago
- A simple library for scaling up JAX programs☆143Updated 10 months ago
- Learning Universal Predictors☆79Updated last year
- Graph neural networks in JAX.☆67Updated last year
- LoRA for arbitrary JAX models and functions☆142Updated last year
- Implementation of PSGD optimizer in JAX☆34Updated 8 months ago
- JAX Arrays for human consumption☆106Updated last month
- ☆52Updated 2 years ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆93Updated 5 months ago
- Second Order Optimization and Curvature Estimation with K-FAC in JAX.☆282Updated last month
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆87Updated last year
- nanoGPT using Equinox☆13Updated 2 years ago
- Pytorch-like dataloaders for JAX.☆94Updated 3 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆73Updated last year