drozzy / reinforceLinks
Implementation of Reinforce for educational purposes.
☆12Updated 2 years ago
Alternatives and similar repositories for reinforce
Users that are interested in reinforce are comparing it to the libraries listed below
Sorting:
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- ☆69Updated 2 years ago
- ☆35Updated last year
- ☆38Updated last year
- Scalable Computation of Hessian Diagonals☆14Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37Updated 2 years ago
- WandB sweeps integration with Hydra sweeper☆50Updated last year
- Deep Networks Grok All the Time and Here is Why☆38Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆91Updated last year
- ☆13Updated 2 weeks ago
- An implementation of PPO in Pytorch☆101Updated last month
- ☆41Updated 3 years ago
- Scaling scaling laws with board games.☆54Updated 2 years ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Updated 3 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated last year
- Sparse and discrete interpretability tool for neural networks☆65Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated last month
- ☆21Updated last year
- ☆212Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated 2 years ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆91Updated 2 years ago
- ☆58Updated 3 years ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated 2 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆26Updated last year
- ☆28Updated 3 years ago
- Running Jax in PyTorch Lightning☆117Updated last year
- ☆32Updated 5 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆62Updated this week