drozzy / reinforce
Implementation of Reinforce for educational purposes.
☆12 Updated 2 years ago
Alternatives and similar repositories for reinforce
Users who are interested in reinforce compare it to the libraries listed below.
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆36 Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆92 Updated 2 years ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch ☆25 Updated last year
- ☆13 Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆37 Updated 2 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax ☆92 Updated last year
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024) ☆15 Updated last year
- Scalable Computation of Hessian Diagonals ☆14 Updated last year
- ☆35 Updated last year
- ☆35 Updated last year
- ☆38 Updated last year
- ☆58 Updated 3 years ago
- Unofficial implementation of Linear Recurrent Units, by Deepmind, in Pytorch ☆73 Updated 9 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch ☆64 Updated 3 weeks ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch ☆91 Updated 2 years ago
- An implementation of PPO in Pytorch ☆106 Updated 2 weeks ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ☆21 Updated last week
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf) ☆82 Updated 2 years ago
- A PyTorch implementation of Legendre Memory Units (LMUs) and its FFT variant ☆43 Updated 4 years ago
- Code for minimum-entropy coupling. ☆32 Updated 3 weeks ago
- The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha… ☆23 Updated 3 years ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆40 Updated 2 months ago
- ☆41 Updated 3 years ago
- ☆62 Updated last year
- ☆161 Updated 2 months ago
- ☆28 Updated 3 years ago
- Gradient Boosting Reinforcement Learning (GBRL) ☆134 Updated 2 months ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax. ☆22 Updated 3 years ago
- fastrl is a reinforcement learning library that extends Fastai. This project is not affiliated with fastai or Jeremy Howard. ☆26 Updated last year
- ☆37 Updated 2 years ago