drozzy / reinforce
Implementation of Reinforce for educational purposes.
☆11Updated last year
Alternatives and similar repositories for reinforce:
Users that are interested in reinforce are comparing it to the libraries listed below
- Scalable Computation of Hessian Diagonals☆13Updated 10 months ago
- ☆38Updated 2 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆87Updated 9 months ago
- The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…☆20Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆33Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆32Updated 5 months ago
- Clean RL implementation using MLX☆30Updated last year
- ☆30Updated 4 months ago
- Unofficial implementation of Linear Recurrent Units, by Deepmind, in Pytorch☆68Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 3 months ago
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 2 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 10 months ago
- ☆37Updated last year
- An annotated implementation of the Hyena Hierarchy paper☆32Updated last year
- ☆27Updated 9 months ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- Code for minimum-entropy coupling.☆31Updated 9 months ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆52Updated 5 months ago
- Implementations of growing and pruning in neural networks☆22Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 2 months ago
- Efficiently Composable Data Augmentation on the GPU with Jax☆33Updated 8 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆49Updated 8 months ago
- ☆14Updated 2 years ago
- ☆28Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆49Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆55Updated 2 months ago
- ☆20Updated 11 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆26Updated last month