drozzy / reinforceLinks
Implementation of Reinforce for educational purposes.
☆12Updated 2 years ago
Alternatives and similar repositories for reinforce
Users that are interested in reinforce are comparing it to the libraries listed below
Sorting:
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Updated 2 years ago
- ☆35Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆92Updated last year
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Updated last year
- ☆35Updated last year
- ☆39Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37Updated 2 years ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆83Updated last year
- Some utility functions to help myself (and perhaps others) go faster with ML/AI work☆45Updated this week
- ☆124Updated 8 months ago
- Scalable Computation of Hessian Diagonals☆14Updated last year
- Automatically take good care of your preemptible TPUs☆37Updated 2 years ago
- ☆62Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆64Updated last month
- ☆13Updated last month
- Implementation of Infini-Transformer in Pytorch☆112Updated last year
- ☆168Updated 3 months ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆26Updated last year
- ☆31Updated 2 weeks ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Updated last year
- Evaluating the Mamba architecture on the Othello game☆49Updated last year
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Updated this week
- DiT (training + flow matching) in Jax☆11Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆130Updated last year
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated 2 years ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆105Updated last year
- [TMLR'25] Official implementation for "Large-Scale Targeted Cause Discovery via Learning from Simulated Data"☆26Updated 4 months ago