lucidrains / scaling-vin-pytorch
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group
☆36Updated 4 months ago
Alternatives and similar repositories for scaling-vin-pytorch:
Users that are interested in scaling-vin-pytorch are comparing it to the libraries listed below
- ☆30Updated 2 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆55Updated 9 months ago
- ☆31Updated 10 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆23Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Explorations into improving ViTArc with Slot Attention☆37Updated 4 months ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- FID computation in Jax/Flax.☆27Updated 7 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research☆86Updated 3 months ago
- ☆15Updated 2 years ago
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆13Updated last month
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆48Updated 2 months ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆15Updated 7 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆49Updated 6 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆26Updated 3 months ago
- A repo where I play with conditional flow approaches for learning time-varying vector-fields.☆14Updated 8 months ago
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise☆32Updated 5 months ago
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆27Updated last year
- ☆28Updated 3 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆13Updated last month
- ☆51Updated 8 months ago
- ☆23Updated 10 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆31Updated 3 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 2 months ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆13Updated 8 months ago