NVlabs / gbrl
Gradient Boosting Reinforcement Learning (GBRL)
☆87Updated this week
Related projects ⓘ
Alternatives and complementary repositories for gbrl
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO☆46Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆84Updated 2 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆110Updated this week
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆84Updated last month
- ☆76Updated 6 months ago
- Cost aware hyperparameter tuning algorithm☆119Updated 4 months ago
- σ-GPT: A New Approach to Autoregressive Models☆59Updated 2 months ago
- ☆53Updated 9 months ago
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆176Updated this week
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆25Updated this week
- ☆121Updated this week
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆49Updated 3 months ago
- Collection of autoregressive model implementation☆66Updated this week
- fast + parallel AlphaZero in JAX☆84Updated 7 months ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆59Updated 6 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆52Updated last year
- Implementation of the proposed Spline-Based Transformer from Disney Research☆74Updated this week
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆108Updated 5 months ago
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind☆111Updated 2 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated last month
- ☆39Updated 9 months ago
- ☆46Updated last month
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆117Updated 3 months ago
- Efficient baselines for autocurricula in JAX.☆173Updated 2 months ago
- ☆138Updated 2 months ago
- Jax like function transformation engine but micro, microjax☆26Updated 2 weeks ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆29Updated 3 weeks ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆38Updated 3 weeks ago
- ☆27Updated 6 months ago