NVlabs / gbrl
Gradient Boosting Reinforcement Learning (GBRL)
☆108Updated last month
Alternatives and similar repositories for gbrl:
Users that are interested in gbrl are comparing it to the libraries listed below
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆34Updated last month
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆74Updated 2 weeks ago
- FlashRNN - Fast RNN Kernels with I/O Awareness☆85Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆99Updated 4 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆56Updated 2 months ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆61Updated last year
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO☆58Updated 7 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆33Updated 6 months ago
- ☆81Updated last year
- For optimization algorithm research and development.☆512Updated this week
- Cost aware hyperparameter tuning algorithm☆151Updated 10 months ago
- ☆218Updated last week
- ☆31Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆80Updated 2 weeks ago
- ☆53Updated last year
- Accelerated minigrid environments with JAX☆135Updated 9 months ago
- Tabular In-Context Learning☆61Updated 2 months ago
- Unofficial JAX implementations of deep learning research papers☆156Updated 2 years ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - —☆69Updated 2 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆98Updated 7 months ago
- ☆30Updated 5 months ago
- ☆47Updated 6 months ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆116Updated 11 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Exploration into the Firefly algorithm in Pytorch☆38Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆183Updated 7 months ago
- fast + parallel AlphaZero in JAX☆96Updated 4 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆117Updated last week
- ☆34Updated 2 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆91Updated last month