NVlabs / gbrl
Gradient Boosting Reinforcement Learning (GBRL)
☆88 Updated this week
Related projects
Alternatives and complementary repositories for gbrl
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆85 Updated 2 months ago
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO ☆47 Updated last month
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆111 Updated last week
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning ☆53 Updated 3 months ago
- This repository contains a better implementation of Kolmogorov-Arnold networks ☆59 Updated 6 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 Updated 2 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆37 Updated 5 months ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3 ☆25 Updated this week
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆29 Updated last month
- σ-GPT: A New Approach to Autoregressive Models ☆59 Updated 3 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po… ☆86 Updated last year
- fast + parallel AlphaZero in JAX ☆84 Updated 7 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research ☆76 Updated last week
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC ☆52 Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆108 Updated 5 months ago
- Collection of tests performed during the study of the new Kolmogorov-Arnold Neural Networks (KAN) ☆34 Updated last month
- Exploration into the Firefly algorithm in PyTorch ☆35 Updated 2 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆21 Updated 3 weeks ago
- Explainable Reinforcement Learning (XRL) Resources ☆33 Updated last month
- Cost-aware hyperparameter tuning algorithm ☆124 Updated 4 months ago
- Collection of autoregressive model implementations ☆67 Updated this week