NVlabs / gbrlLinks
Gradient Boosting Reinforcement Learning (GBRL)
☆136Updated last week
Alternatives and similar repositories for gbrl
Users that are interested in gbrl are comparing it to the libraries listed below
Sorting:
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆43Updated last week
- ☆239Updated 2 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆64Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- This repository contains a better implementation of Kolmogorov-Arnold networks☆63Updated 8 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆130Updated 2 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆137Updated last week
- Official implementation of "Fourier Head: Helping Large Language Models Learn Complex Probability Distributions" (ICLR 2025)☆66Updated 10 months ago
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆26Updated 11 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Updated 2 years ago
- Modular, scalable library to train ML models☆208Updated this week
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆153Updated 9 months ago
- Exploration into the Firefly algorithm in Pytorch☆41Updated 11 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆114Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆98Updated 6 months ago
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆134Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆40Updated this week
- ☆168Updated 3 months ago
- ☆35Updated last year
- ☆35Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆117Updated last year
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆73Updated 2 months ago
- ☆82Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆58Updated last year
- ☆33Updated last year
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.☆85Updated 2 months ago