nslyubaykin / relaxLinks
ReLAx - Reinforcement Learning Applications Library
☆15Updated 2 years ago
Alternatives and similar repositories for relax
Users that are interested in relax are comparing it to the libraries listed below
Sorting:
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆42Updated 2 years ago
- ☆18Updated 7 months ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Updated 2 years ago
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆43Updated 5 months ago
- Multi-node distributed LLM training framework☆18Updated last month
- ☆14Updated last year
- Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"☆90Updated last year
- ☆30Updated 5 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆76Updated 2 years ago
- ☆13Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025☆79Updated 8 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆56Updated 2 years ago
- Keras Implementation of DDPG(Deep Deterministic Policy Gradient) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆13Updated 2 years ago
- Reinforcement Learning Library.☆29Updated 3 years ago
- ☆19Updated 2 years ago
- Код для файнтюна LM (rugpt, LLaMa, FRED T5) средствами transformers + deepspeed + LoRa☆15Updated 2 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆33Updated 4 months ago
- WebGym: Web-browser-based tasks for RL Agents☆14Updated 4 years ago
- Repo to reproduce the First-Explore paper results☆38Updated 10 months ago
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023☆84Updated last month
- ☆31Updated last year
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆54Updated 2 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Updated last year
- Implementation of Proximal Policy Optimization in Jax+Flax☆20Updated 2 years ago
- ☆22Updated 2 years ago
- Примеры пропозалов для подачи заявки в Open.TLab☆28Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 3 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago