nslyubaykin / relax
ReLAx - Reinforcement Learning Applications Library
☆15Updated 2 years ago
Alternatives and similar repositories for relax
Users who are interested in relax are comparing it to the libraries listed below
- ☆30Updated 5 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆40Updated 2 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Updated 2 years ago
- ☆18Updated 5 months ago
- Reinforcement Learning Library.☆29Updated 3 years ago
- Multi-node distributed LLM training framework☆17Updated 3 weeks ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)☆42Updated 3 months ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆75Updated 2 years ago
- ☆13Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)☆78Updated 6 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆55Updated 2 years ago
- Made for a reading group at the Center for Safe AGI.☆12Updated 2 years ago
- ☆19Updated 2 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Customizable RecSys Simulator for OpenAI Gym☆26Updated 3 years ago
- An implementation of PPO in PyTorch☆95Updated last month
- NLP course @ FinTech☆18Updated 5 years ago
- Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"☆91Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆97Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- ☆13Updated last year
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- WebGym: Web-browser-based tasks for RL Agents☆14Updated 4 years ago
- Code for fine-tuning LMs (rugpt, LLaMa, FRED T5) using transformers + deepspeed + LoRa☆15Updated 2 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Updated 5 years ago
- ☆31Updated 11 months ago
- Train environment model for RL based agent in browser-based multiplayer battle royale game «surviv.io»☆26Updated 3 years ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆53Updated 2 years ago