A PyTorch implementation of Advantage weighted Actor-Critic (AWAC)
☆56Mar 30, 2021Updated 5 years ago
Alternatives and similar repositories for Pytorch-AWAC
Users that are interested in Pytorch-AWAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Advantage weighted Actor Critic for Offline RL☆53Aug 27, 2022Updated 3 years ago
- ☆39Jul 2, 2023Updated 2 years ago
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆21Dec 7, 2024Updated last year
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆29Nov 12, 2024Updated last year
- An official 're'-implementation of Physics-induced graph neural network: An application to wind-farm power estimation (PGNN).☆29Jul 27, 2021Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆25Feb 27, 2025Updated last year
- ☆60Feb 3, 2023Updated 3 years ago
- 2019 Fall - Game theory and Multi-agent RL Termproject☆10Dec 13, 2019Updated 6 years ago
- The official implementation of Convergent Graph Solvers (CGS)☆21Feb 1, 2022Updated 4 years ago
- A PyTorch implementation of Implicit Q-Learning☆97Oct 23, 2021Updated 4 years ago
- ☆28Nov 5, 2023Updated 2 years ago
- Implementations of Deep RL Algorithms in OpenAI Gym Environments☆15Dec 11, 2020Updated 5 years ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆54Aug 26, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Conservative Q Learning on top of SAC☆138Oct 15, 2022Updated 3 years ago
- Official Code for Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization (NIPS 2024)☆22Aug 15, 2024Updated last year
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆93Jan 4, 2024Updated 2 years ago
- ☆25Dec 11, 2022Updated 3 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- ☆21Jul 4, 2019Updated 6 years ago
- A collection of MuJoCo based environments.☆20Nov 30, 2020Updated 5 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆1,340Aug 3, 2023Updated 2 years ago
- EDIS: Energy-guided DIffusion Sampling☆18Aug 10, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- A collection of reference environments for offline reinforcement learning☆1,666Nov 18, 2024Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Jul 19, 2023Updated 2 years ago
- Annotated Tutorial for PerAct☆19Sep 11, 2023Updated 2 years ago
- ☆34Jun 9, 2025Updated 10 months ago
- ☆14Oct 27, 2019Updated 6 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆148May 6, 2024Updated last year
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆69Jul 17, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Trading Robot based on LSTM-PPO☆28Dec 27, 2019Updated 6 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆403Dec 18, 2021Updated 4 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Code for conservative Q-learning☆478Dec 7, 2021Updated 4 years ago
- Implementation of advantage-weighted regression.☆209May 30, 2020Updated 5 years ago
- Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020☆223Jun 5, 2023Updated 2 years ago
- 바벨파이☆11Feb 23, 2017Updated 9 years ago