Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆33Jul 22, 2021Updated 4 years ago
Alternatives and similar repositories for Meta-SAC
Users that are interested in Meta-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆108Jun 18, 2022Updated 3 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Aug 17, 2022Updated 3 years ago
- A TF2.0 implementation of RL baselines.☆10Sep 24, 2021Updated 4 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆23Apr 17, 2024Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- ☆15Apr 5, 2023Updated 3 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆20Feb 8, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆45Jul 19, 2023Updated 2 years ago
- Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022☆21Jul 10, 2023Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆30Sep 16, 2022Updated 3 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…☆32Jan 19, 2023Updated 3 years ago
- A2C is a special case of PPO!☆22May 20, 2022Updated 4 years ago
- Synthetic Experience Replay☆112Apr 16, 2026Updated last month
- Actor Prioritized Experience Replay☆19Nov 20, 2023Updated 2 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆21Sep 12, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Some multiagent deep reinforcement learning algorithms and its PyTorch implementation.☆14Feb 4, 2020Updated 6 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- ☆22Mar 3, 2025Updated last year
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆29Nov 27, 2019Updated 6 years ago
- A cell counter using computer vision techniques.☆10May 13, 2022Updated 4 years ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- PyOblige is Python wrapper for OBLIGE - random level generator for Doom☆11Jul 2, 2018Updated 7 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …☆10Aug 7, 2022Updated 3 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- This repository contains the code of the simulator used in the paper "Effect of LOS/NLOS Propagation on 5G Ultra-Dense Networks", submitt…☆12Mar 9, 2017Updated 9 years ago
- Weekly assignment solutions passed with 100/100☆11Feb 5, 2017Updated 9 years ago
- soft q learning and soft actor critic☆16Dec 23, 2018Updated 7 years ago
- ☆15Nov 4, 2021Updated 4 years ago
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention☆13Dec 24, 2024Updated last year