Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆33Jul 22, 2021Updated 4 years ago
Alternatives and similar repositories for Meta-SAC
Users that are interested in Meta-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evolution-based Soft Actor-Critic (ESAC)☆42Jul 25, 2024Updated last year
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆108Jun 18, 2022Updated 3 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A TF2.0 implementation of RL baselines.☆10Sep 24, 2021Updated 4 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆23Apr 17, 2024Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- ☆15Apr 5, 2023Updated 3 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆20Feb 8, 2023Updated 3 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆45Jul 19, 2023Updated 2 years ago
- Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022☆21Jul 10, 2023Updated 2 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆12Feb 9, 2023Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆30Sep 16, 2022Updated 3 years ago
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆67Oct 3, 2023Updated 2 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…☆32Jan 19, 2023Updated 3 years ago
- Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"☆20Oct 2, 2022Updated 3 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (ToolFlowNet, for simulation envs)☆12Mar 16, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆27Oct 10, 2020Updated 5 years ago
- An easy-to-use Nelder-Mead optimizer for n-Vectors☆13Sep 24, 2018Updated 7 years ago
- A2C is a special case of PPO!☆22May 20, 2022Updated 3 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- [AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment☆31Dec 17, 2025Updated 4 months ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆20Sep 12, 2025Updated 7 months ago
- Main repository of the BeFaaS project☆15Jun 29, 2023Updated 2 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- ☆21Mar 3, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆29Nov 27, 2019Updated 6 years ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- PyOblige is Python wrapper for OBLIGE - random level generator for Doom☆11Jul 2, 2018Updated 7 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …☆10Aug 7, 2022Updated 3 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- Multi-Objective Deep Reinforcement Learning☆45Jan 1, 2017Updated 9 years ago