Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆33Jul 22, 2021Updated 4 years ago
Alternatives and similar repositories for Meta-SAC
Users that are interested in Meta-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evolution-based Soft Actor-Critic (ESAC)☆42Jul 25, 2024Updated last year
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆108Jun 18, 2022Updated 4 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- A TF2.0 implementation of RL baselines.☆10Sep 24, 2021Updated 4 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 4 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆22Apr 17, 2024Updated 2 years ago
- ☆15Apr 5, 2023Updated 3 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022☆21Jul 10, 2023Updated 2 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆12Feb 9, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆31Sep 16, 2022Updated 3 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆69Oct 3, 2023Updated 2 years ago
- Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"☆20Oct 2, 2022Updated 3 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (ToolFlowNet, for simulation envs)☆12Mar 16, 2023Updated 3 years ago
- A2C is a special case of PPO!☆23May 20, 2022Updated 4 years ago
- Synthetic Experience Replay☆114Apr 16, 2026Updated 2 months ago
- Actor Prioritized Experience Replay☆19Nov 20, 2023Updated 2 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 7 years ago
- [AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment☆29Dec 17, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆22Sep 12, 2025Updated 9 months ago
- using recurrent networks(LSTM) to solve POMDPs☆35Oct 10, 2018Updated 7 years ago
- Some multiagent deep reinforcement learning algorithms and its PyTorch implementation.☆14Feb 4, 2020Updated 6 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- ☆26Mar 3, 2025Updated last year
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆29Nov 27, 2019Updated 6 years ago
- awesome-edge-computing,边缘计算各种资料汇总,相关技术资料汇总☆23Nov 8, 2021Updated 4 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆19Sep 17, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A cell counter using computer vision techniques.☆10May 13, 2022Updated 4 years ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- This repository contains the code of the simulator used in the paper "Effect of LOS/NLOS Propagation on 5G Ultra-Dense Networks", submitt…☆12Mar 9, 2017Updated 9 years ago
- Weekly assignment solutions passed with 100/100☆11Feb 5, 2017Updated 9 years ago
- Multi-Objective Deep Reinforcement Learning☆45Jan 1, 2017Updated 9 years ago