twni2016/Meta-SAC

Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020

☆33

Alternatives and similar repositories for Meta-SAC

Users that are interested in Meta-SAC are comparing it to the libraries listed below.

karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42 · Jul 25, 2024 · Updated last year
bramgrooten / automatic-noise-filtering
[AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"
☆12 · Feb 22, 2024 · Updated 2 years ago
zwfightzw / Meta-Critic
☆11 · Oct 19, 2020 · Updated 5 years ago
FSLight1996 / SHER
Code for the IJCAI submission "Soft Hindsight Experience Replay"
☆13 · Mar 23, 2020 · Updated 6 years ago
yihaosun1124 / mobile
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆22 · Apr 17, 2024 · Updated 2 years ago
yufeiwang63 / ROLL
Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020
☆16 · Jun 22, 2022 · Updated 4 years ago
guosyjlu / OEMA
Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.
☆16 · Aug 14, 2023 · Updated 2 years ago
RobvanGastel / rnn-sac
Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch
☆26 · Jan 7, 2023 · Updated 3 years ago
nakamotoo / Cal-QL
Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆123 · Jul 31, 2024 · Updated last year
NagisaZj / MetaCURE-Public
☆15 · Apr 5, 2023 · Updated 3 years ago
chanb / metalearning_RL
☆20 · Feb 8, 2023 · Updated 3 years ago
twni2016 / f-IRL
Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020
☆45 · Jul 19, 2023 · Updated 3 years ago
DanielTakeshi / softgym_tfn
Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)
☆12 · Feb 9, 2023 · Updated 3 years ago
quantumiracle / Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…
☆32 · Jan 19, 2023 · Updated 3 years ago
xingchenwan / bgpbt
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆31 · Sep 16, 2022 · Updated 3 years ago
martius-lab / cee-us
Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022
☆21 · Jul 10, 2023 · Updated 3 years ago
DanielTakeshi / softagent_tfn
Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (ToolFlowNet, for simulation envs)
☆12 · Mar 16, 2023 · Updated 3 years ago
nik7273 / covid-pgmorl
Multi-objective reinforcement learning for covid-19 control
☆12 · Aug 12, 2021 · Updated 4 years ago
brewinn / Roadrunner-CellCounter
A cell counter using computer vision techniques.
☆10 · May 13, 2022 · Updated 4 years ago
LAMDA-RL / ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17 · Feb 10, 2024 · Updated 2 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆23 · May 20, 2022 · Updated 4 years ago
conglu1997 / SynthER
Synthetic Experience Replay
☆114 · Apr 16, 2026 · Updated 3 months ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24 · May 30, 2019 · Updated 7 years ago
bitoceango / awesome-edge-computing
awesome-edge-computing: a collection of edge computing resources and related technical materials
☆23 · Nov 8, 2021 · Updated 4 years ago
Steven-Ho / madrl-baselines
Some multiagent deep reinforcement learning algorithms and their PyTorch implementations.
☆14 · Feb 4, 2020 · Updated 6 years ago
tuomaso / radial_rl
Code used in our paper "Robust Deep Reinforcement Learning through Adversarial Loss"
☆33 · Oct 3, 2023 · Updated 2 years ago
rpatrik96 / AttA2C
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
☆29 · Nov 27, 2019 · Updated 6 years ago
4rChon / NL-FuN
N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations
☆19 · Sep 17, 2019 · Updated 6 years ago
frbinucci / MultiUserExtensionPartialOffloading
Minimum Energy Resource Allocation Strategy with partial offloading
☆10 · Jan 17, 2022 · Updated 4 years ago
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18 · Nov 8, 2024 · Updated last year
birlrobotics / ITER_KER_GER
This repo refers to the paper "Invariant Transform Experience Replay" and is built on top of OpenAI Baseline. For more information p…
☆12 · Feb 2, 2021 · Updated 5 years ago
Dekki-Aero / DDPG
Implementing DDPG Algorithm in Tensorflow-2.0
☆10 · Mar 25, 2023 · Updated 3 years ago
CHUENGMINCHOU / AW-PER-A2C
The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …
☆10 · Aug 7, 2022 · Updated 3 years ago
syntnc / Coursera-Algorithms-Part1-Princeton
Weekly assignment solutions passed with 100/100
☆11 · Feb 5, 2017 · Updated 9 years ago
hossam-mossalam / multi-objective-deep-rl
Multi-Objective Deep Reinforcement Learning
☆45 · Jan 1, 2017 · Updated 9 years ago
g-carlo / g-carlo-Code_for_LOS_NLOS_effect_on_network_densification
This repository contains the code of the simulator used in the paper "Effect of LOS/NLOS Propagation on 5G Ultra-Dense Networks", submitt…
☆12 · Mar 9, 2017 · Updated 9 years ago
EdgeSimulation / EdgeSimulation
☆10 · Nov 5, 2024 · Updated last year
guaguakai / decision-focused-RL
☆16 · Nov 4, 2021 · Updated 4 years ago
andrewschreiber / agent
Interpretability dashboard for reinforcement learners
☆16 · Jun 4, 2019 · Updated 7 years ago