wyjung0625 / QCPOLinks
Implementation of Quantile-Constrained Policy Optimization (QCPO)
☆11Updated 3 years ago
Alternatives and similar repositories for QCPO
Users that are interested in QCPO are comparing it to the libraries listed below
Sorting:
- Implementation of Robust Imitation Learning against Variations in Environment Dynamics☆83Updated 2 years ago
- ☆15Updated 2 years ago
- ☆77Updated 3 years ago
- formulate diet optimization as sequence generation that produces a diet of recommended intake☆75Updated 4 years ago
- Zico (ATC'21) source code (based on TensorFlow 1.13)☆72Updated last year
- ☆84Updated 3 years ago
- (WACV'24) MICS: Midpoint Interpolation to Learn Compact and Separated Representations for Few-Shot Class-Incremental Learning☆86Updated last year
- ☆70Updated 4 years ago
- ☆80Updated 10 months ago
- [SenSys 2023] On-NAS: On-Device Neural Architecture Search on Memory-Constrained Intelligent Embedded Systems☆89Updated last year
- [NeurIPS 2023] Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks☆80Updated last year
- Domain generalization method code based on DomainBed☆99Updated 5 months ago
- This repository considers the implementation of the paper "FoX: Formation-aware exploration in multi-agent reinforcement learning" which …☆22Updated 11 months ago
- ☆107Updated 3 years ago
- solving ml10☆26Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆94Updated last year
- ☆12Updated 2 years ago
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆73Updated 4 years ago
- Code for Weighted QMIX☆140Updated 4 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆22Updated 2 years ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 3 years ago
- ☆20Updated 2 years ago
- The official code base of Shared Experience Actor-Critic (NeurIPS2020)☆40Updated last year
- IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DYMA-Cl, G2ANet, and MADDPG☆18Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆19Updated last year
- ☆50Updated 3 years ago
- (Official) PyTorch implementation for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning (EMU) (ICLR…☆52Updated last year
- ☆40Updated 3 years ago
- ☆13Updated 2 years ago
- ☆15Updated 2 years ago