wyjung0625 / QCPOLinks
Implementation of Quantile-Constrained Policy Optimization (QCPO)
☆11Updated 2 years ago
Alternatives and similar repositories for QCPO
Users that are interested in QCPO are comparing it to the libraries listed below
Sorting:
- Implementation of Robust Imitation Learning against Variations in Environment Dynamics☆84Updated 2 years ago
- ☆78Updated 2 years ago
- ☆81Updated 7 months ago
- ☆15Updated last year
- formulate diet optimization as sequence generation that produces a diet of recommended intake☆76Updated 3 years ago
- Zico (ATC'21) source code (based on TensorFlow 1.13)☆73Updated last year
- BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation☆97Updated last year
- ☆71Updated 3 years ago
- ☆93Updated 2 years ago
- ☆85Updated 3 years ago
- Domain generalization method code based on DomainBed☆100Updated 2 months ago
- [NeurIPS 2023] Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks☆81Updated last year
- (WACV'24) MICS: Midpoint Interpolation to Learn Compact and Separated Representations for Few-Shot Class-Incremental Learning☆86Updated last year
- This repository considers the implementation of the paper "FoX: Formation-aware exploration in multi-agent reinforcement learning" which …☆20Updated 8 months ago
- NeurIPS 2023 - TopP&R: Robust Support Estimation Approach for Evaluating Fidelity and Diversity in Generative Models Official Code☆103Updated last year
- Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations☆113Updated last year
- [SenSys 2023] On-NAS: On-Device Neural Architecture Search on Memory-Constrained Intelligent Embedded Systems☆89Updated last year
- Can We Find Strong Lottery Tickets in Generative Models? - Official Code (Pytorch)☆99Updated 11 months ago
- Carousel Memory: Rethinking the Design of Episodic Memory for Continual Learning☆83Updated 2 years ago
- solving ml10☆24Updated last year
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆72Updated 3 years ago
- A minimal codebase for PPO training on MuJoCo environments with some customization supports.☆14Updated 3 years ago
- ☆11Updated 2 years ago
- (Official) PyTorch implementation for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning (EMU) (ICLR…☆48Updated last year
- This repo contains PPO implementation in PyTorch for LunarLander-v2☆11Updated 5 years ago
- (Official) PyTorch implementation for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning (EMU) in IC…☆11Updated last year
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 2 years ago
- (Official) PyTorch implementation for LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning (ICML 2024)☆21Updated last year
- Official PyTorch implementation For Sharpness-Aware Active Learning [ICML 2023]☆11Updated last year
- Original PyTorch implementation of PMIC from PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Colla…☆20Updated last year