wyjung0625 / QCPO
Implementation of Quantile-Constrained Policy Optimization (QCPO)
☆11Updated 2 years ago
Alternatives and similar repositories for QCPO:
Users that are interested in QCPO are comparing it to the libraries listed below
- This repository considers the implementation of the paper "FoX: Formation-aware exploration in multi-agent reinforcement learning" which …☆18Updated 6 months ago
- ☆15Updated last year
- Implementation of Robust Imitation Learning against Variations in Environment Dynamics☆83Updated 2 years ago
- ☆78Updated 2 years ago
- Zico (ATC'21) source code (based on TensorFlow 1.13)☆73Updated last year
- ☆81Updated 4 months ago
- formulate diet optimization as sequence generation that produces a diet of recommended intake☆75Updated 3 years ago
- code of RMFER: Semi-supervised Contrastive Learning for Facial Expression Recognition with Reaction Mashup Video☆89Updated last year
- BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation☆96Updated last year
- ☆71Updated 3 years ago
- Domain generalization method code based on DomainBed☆100Updated 2 years ago
- [NeurIPS 2023] Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks☆81Updated 10 months ago
- ☆85Updated 2 years ago
- ☆93Updated 2 years ago
- Carousel Memory: Rethinking the Design of Episodic Memory for Continual Learning☆83Updated 2 years ago
- [SenSys 2023] On-NAS: On-Device Neural Architecture Search on Memory-Constrained Intelligent Embedded Systems☆89Updated last year
- (WACV'24) MICS: Midpoint Interpolation to Learn Compact and Separated Representations for Few-Shot Class-Incremental Learning☆86Updated last year
- Can We Find Strong Lottery Tickets in Generative Models? - Official Code (Pytorch)☆99Updated 9 months ago
- solving ml10☆23Updated last year
- NeurIPS 2023 - TopP&R: Robust Support Estimation Approach for Evaluating Fidelity and Diversity in Generative Models Official Code☆103Updated 9 months ago
- Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations☆110Updated last year
- ☆106Updated 3 years ago
- Official repository for SlaBins: Fisheye Depth Estimation using Slanted Bins on Road Environments (ICCV 2023)☆102Updated 6 months ago
- SPHARM-Net: Spherical Harmonics-based Convolutional Neural Network☆97Updated 2 months ago
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆70Updated 3 years ago
- ☆10Updated last year
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆21Updated 5 months ago
- Actor Prioritized Experience Replay☆15Updated last year
- ☆46Updated 2 years ago
- Open AI Gym - Pendulum-v1 reinforcement learning (DQN, SAC)☆18Updated last year