Implementation of Quantile-Constrained Policy Optimization (QCPO)
☆11Sep 28, 2022Updated 3 years ago
Alternatives and similar repositories for QCPO
Users that are interested in QCPO are comparing it to the libraries listed below
Sorting:
- ☆15Jul 25, 2023Updated 2 years ago
- This repository considers the implementation of the paper "FoX: Formation-aware exploration in multi-agent reinforcement learning" which …☆23Oct 24, 2024Updated last year
- solving ml10☆26Nov 10, 2023Updated 2 years ago
- Implementation of Robust Imitation Learning against Variations in Environment Dynamics☆84Jan 30, 2023Updated 3 years ago
- ☆78Dec 16, 2024Updated last year
- [NeurIPS 2023] Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks☆77Jun 7, 2024Updated last year
- (WACV'24) MICS: Midpoint Interpolation to Learn Compact and Separated Representations for Few-Shot Class-Incremental Learning☆85Feb 19, 2024Updated 2 years ago
- ☆89Jan 28, 2023Updated 3 years ago
- SPHARM-Net: Spherical Harmonics-based Convolutional Neural Network☆98Feb 1, 2025Updated last year
- BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation☆94Nov 27, 2023Updated 2 years ago
- Official repository for SlaBins: Fisheye Depth Estimation using Slanted Bins on Road Environments (ICCV 2023)☆103Sep 30, 2024Updated last year
- ☆105Apr 25, 2022Updated 3 years ago
- A repository of a paper named "Can We Use Diffusion Probabilistic Models for 3D Motion Prediction?", accepted to ICRA 2023.☆109Sep 6, 2023Updated 2 years ago
- ☆69Oct 18, 2021Updated 4 years ago
- Zico (ATC'21) source code (based on TensorFlow 1.13)☆71Oct 30, 2023Updated 2 years ago
- formulate diet optimization as sequence generation that produces a diet of recommended intake☆75Sep 2, 2021Updated 4 years ago
- ☆76Aug 17, 2022Updated 3 years ago
- Official repository for LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data (CVPR 2023)☆142Oct 1, 2023Updated 2 years ago
- ☆82Jul 13, 2022Updated 3 years ago
- Carousel Memory: Rethinking the Design of Episodic Memory for Continual Learning☆81Dec 17, 2022Updated 3 years ago
- ☆31Feb 3, 2026Updated last month
- A toolbox for Drone Show animation in Blender☆11Jul 3, 2019Updated 6 years ago
- Training an autonomous driving robot in Gazebo Simulator by soft actor critic method☆13Apr 12, 2022Updated 3 years ago
- code of RMFER: Semi-supervised Contrastive Learning for Facial Expression Recognition with Reaction Mashup Video☆90Sep 22, 2025Updated 5 months ago
- Deep Multi-Agent Reinforcement Learning with StarCraft 2☆10Sep 27, 2020Updated 5 years ago
- RoboCup Rescue Agent Development Framework☆12Feb 27, 2026Updated last week
- Domain generalization method code based on DomainBed☆99May 9, 2025Updated 10 months ago
- ☆99Sep 11, 2022Updated 3 years ago
- Official Pytorch Implementation for "Fix the Noise: Disentangling Source Feature for Controllable Domain Translation" (CVPR 2023, CVPRW 2…☆175May 17, 2023Updated 2 years ago
- Can We Find Strong Lottery Tickets in Generative Models? - Official Code (Pytorch)☆95Jul 23, 2024Updated last year
- ☆11Dec 15, 2024Updated last year
- Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations☆111Dec 21, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- ☆17Mar 22, 2023Updated 2 years ago
- A sample team using RCRS Agent Development Framework☆14Feb 27, 2026Updated last week
- Application of Deep Reinforcement learning to RoboCup Rescue Simulator☆13Jul 5, 2023Updated 2 years ago
- Microgrid/distribution network level energy market managed by an RL agent☆13Feb 19, 2021Updated 5 years ago
- Diffusion-Based Signed Distance Fields for 3D Shape Generation (CVPR 2023)☆145Feb 24, 2025Updated last year
- Conformer-RLpatching achieves multi-objective dispatching for the hybrid power system under the long-term fluctuations of renewable energ…☆18May 31, 2022Updated 3 years ago