This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.
☆24Aug 1, 2020Updated 5 years ago
Alternatives and similar repositories for copg
Users that are interested in copg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Safe guaranteed exploration for non-linear systems☆20Feb 9, 2024Updated 2 years ago
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆25Dec 18, 2019Updated 6 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- Official implementation of COAt-MPC (IEEE RA-L 2025), a method with theoretical guarantees to automatically tune the cost function weight…☆13Feb 2, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss☆12Oct 10, 2022Updated 3 years ago
- ☆12Apr 18, 2023Updated 2 years ago
- ☆18Jul 25, 2024Updated last year
- Study repo for David Silver's Reinforcement Learning Course☆12Apr 26, 2019Updated 6 years ago
- Code for paper Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation☆14Jun 10, 2022Updated 3 years ago
- Regularized Learning under label shifts☆18May 1, 2019Updated 6 years ago
- My solution to Collaboration and Competition using MADDPG algorithm, Udacity 3rd project of Deep RL Nanodegree from the paper "Multi-Agen…☆10Oct 6, 2019Updated 6 years ago
- [NeurIPS 2025 Spotlight] "Stochastic Process Learning via Operator Flow Matching"☆20Nov 4, 2025Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- Public examples for FORCES NLP☆12Jun 20, 2017Updated 8 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- ☆25Dec 8, 2023Updated 2 years ago
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- Customizable RecSys Simulator for OpenAI Gym☆26Dec 7, 2021Updated 4 years ago
- Non-orthogonal multiple access (NOMA) for Indoor Visible Light Communications. We offer a complete review of PD-NOMA-based VLC systems in…☆17Oct 18, 2023Updated 2 years ago
- Implementation of Deep Learning for Predicting Human Strategic Behavior☆15Apr 6, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆47Dec 3, 2025Updated 3 months ago
- Un-official technical documents aimed at helping building Apollo auto-driving system on prototype cars☆10Jul 18, 2020Updated 5 years ago
- TRISTAN: TRI's Situation and Trajectory Anticipation Networks☆14Jul 18, 2023Updated 2 years ago
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"☆25May 30, 2024Updated last year
- Traffic Steering (TS) xApp for OAIC O-RAN Testbed☆12Nov 8, 2023Updated 2 years ago
- ☆15Sep 13, 2024Updated last year
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- An empathetic counselling chatbot. Retrieval-based, uses finetuned LMs for emotion identification and to boost empathy, novelty and fluen…☆17Jun 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆18Jun 14, 2018Updated 7 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Aug 20, 2021Updated 4 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction. ICRA 2023. Code is now available at https://gi…☆54Mar 8, 2023Updated 3 years ago
- Implementation of Hierarchical Control for Head-to-Head Autonomous Racing paper☆19Feb 11, 2024Updated 2 years ago
- Spectral Tensor Train Parameterization of Deep Learning Layers☆17Jul 1, 2021Updated 4 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year