[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".
☆21 · Jul 26, 2025 · Updated 8 months ago
Alternatives and similar repositories for PiCor
Users interested in PiCor are comparing it to the repositories listed below.
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors" ☆34 · Feb 15, 2025 · Updated last year
- VGDFR: Diffusion-based Video Generation with Dynamic Frame Rate ☆17 · May 16, 2025 · Updated 10 months ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P… ☆55 · Nov 22, 2025 · Updated 4 months ago
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation ☆13 · Dec 13, 2024 · Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings ☆15 · Nov 14, 2023 · Updated 2 years ago
- [SIGKDD 2023] HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline ☆22 · Jun 16, 2023 · Updated 2 years ago
- An NLP research and data collection platform. ☆17 · Mar 13, 2024 · Updated 2 years ago
- ☆46 · Dec 21, 2025 · Updated 3 months ago
- ☆29 · Mar 25, 2026 · Updated 2 weeks ago
- ☆13 · Oct 11, 2022 · Updated 3 years ago
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”. ☆21 · Jul 27, 2025 · Updated 8 months ago
- ☆17 · Apr 8, 2025 · Updated last year
- ☆14 · Jun 28, 2022 · Updated 3 years ago
- The implementation of the paper "Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects". [NeurIP… ☆15 · Jun 13, 2025 · Updated 9 months ago
- Direct X game controller server/client written in Python ☆10 · Jul 10, 2018 · Updated 7 years ago
- ☆15 · Mar 26, 2024 · Updated 2 years ago
- DQN with freezing target network in tensorflow on pygame FlappyBird ☆11 · Dec 19, 2018 · Updated 7 years ago
- ☆18 · May 28, 2024 · Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games ☆15 · Feb 8, 2026 · Updated 2 months ago
- RL environment replicating the werewolf game to study emergent communication ☆20 · May 25, 2023 · Updated 2 years ago
- Telegram bot which sends alerts when new papers, articles, books, etc. related to your keywords are released on Google Scholar or arXiv ☆12 · Feb 26, 2022 · Updated 4 years ago
- Preliminary version of AutoBio (https://arxiv.org/abs/2505.14030) ☆73 · Jun 12, 2025 · Updated 10 months ago
- This is a repository for Hidden-utility Self-Play. ☆26 · Jul 27, 2023 · Updated 2 years ago
- A large-scale multi-modal pre-trained model ☆134 · Feb 7, 2023 · Updated 3 years ago
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni… ☆77 · May 28, 2024 · Updated last year
- The official codebase for "AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning" (ICLR 2025) ☆32 · Jul 21, 2025 · Updated 8 months ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU) ☆25 · Aug 4, 2022 · Updated 3 years ago
- ☆13 · Sep 14, 2021 · Updated 4 years ago
- [NeurIPS 2024] Code for Federated Ensemble-Directed Offline Reinforcement Learning ☆27 · Sep 25, 2024 · Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination" ☆110 · Apr 17, 2023 · Updated 2 years ago
- Model-based reinforcement learning (generative simulator models and planning agents) ☆16 · Mar 13, 2026 · Updated 3 weeks ago
- Reinforced Multi-LLM Agents training ☆80 · Jan 18, 2026 · Updated 2 months ago
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed. ☆52 · Jul 28, 2025 · Updated 8 months ago
- ☆24 · Oct 22, 2023 · Updated 2 years ago
- Collection of RL Environments built using Madrona ☆38 · Aug 11, 2023 · Updated 2 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>. ☆23 · Nov 22, 2025 · Updated 4 months ago
- ☆16 · Jul 1, 2021 · Updated 4 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Sep 24, 2019 · Updated 6 years ago
- Reviews of part of courses of AI ☆24 · Jun 19, 2023 · Updated 2 years ago