[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".
☆21Jul 26, 2025Updated 7 months ago
Alternatives and similar repositories for PiCor
Users that are interested in PiCor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆34Feb 15, 2025Updated last year
- VGDFR: Diffuison-based Video Generation with Dynamic Frame Rate☆17May 16, 2025Updated 10 months ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆55Nov 22, 2025Updated 4 months ago
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆14Dec 13, 2024Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- [SIGKDD 2023] HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline☆22Jun 16, 2023Updated 2 years ago
- An NLP research and data collection platform.☆17Mar 13, 2024Updated 2 years ago
- ☆46Dec 21, 2025Updated 3 months ago
- ☆29Mar 13, 2026Updated last week
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”.☆21Jul 27, 2025Updated 7 months ago
- ☆13Oct 11, 2022Updated 3 years ago
- ☆17Apr 8, 2025Updated 11 months ago
- ☆14Jun 28, 2022Updated 3 years ago
- The implementation of the paper "Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects". [NeurIP…☆17Jun 13, 2025Updated 9 months ago
- Direct X game controller server/client written in Python☆10Jul 10, 2018Updated 7 years ago
- ☆15Mar 26, 2024Updated last year
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- ☆18May 28, 2024Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated last month
- Preliminary version of AutoBio (https://arxiv.org/abs/2505.14030)☆70Jun 12, 2025Updated 9 months ago
- RL environment replicating the werewolf game to study emergent communication☆20May 25, 2023Updated 2 years ago
- Telegram bot which sends alerts when new papers, articles, books, etc. related to your keywords are released on Google Scholar or arXiv☆12Feb 26, 2022Updated 4 years ago
- This is a repository for Hidden-utility Self-Play.☆26Jul 27, 2023Updated 2 years ago
- A large-scale multi-modal pre-trained model☆134Feb 7, 2023Updated 3 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆77May 28, 2024Updated last year
- The official codebase for "AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning" (ICLR 2025)☆31Jul 21, 2025Updated 8 months ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- ☆13Sep 14, 2021Updated 4 years ago
- [NeurIPS 2024] Code for Federated Ensemble-Directed Offline Reinforcement Learning☆27Sep 25, 2024Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Apr 17, 2023Updated 2 years ago
- [ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system☆125Updated this week
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated last week
- Reinforced Multi-LLM Agents training☆76Jan 18, 2026Updated 2 months ago
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.☆51Jul 28, 2025Updated 7 months ago
- ☆24Oct 22, 2023Updated 2 years ago
- Collection of RL Environments built using Madrona☆37Aug 11, 2023Updated 2 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆23Nov 22, 2025Updated 4 months ago
- ☆15Jul 1, 2021Updated 4 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago