[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".
☆21Jul 26, 2025Updated 7 months ago
Alternatives and similar repositories for PiCor
Users that are interested in PiCor are comparing it to the libraries listed below
Sorting:
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆34Feb 15, 2025Updated last year
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆54Nov 22, 2025Updated 3 months ago
- Direct X game controller server/client written in Python☆10Jul 10, 2018Updated 7 years ago
- Telegram bot which sends alerts when new papers, articles, books, etc. related to your keywords are released on Google Scholar or arXiv☆12Feb 26, 2022Updated 4 years ago
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆14Dec 13, 2024Updated last year
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- VGDFR: Diffuison-based Video Generation with Dynamic Frame Rate☆17May 16, 2025Updated 9 months ago
- ☆14Jun 28, 2022Updated 3 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated 3 weeks ago
- ☆13Oct 11, 2022Updated 3 years ago
- ☆46Dec 21, 2025Updated 2 months ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- ☆13Sep 14, 2021Updated 4 years ago
- ☆15Mar 26, 2024Updated last year
- An NLP research and data collection platform.☆17Mar 13, 2024Updated last year
- Production calculator for Anno 1800☆12Jan 7, 2023Updated 3 years ago
- ☆16Apr 8, 2025Updated 10 months ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Sep 15, 2021Updated 4 years ago
- ☆15Jul 1, 2021Updated 4 years ago
- [SIGKDD 2023] HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline☆22Jun 16, 2023Updated 2 years ago
- Waldorf is an efficient, parallel task execution framework written in Python.☆20Jul 20, 2023Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- The implementation of the paper "Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects". [NeurIP…☆17Jun 13, 2025Updated 8 months ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆78May 28, 2024Updated last year
- Assistant tool for game Anno 1800☆14Dec 9, 2022Updated 3 years ago
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku☆14May 22, 2023Updated 2 years ago
- Belief-state planning for POMDPs using learned approximations☆23Jan 21, 2025Updated last year
- The official codebase for "AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning" (ICLR 2025)☆29Jul 21, 2025Updated 7 months ago
- RL environment replicating the werewolf game to study emergent communication☆20May 25, 2023Updated 2 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation☆22Jul 25, 2022Updated 3 years ago
- ☆18May 28, 2024Updated last year
- ☆19Jun 25, 2023Updated 2 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆19Aug 6, 2018Updated 7 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆22Dec 29, 2023Updated 2 years ago
- 魔兽世界怀旧服 团队buf检查监控插件☆19Sep 2, 2020Updated 5 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Framework to build and train RL algorithms☆39Oct 11, 2021Updated 4 years ago
- This is a repository for Hidden-utility Self-Play.☆26Jul 27, 2023Updated 2 years ago