ChangWinde/PiCor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ChangWinde/PiCor)

ChangWinde / PiCor

[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".

☆21

Alternatives and similar repositories for PiCor

Users that are interested in PiCor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ChangWinde / RAT
View on GitHub
[AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"
☆34Feb 15, 2025Updated last year
sjtu-marl / ZSC-Eval
View on GitHub
This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…
☆56Nov 22, 2025Updated 8 months ago
SijiaCui / play-urts
View on GitHub
☆15Oct 28, 2024Updated last year
Stanford-ILIAD / Diverse-Conventions
View on GitHub
Exploring techniques to generate diverse conventions in multi-agent settings
☆16Nov 14, 2023Updated 2 years ago
Qwen-Applications / CLIPO
View on GitHub
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR
☆21Apr 7, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xihuai18 / A2PO-ICLR2023
View on GitHub
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32Nov 22, 2025Updated 8 months ago
liyang619 / COLE-Platform
View on GitHub
Overcooked human-AI experiment platform
☆41Dec 21, 2023Updated 2 years ago
liy1shu / FlowBotHD
View on GitHub
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
☆13Dec 13, 2024Updated last year
HumanCompatibleAI / human_aware_rl
View on GitHub
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
☆112Apr 17, 2023Updated 3 years ago
UniquezCs / Narwhal-volumetric-video-streaming-system
View on GitHub
☆12Feb 17, 2022Updated 4 years ago
lil-lab / cb2
View on GitHub
An NLP research and data collection platform.
☆17Jul 4, 2026Updated 2 weeks ago
wsjeon / multiagent-gail
View on GitHub
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
☆13Aug 17, 2019Updated 6 years ago
zhanghuanhuan1994 / arsenal
View on GitHub
☆12Apr 12, 2022Updated 4 years ago
amimibear / KartRider
View on GitHub
a toolkit of KartRider
☆17Oct 8, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
kid-yang233 / robots
View on GitHub
The homework of robos learning base.
☆11May 23, 2023Updated 3 years ago
azuredsky / YoloOCLInference
View on GitHub
An extremely light weight tiny-YOLO inference engine targeted towards OpenCL hardware.
☆16Oct 15, 2017Updated 8 years ago
sjtu-marl / DPT-Agent
View on GitHub
This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…
☆61Nov 22, 2025Updated 8 months ago
mit-ll / hanabi_AnyPlay
View on GitHub
☆15Jun 28, 2022Updated 4 years ago
HumanCompatibleAI / overcooked-hAI-exp
View on GitHub
Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)
☆13May 10, 2021Updated 5 years ago
chenxie95 / deeplearning_course_sjtu
View on GitHub
☆18Apr 8, 2025Updated last year
Thinklab-SJTU / HardSATGEN
View on GitHub
[SIGKDD 2023] HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline
☆23Jun 16, 2023Updated 3 years ago
pavancm / GREED
View on GitHub
Official implementation for "ST-GREED: Space-Time Generalized EntropicDifferences for Frame Rate Dependent VideoQuality"
☆17Jun 22, 2022Updated 4 years ago
SkyRiver-2000 / TRAD-Official
View on GitHub
[SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision
☆20Mar 28, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ProsusAI / stack-eval
View on GitHub
Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288
☆20Oct 30, 2024Updated last year
Tencent-RoboticsX / CraftEnv
View on GitHub
A flexible Multi-Agent Reinforcement Learning (MARL) environment for Collective Robotic Construction (CRC) systems
☆13Mar 22, 2023Updated 3 years ago
bigrl-team / gear
View on GitHub
A distributed GPU-centric experience replay system for large AI models.
☆19Aug 1, 2023Updated 2 years ago
TritiumR / Where2Explore
View on GitHub
The implementation of the paper "Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects". [NeurIP…
☆15Jun 13, 2025Updated last year
FYQ0919 / PTSA-MCTS
View on GitHub
A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].
☆16Oct 21, 2023Updated 2 years ago
nicofirst1 / rl_werewolf
View on GitHub
RL environment replicating the werewolf game to study emergent communication
☆20May 25, 2023Updated 3 years ago
PKU-Alignment / ReDMan
View on GitHub
ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Man…
☆29May 2, 2023Updated 3 years ago
ventr1c / memma
View on GitHub
The official repository of "MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution".
☆19Mar 20, 2026Updated 4 months ago
2644521362 / SC-MLLM
View on GitHub
☆18May 28, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fhamborg / NewsWCL50
View on GitHub
The first, open access evaluation dataset for methods to identify bias by word choice and labeling
☆26Oct 30, 2025Updated 8 months ago
initial-h / FlappyBird_DQN_with_target_network
View on GitHub
DQN with freezing target network in tensorflow on pygame FlappyBird
☆11Dec 19, 2018Updated 7 years ago
godmoves / DeeperStack
View on GitHub
This is an implementation of DeepStack for No Limit Texas Hold'em, extended from DeepStack-Leduc.
☆26Jun 16, 2019Updated 7 years ago
hairuoliu1 / ICLR-2025-Robotics
View on GitHub
A list of robotics related papers accepted by ICLR'25
☆25Aug 28, 2025Updated 10 months ago
Aladoro / Stabilizing-Off-Policy-RL
View on GitHub
☆18Aug 3, 2022Updated 3 years ago
RufaelDev / pcc-mp3dg
View on GitHub
test code pcc in 3dg
☆19Nov 20, 2017Updated 8 years ago
jon--lee / decision-pretrained-transformer
View on GitHub
Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…
☆79May 28, 2024Updated 2 years ago