Shaokang-Agent / DCVTD
Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Cooperative and Competitive Environments"
☆17 Updated 5 months ago
Alternatives and similar repositories for DCVTD
Users who are interested in DCVTD are comparing it to the repositories listed below.
- Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning" ☆20 Updated 9 months ago
- Implementation of the paper "Multi-Agent Exploration via Self-Learning and Social Learning" ☆20 Updated 5 months ago
- Implementation of the paper "Egoism, Utilitarianism and Egalitarianism in Multi-Agent Reinforcement Learning" ☆20 Updated 9 months ago
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al. ☆12 Updated last year
- 📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc. | (AI… ☆12 Updated last year
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety. ☆31 Updated this week
- A Framework of Continual Learning ☆104 Updated 2 weeks ago
- ☆95 Updated last year
- Model Predictive Task Sampling ☆20 Updated 2 months ago
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext… ☆118 Updated last year
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23. ☆14 Updated last year
- Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation) ☆20 Updated 9 months ago
- A beamer template for LAMDA lab at NJU ☆14 Updated 4 years ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆51 Updated last month
- The official implementation of the CVPR'2024 work Interference-Free Low-Rank Adaptation for Continual Learning ☆77 Updated 2 months ago
- MineStudio: A Streamlined Package for Minecraft AI Agent Development ☆238 Updated this week
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making" ☆12 Updated last year
- Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting ☆17 Updated 3 months ago
- ☆14 Updated 2 years ago
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models ☆14 Updated last year
- A PyTorch implementation of Implicit Q-Learning ☆81 Updated 3 years ago
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions" ☆14 Updated 2 months ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆152 Updated last year
- PyTorch implementation of various distillation approaches for continual learning of Diffusion Models. ☆22 Updated 2 months ago
- Official Implementation of CL-ALFRED (ICLR'24) ☆22 Updated 6 months ago
- A trustworthy benchmark for IAIR Reinforcement Learning homework ☆9 Updated 2 years ago
- ☆14 Updated last year
- Code for GO4Align: Group Optimization for Multi-Task Alignment ☆19 Updated 7 months ago
- ☆16 Updated last year
- Text world based on Minecraft rules. ☆15 Updated last year