Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Cooperative and Competitive Environments"
☆17Dec 7, 2024Updated last year
Alternatives and similar repositories for DCVTD
Users that are interested in DCVTD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper "Egoism, Utilitarianism and Egalitarianism in Multi-Agent Reinforcement Learning"☆21Aug 17, 2024Updated last year
- Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning"☆21Aug 17, 2024Updated last year
- Implementation of the paper "Multi-Agent Exploration via Self-Learning and Social Learning"☆20Dec 7, 2024Updated last year
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆12May 2, 2024Updated last year
- A Framework of Continual Learning☆132Dec 9, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"☆12Oct 28, 2022Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- What Has Been Enhanced in my Knowledge-Enhanced Language Model?☆13Oct 26, 2022Updated 3 years ago
- Code Releasement for 'Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model'☆16Apr 26, 2025Updated last year
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆50May 12, 2025Updated 11 months ago
- ☆20Oct 22, 2024Updated last year
- ☆38May 28, 2025Updated 11 months ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆31Nov 24, 2025Updated 5 months ago
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models☆15Nov 4, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Oct 11, 2022Updated 3 years ago
- ☆18Jul 14, 2023Updated 2 years ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent …☆33Jun 2, 2025Updated 10 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Sep 10, 2024Updated last year
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- intrinsic motivation in grid worlds☆26May 3, 2020Updated 5 years ago
- 服务群众:给群众搭建一个南大开源镜像站的帮助文档网站。☆20Dec 29, 2021Updated 4 years ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆35May 13, 2024Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆30Apr 2, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation☆32Mar 28, 2025Updated last year
- PyTorch code for: Frustratingly Simple Domain Generalization via Image Stylization☆23Jun 25, 2020Updated 5 years ago
- RLlib超参数详解(中文)☆18Jan 24, 2022Updated 4 years ago
- ☆18Jan 6, 2025Updated last year
- Rethinking Data Perturbation and Model Stabilization for Semi-supervised Medical Image Segmentation☆14Aug 15, 2023Updated 2 years ago
- 天池工业AI大赛-智能制造质量预测,排名89/2539☆39Oct 11, 2018Updated 7 years ago
- [TPAMI 2023] LibFewShot: A Comprehensive Library for Few-shot Learning.☆1,065Oct 27, 2025Updated 6 months ago
- ☆35Oct 23, 2022Updated 3 years ago
- The official Pytorch implementation of paper Where is My Spot? Few-shot Image Generation via Latent Subspace Optimization, CVPR 2023.☆11Jan 6, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACMMM 2023] BMMAL: Towards Balanced Active Learning for Multimodal Classification☆17Sep 25, 2023Updated 2 years ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆41Apr 17, 2024Updated 2 years ago
- [ICCV 2023 Oral] IOMatch: Simplifying Open-Set Semi-Supervised Learning with Joint Inliers and Outliers Utilization☆56Jan 28, 2024Updated 2 years ago
- Meta learning for generative models.☆16Jul 24, 2019Updated 6 years ago
- Cross Domain Disentangled Deep Representation (CVPR'18)☆12May 15, 2019Updated 6 years ago
- Model Predictive Task Sampling☆87Feb 28, 2026Updated 2 months ago
- ☆26Mar 25, 2023Updated 3 years ago