Shaokang-Agent / D-F
Implementation of the paper "Egoism, Utilitarianism and Egalitarianism in Multi-Agent Reinforcement Learning"
☆21 · Aug 17, 2024 · Updated last year
Alternatives and similar repositories for D-F
Users that are interested in D-F are comparing it to the libraries listed below
- Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Coo… ☆17 · Dec 7, 2024 · Updated last year
- Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning" ☆21 · Aug 17, 2024 · Updated last year
- Implementation of the paper "Multi-Agent Exploration via Self-Learning and Social Learning" ☆20 · Dec 7, 2024 · Updated last year
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al. ☆12 · May 2, 2024 · Updated last year
- 📚 List of top-tier conference papers on Reinforcement Learning (RL), including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc. | (AI… ☆11 · Aug 20, 2023 · Updated 2 years ago
- Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text" ☆12 · Oct 28, 2022 · Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23. ☆16 · Aug 14, 2023 · Updated 2 years ago
- A Framework of Continual Learning ☆130 · Dec 9, 2025 · Updated 2 months ago
- SocialJax: sequential social dilemma environments ☆67 · Nov 25, 2025 · Updated 2 months ago
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety. ☆42 · May 12, 2025 · Updated 9 months ago
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models ☆15 · Nov 4, 2023 · Updated 2 years ago
- solver for discrete Mixed Observable Markov Decision Processes ☆11 · Oct 30, 2020 · Updated 5 years ago
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l… ☆10 · May 5, 2022 · Updated 3 years ago
- POPGym Library in JAX ☆12 · Apr 15, 2024 · Updated last year
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption" ☆13 · Aug 2, 2024 · Updated last year
- ☆10 · Sep 9, 2022 · Updated 3 years ago
- ☆12 · Mar 12, 2024 · Updated last year
- ☆19 · Oct 22, 2024 · Updated last year
- StyleSwin: Transformer-based GAN for High-resolution Image Generation ☆11 · Dec 21, 2021 · Updated 4 years ago
- The official Pytorch implementation of paper Where is My Spot? Few-shot Image Generation via Latent Subspace Optimization, CVPR 2023. ☆11 · Jan 6, 2024 · Updated 2 years ago
- Classic Battle City tank game (developed with SDL2 + C++) ☆13 · Apr 9, 2017 · Updated 8 years ago
- This is the Python implementation of NEDI (New Edge-Directed Interpolation) ☆15 · Sep 29, 2020 · Updated 5 years ago
- This repository contains a collection of the most influential papers, and benchmarks related to Large Language Models (LLMs) based Agent … ☆46 · Jul 7, 2025 · Updated 7 months ago
- ☆15 · Jul 14, 2023 · Updated 2 years ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents ☆28 · Nov 24, 2025 · Updated 2 months ago
- Cross Domain Disentangled Deep Representation (CVPR'18) ☆12 · May 15, 2019 · Updated 6 years ago
- Meta learning for generative models. ☆16 · Jul 24, 2019 · Updated 6 years ago
- Tensorflow code for paper: Deformable Generator Network: Unsupervised Disentanglement of Appearance and Geometry ☆18 · Nov 3, 2018 · Updated 7 years ago
- Matlab implementation of Echo State Network (reservoir computing) ☆26 · Aug 3, 2017 · Updated 8 years ago
- Original PyTorch implementation of PMIC from PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Colla… ☆21 · Mar 26, 2024 · Updated last year
- ☆26 · Mar 25, 2023 · Updated 2 years ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent … ☆29 · Jun 2, 2025 · Updated 8 months ago
- intrinsic motivation in grid worlds ☆26 · May 3, 2020 · Updated 5 years ago
- Data and code for the paper "NormBank: A Knowledge Bank of Situational Social Norms" ☆32 · Jul 18, 2023 · Updated 2 years ago
- [Siggraph Asia 2025] Official code release of our paper "Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy" ☆57 · Sep 26, 2025 · Updated 4 months ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆25 · Jan 16, 2024 · Updated 2 years ago
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency" ☆41 · Dec 26, 2025 · Updated last month
- Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs). ☆61 · Jan 19, 2026 · Updated 3 weeks ago
- This is the official implementation of Multi-Agent PPO. ☆133 · Jan 17, 2023 · Updated 3 years ago