170928 / -Review-Multi-Agent-Actor-Critic-for-Mixed-Cooperative-Competitive-EnvironmentView external linksLinks
[Review] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environment
☆10Dec 22, 2018Updated 7 years ago
Alternatives and similar repositories for -Review-Multi-Agent-Actor-Critic-for-Mixed-Cooperative-Competitive-Environment
Users that are interested in -Review-Multi-Agent-Actor-Critic-for-Mixed-Cooperative-Competitive-Environment are comparing it to the libraries listed below
Sorting:
- An RPG Maker MZ plugin☆12Nov 2, 2023Updated 2 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection☆10Oct 22, 2024Updated last year
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 7 years ago
- Pytorch implementation of NASA: NEURAL ARTICULATED SHAPE APPROXIMATION☆12May 4, 2021Updated 4 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆16Mar 14, 2025Updated 11 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated 11 months ago
- The source code for the paper "Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation, ACM TOIS 2026".☆13Dec 20, 2025Updated last month
- ☆11Mar 5, 2024Updated last year
- SImple Tensorflow implementations of " Image-to-Image Translation with Conditional Adversarial Networks" (CVPR 2017)☆10Apr 11, 2018Updated 7 years ago
- A short guide and example on how to fine-tune OpenAI's gpt-3.5-turbo for better roleplay☆14Aug 26, 2023Updated 2 years ago
- Inverse kinematic solver (FABRIK) for a simple 3D chain☆12Apr 23, 2021Updated 4 years ago
- [CVPR 2025 - HuMoGen] "MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty"☆16Mar 12, 2025Updated 11 months ago
- 历年EMNLP论文和开源项目合集,包含EMNLP2025、EMNLP2024、EMNLP2023、EMNLP2022、EMNLP2021。☆19Jul 16, 2025Updated 7 months ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D☆15Nov 9, 2023Updated 2 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Audiense Insights MCP Server is a server based on the Model Context Protocol (MCP) that allows Claude and other MCP-compatible clients to…☆17Jun 16, 2025Updated 8 months ago
- Local LLM Discord Bot☆18Jun 20, 2025Updated 7 months ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- ☆21Mar 25, 2025Updated 10 months ago
- Locust on k8s example for scalable load tests☆14Apr 16, 2022Updated 3 years ago
- research and implementations of recurrent neural networks and their applications☆14Aug 31, 2025Updated 5 months ago
- Predict links in a citation network☆12Mar 20, 2018Updated 7 years ago
- MABA stable PD controller☆17Oct 20, 2020Updated 5 years ago
- ☆20Dec 5, 2025Updated 2 months ago
- Scala implementation of Aho-Corasick algorithm☆15May 6, 2022Updated 3 years ago
- Sources for a Medium article☆11Dec 2, 2021Updated 4 years ago
- MCP Server for Trino☆18Apr 22, 2025Updated 9 months ago
- gym 框架下的多智能体追逃博弈强化学习平台☆17Jun 20, 2023Updated 2 years ago
- This project analyzes code and generates report using AI. It takes a folder of code, sends it to Claude API, and outputs result to a file…☆16Oct 22, 2023Updated 2 years ago
- multi-agent reinforcement learning for competitive environments using pytorch☆14Dec 31, 2019Updated 6 years ago
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 4 months ago
- AI-powered fashion recommendation system leveraging LLMs, embeddings, and retrieval techniques to deliver personalized shopping experienc…☆30Jul 23, 2025Updated 6 months ago
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Feb 28, 2018Updated 7 years ago
- ☆16Oct 9, 2021Updated 4 years ago