Shaokang-Agent / Awesome-Reinforcement-Learning-PapersView external linksLinks
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc. | (AI顶会已录用的强化学习方向文章,持续更新)
☆11Aug 20, 2023Updated 2 years ago
Alternatives and similar repositories for Awesome-Reinforcement-Learning-Papers
Users that are interested in Awesome-Reinforcement-Learning-Papers are comparing it to the libraries listed below
Sorting:
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆12May 2, 2024Updated last year
- Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning"☆21Aug 17, 2024Updated last year
- Implementation of the paper "Egoism, Utilitarianism and Egalitarianism in Multi-Agent Reinforcement Learning"☆21Aug 17, 2024Updated last year
- Implementation of the paper "Multi-Agent Exploration via Self-Learning and Social Learning"☆20Dec 7, 2024Updated last year
- Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Coo…☆17Dec 7, 2024Updated last year
- Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"☆12Oct 28, 2022Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- Bird’s Eye: Probing for Linguistic Graph Structureswith a Simple Information-Theoretic Approach☆11Aug 1, 2021Updated 4 years ago
- What Has Been Enhanced in my Knowledge-Enhanced Language Model?☆13Oct 26, 2022Updated 3 years ago
- ☆15Oct 11, 2022Updated 3 years ago
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models☆15Nov 4, 2023Updated 2 years ago
- A Framework of Continual Learning☆130Dec 9, 2025Updated 2 months ago
- Code Releasement for 'Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model'☆15Apr 26, 2025Updated 9 months ago
- A Representation Learning Framework for Property Graphs☆11Dec 21, 2019Updated 6 years ago
- ☆37May 28, 2025Updated 8 months ago
- ☆13Feb 8, 2022Updated 4 years ago
- 经典坦克大战游戏(SDL2 + C++开发)☆13Apr 9, 2017Updated 8 years ago
- [ACL 2025] A curated list of papers and resources based on "PlanGenLLMs: A Modern Survey of LLM Planning Capabilities"☆30Jun 10, 2025Updated 8 months ago
- ☆12Feb 23, 2023Updated 2 years ago
- 这是中国人民大学高瓴人工智能学院本科课程《强化学习》的期末项目安排,项目内容是训练一个适用于国标麻将的强化学习智能体。☆27Aug 25, 2024Updated last year
- This is the python implementation of the NEDI (New Edge-Directed Interpolation)☆15Sep 29, 2020Updated 5 years ago
- This repository contains a collection of the most influential papers, and benchmarks related to Large Language Models (LLMs) based Agent …☆46Jul 7, 2025Updated 7 months ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆28Nov 24, 2025Updated 2 months ago
- How to use Bootstrap with Flask - Free Sample | AppSeed☆17Mar 12, 2024Updated last year
- Matlab implementation of Echo State Network (reservoir computing)☆26Aug 3, 2017Updated 8 years ago
- Python Fan calculator for Chinese Standard Mahjong☆28Jan 26, 2025Updated last year
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent …☆29Jun 2, 2025Updated 8 months ago
- Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation☆31Mar 28, 2025Updated 10 months ago
- ☆26Apr 21, 2023Updated 2 years ago
- This is the official implementation of Multi-Agent PPO.☆133Jan 17, 2023Updated 3 years ago
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆42May 12, 2025Updated 9 months ago
- iNEDI (improved New Edge-Directed Interpolation) Image Zooming Algorithm☆30May 13, 2025Updated 9 months ago
- SCUT Robotlab Middlewares Layer Library☆31May 6, 2021Updated 4 years ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- A Doudizhu reinforcement learning AI☆45May 20, 2025Updated 8 months ago
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆41Oct 15, 2024Updated last year
- 一个专门为大模型设计的财经信息MCP(Model Context Protocol)服务,通过高效的爬虫技术从各大财经网站(同花顺、东方财富等)获取实时资讯,为AI模型提供准确、及时的财经数据支持。☆94Dec 15, 2025Updated 2 months ago
- Analyzing code using GPT .(通过GPT分析代码,增加注释,生成文档)☆40Jan 6, 2024Updated 2 years ago
- ☆34Jul 18, 2019Updated 6 years ago