A curated list of reinforcement learning (RL) for agents.
☆98Jun 6, 2026Updated last week
Alternatives and similar repositories for awesome-rl-for-agents
Users that are interested in awesome-rl-for-agents are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition☆16Nov 12, 2025Updated 7 months ago
- [IROS 2023] Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition☆21Jul 12, 2025Updated 11 months ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Apr 23, 2024Updated 2 years ago
- ☆48Mar 15, 2025Updated last year
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- R1V, trained with AI feedback, answers open-ended visual questions.☆14Apr 12, 2025Updated last year
- [TPAMI 2026] Implementation of the paper “Heatmap Pooling for Action Recognition from RGB Videos”.☆67Feb 20, 2026Updated 3 months ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆15Aug 30, 2021Updated 4 years ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- ☆11Jul 31, 2020Updated 5 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- Repository about single/multi-agent, robotics, llm/vlm/vla, scientific discovery, etc.☆20Jul 10, 2025Updated 11 months ago
- [ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dial…☆30Apr 1, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- [TRIT 2024] Implementation of the paper “Explore Human Parsing Modality for Action Recognition”.☆39Aug 26, 2024Updated last year
- ☆30Mar 11, 2025Updated last year
- [ACL 2026] OPT-BENCH: Evaluating the Iterative Self-Optimization of LLM Agents in Large-Scale Search Spaces☆125May 12, 2026Updated last month
- A project about deploying a yolo server to support inferring image sent by different clients.☆10Mar 23, 2024Updated 2 years ago
- ☆513Oct 11, 2025Updated 8 months ago
- PyTorch code of “Out-of-Sample Representation Learning for Multi-Relational Graphs” (EMNLP 2020)☆10Oct 2, 2020Updated 5 years ago
- Another Wheel to parse json☆11Mar 13, 2020Updated 6 years ago
- ☆12Oct 29, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Dec 6, 2020Updated 5 years ago
- A collection of papers and libraries for performing multi-agent optimization☆19Jun 6, 2026Updated last week
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"☆24Feb 10, 2025Updated last year
- ☆55Apr 7, 2026Updated 2 months ago
- Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"☆27Jun 28, 2023Updated 2 years ago
- Materials for paper "Are Large Language Models Temporally Grounded?"☆14Nov 16, 2023Updated 2 years ago
- ICML 2024 - Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated Learning☆10Jul 16, 2024Updated last year
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- 北语 246 实验室新生简明指南☆10May 30, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Forecastbench Datasets, updated nightly☆30Updated this week
- 空间推理验证码生成器☆20Jun 26, 2019Updated 6 years ago
- Pytorch I3D implmentation on Toyota Smarthome Dataset☆18Apr 23, 2022Updated 4 years ago
- A method which takes advantage of causal features for classification☆24Nov 17, 2018Updated 7 years ago
- This is the project page for paper `CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective`, in CVPR2…☆13Mar 19, 2024Updated 2 years ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 5 months ago
- Training VLM agents with multi-turn reinforcement learning☆472May 11, 2026Updated last month