A curated list of reinforcement learning (RL) for agents.
☆90Mar 30, 2026Updated 2 weeks ago
Alternatives and similar repositories for awesome-rl-for-agents
Users that are interested in awesome-rl-for-agents are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆19Dec 5, 2025Updated 4 months ago
- [NeurIPS 2024] CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition☆16Nov 12, 2025Updated 5 months ago
- [MMAsia 2023] Official PyTorch implementation of the paper " Cross-Modal Retrieval for Motion and Text via DropTriple Loss "☆37Nov 30, 2024Updated last year
- [IEEE TIP 2024] Facial Prior Guided Micro-Expression Generation☆13Nov 8, 2024Updated last year
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆40Aug 15, 2025Updated 8 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 3 years ago
- CVPR 2025☆25May 9, 2025Updated 11 months ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆34Nov 1, 2025Updated 5 months ago
- [TPAMI 2026] Implementation of the paper “Heatmap Pooling for Action Recognition from RGB Videos”.☆65Feb 20, 2026Updated last month
- ☆17Feb 24, 2025Updated last year
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆16Sep 15, 2024Updated last year
- ☆11Jul 31, 2020Updated 5 years ago
- [ICLR 2026] SR-Scientist: Scientific Equation Discovery With Agentic AI☆38Jan 27, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Repository about single/multi-agent, robotics, llm/vlm/vla, scientific discovery, etc.☆19Jul 10, 2025Updated 9 months ago
- MICCAI 2024: Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images☆27Apr 3, 2025Updated last year
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 6 months ago
- ☆30Mar 11, 2025Updated last year
- [TRIT 2024] Implementation of the paper “Explore Human Parsing Modality for Action Recognition”.☆38Aug 26, 2024Updated last year
- OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems☆122Jul 13, 2025Updated 9 months ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- ☆10Apr 26, 2023Updated 2 years ago
- An inequality benchmark for theorem proving☆22Feb 1, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- DCPO: Dynamic Adaptive Clipping for RL☆48Apr 1, 2026Updated 2 weeks ago
- A project about deploying a yolo server to support inferring image sent by different clients.☆10Mar 23, 2024Updated 2 years ago
- ☆506Oct 11, 2025Updated 6 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆21Mar 8, 2026Updated last month
- PyTorch code of “Out-of-Sample Representation Learning for Multi-Relational Graphs” (EMNLP 2020)☆10Oct 2, 2020Updated 5 years ago
- ☆12Oct 29, 2022Updated 3 years ago
- ☆11Dec 6, 2020Updated 5 years ago
- A collection of papers and libraries for performing multi-agent optimization☆18Feb 7, 2026Updated 2 months ago
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Nov 16, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CICAI 2023] Implementation of the paper “Integrating Human Parsing and Pose Network for Human Action Recognition”.☆11Sep 24, 2024Updated last year
- Multiview variant of Pointpillars. Contains Pytorch reimplementation of Pillar-od.☆14Jan 15, 2021Updated 5 years ago
- ICML 2024 - Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated Learning☆10Jul 16, 2024Updated last year
- The official implementation of “Dual Focus-Attention Transformer for Robust Point Cloud Registration”(CVPR2025)☆21Mar 31, 2026Updated 2 weeks ago
- ☆38Oct 25, 2025Updated 5 months ago
- Training VLM agents with multi-turn reinforcement learning☆444Apr 11, 2026Updated last week
- ☆13Dec 9, 2022Updated 3 years ago