A curated list of reinforcement learning (RL) for agents.
☆94Mar 30, 2026Updated last month
Alternatives and similar repositories for awesome-rl-for-agents
Users that are interested in awesome-rl-for-agents are comparing it to the libraries listed below.
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆21Dec 5, 2025Updated 5 months ago
- [MMAsia 2023] Official PyTorch implementation of the paper "Cross-Modal Retrieval for Motion and Text via DropTriple Loss"☆37Nov 30, 2024Updated last year
- ☆47Mar 15, 2025Updated last year
- CVPR 2025☆25May 9, 2025Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Jan 5, 2025Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆35Nov 1, 2025Updated 6 months ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆27Apr 18, 2025Updated last year
- [ICLR 2026] SR-Scientist: Scientific Equation Discovery With Agentic AI☆43Jan 27, 2026Updated 3 months ago
- Repository about single/multi-agent, robotics, llm/vlm/vla, scientific discovery, etc.☆19Jul 10, 2025Updated 9 months ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- An inequality benchmark for theorem proving☆22Feb 1, 2026Updated 3 months ago
- ☆510Oct 11, 2025Updated 6 months ago
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"☆22Feb 10, 2025Updated last year
- Another Wheel to parse json☆11Mar 13, 2020Updated 6 years ago
- ☆12Oct 29, 2022Updated 3 years ago
- [Siggraph Asia 2025] Official code release of our paper "Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy"☆60Sep 26, 2025Updated 7 months ago
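- TouchSprite Lua scripts for inflating impressions of various ads: the backend dispatches ad SDK tasks; supports 23 screen resolutions, fake users, and retention reporting with a backend-configurable retention percentage; runs ads from more than 10 platforms at once and reports inflation data in real time. Recommended for use with the Pangu backend system and the fakeapk hook tool.☆13Mar 30, 2018Updated 8 years ago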
- A collection of papers and libraries for performing multi-agent optimization☆18Feb 7, 2026Updated 3 months ago
- ☆13Jan 7, 2023Updated 3 years ago
- ☆16Oct 6, 2024Updated last year
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆23Mar 8, 2026Updated 2 months ago
- A concise guide for new students of Lab 246 at Beijing Language and Culture University (BLCU)☆10May 30, 2022Updated 3 years ago
- Forecastbench Datasets, updated nightly☆28Updated this week
- Training VLM agents with multi-turn reinforcement learning☆454Apr 17, 2026Updated 3 weeks ago
- This is the project page for paper `CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective`, in CVPR2…☆13Mar 19, 2024Updated 2 years ago
- Temporal Graph Rewiring Method with Expander Graphs☆12Oct 18, 2024Updated last year
- Empower your React apps with robust image/document annotation capabilities! 🚀 Supports bounding boxes, polygons, points, zooming, draggi…☆10Feb 12, 2025Updated last year
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- This repo consists all my RL work and learnings☆12Dec 5, 2021Updated 4 years ago
- This repository will contain python code that automates the georeferencing of any image that has a latitude and longitude associated with…☆11May 1, 2021Updated 5 years ago
- Code for paper: Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation☆12Aug 3, 2024Updated last year
- Search Self-Play: Pushing the Frontier of Agent Capability without Supervision☆99Mar 4, 2026Updated 2 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆17Feb 26, 2026Updated 2 months ago
- D3 Force Layout demo written in React and TypeScript☆11Dec 8, 2017Updated 8 years ago
- The official implementation of “Dual Focus-Attention Transformer for Robust Point Cloud Registration”(CVPR2025)☆25Apr 29, 2026Updated last week
- A C++ natural language processing library☆14Jan 22, 2020Updated 6 years ago