tongjingqi / Awesome-Agent-RL
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research and practical guides on defining and collecting rewards to build more intelligent and aligned AI agents.
☆55Sep 1, 2025Updated 5 months ago
Alternatives and similar repositories for Awesome-Agent-RL
Users who are interested in Awesome-Agent-RL are comparing it to the libraries listed below
- UnifiedToolHub is a comprehensive project supporting LLM-based tool use, designed to unify various tool-use dataset formats and provide t…☆19Jul 23, 2025Updated 6 months ago
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆237Jan 31, 2026Updated 2 weeks ago
- VehicleWorld is the first comprehensive multi-device environment for intelligent vehicle interaction that accurately models the complex, …☆21Sep 16, 2025Updated 5 months ago
- ☆38Oct 2, 2024Updated last year
- Lecture Notes for Scientific Machine Learning 2025☆31Oct 25, 2025Updated 3 months ago
- Open-source Traditional Chinese Medical Large Language Models. (A collection of open-source Chinese medical large models)☆47Oct 5, 2025Updated 4 months ago
- ☆15Nov 11, 2025Updated 3 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆70Jul 13, 2025Updated 7 months ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning☆134Jan 31, 2026Updated 2 weeks ago
- Predicting Activated Sludge Microbial Communities based on time series of continuous sludge samples by using graph neural networks☆17Feb 2, 2026Updated 2 weeks ago
- open source code for NeurIPS 2024 paper☆12Nov 9, 2025Updated 3 months ago
- GreenLambert macOS IDA plugin to deobfuscate strings☆14Oct 4, 2021Updated 4 years ago
- ☆14Feb 5, 2025Updated last year
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- Document intricacies of using WinDBG to aid Rust project development☆15Nov 19, 2024Updated last year
- ☆12Jun 21, 2025Updated 7 months ago
- Content Moderation using Reality.Eth with Kleros arbitration☆12Feb 19, 2025Updated 11 months ago
- Code for ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment☆11Feb 28, 2024Updated last year
- MOSS-Speech is a true speech-to-speech large language model without text guidance.☆122Dec 4, 2025Updated 2 months ago
- https://avocado-captioner.github.io/☆28Oct 16, 2025Updated 4 months ago
- A supervised fine-tuning method for controllable reasoning length in large language models☆10May 8, 2025Updated 9 months ago
- Information Extraction related tools and models☆10Mar 16, 2023Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- R1V, trained with AI feedback, answers open-ended visual questions.☆14Apr 12, 2025Updated 10 months ago
- MetaFX – library for feature extraction from whole-genome metagenome sequencing data☆16Feb 17, 2025Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 3 months ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- a Video Quality Analysis Toolkit☆13May 16, 2025Updated 9 months ago
- Antimicrobial Peptide Structural Evolution Miner (AMP-SEMiner), an integrated AI framework designed for the simultaneous identification o…☆13May 10, 2025Updated 9 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- A Python tool that helps you interact with ChatGPT.☆10Dec 11, 2022Updated 3 years ago
- Backend services for an AI-powered, privacy-first team collaboration platform. Manages secure data, AI processing, and real-time communic…☆18Oct 16, 2025Updated 4 months ago
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 3 months ago
- Scripts for KGIRNet model for ESWC☆10Jul 6, 2023Updated 2 years ago
- ☆12Jan 25, 2024Updated 2 years ago
- This is a repository containing code for a hybrid quantum-classical transformer model from the paper: A Hybrid Transformer Architecture w…☆20Mar 6, 2025Updated 11 months ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆30Jan 29, 2026Updated 2 weeks ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆28Dec 10, 2025Updated 2 months ago
- ☆12Jan 2, 2024Updated 2 years ago