thinkwee / AgentsMeetRLLinks
An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.
☆205Updated this week
Alternatives and similar repositories for AgentsMeetRL
Users that are interested in AgentsMeetRL are comparing it to the libraries listed below
Sorting:
- ☆270Updated last month
- Awesome Agent Training☆179Updated this week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆120Updated this week
- ☆266Updated last month
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆197Updated last week
- ☆238Updated last month
- ☆147Updated 5 months ago
- ☆154Updated 2 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆110Updated last month
- A Comprehensive Survey on Long Context Language Modeling☆161Updated this week
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆58Updated last week
- ☆69Updated last month
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …☆109Updated 2 weeks ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆192Updated 2 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆60Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆503Updated 2 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆415Updated this week
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆104Updated 4 months ago
- ☆136Updated last month
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆244Updated 2 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆193Updated 2 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆152Updated 3 weeks ago
- ☆138Updated 2 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆145Updated 6 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆78Updated last month
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆76Updated last month
- A research repo for experiments about Reinforcement Finetuning☆49Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆121Updated 3 months ago
- ☆47Updated 4 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Updated 7 months ago