inclusionAI / AWorld-RLLinks
Agentic Learning Powered by AWorld
☆30Updated this week
Alternatives and similar repositories for AWorld-RL
Users that are interested in AWorld-RL are comparing it to the libraries listed below
Sorting:
- MCP server integrating GEPA (Genetic-Evolutionary Prompt Architecture) for automatic prompt optimization with Claude Desktop☆38Updated 2 months ago
- ☆95Updated 10 months ago
- ☆26Updated 4 months ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆72Updated 2 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆59Updated 3 months ago
- Code for Robust Fine-tuning (RbFT)☆15Updated 8 months ago
- DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking☆49Updated 7 months ago
- Code and data for QueryAgent(ACL 2024)☆21Updated 10 months ago
- ☆16Updated 3 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 5 months ago
- ☆79Updated last year
- This is the official code for the paper "ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinf…☆26Updated last week
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆37Updated 4 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆185Updated 3 weeks ago
- ☆31Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆36Updated 2 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆43Updated 6 months ago
- Informative Conversational Query Rewriting☆33Updated last year
- ☆32Updated last year
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆73Updated 3 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆63Updated last month
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆71Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆87Updated 5 months ago
- ☆46Updated 4 months ago
- An interactive thinking and deep reasoning model. It provides a cognitive reasoning paradigm for complex multi-hop problems.☆67Updated 3 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆99Updated 8 months ago
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆31Updated 4 months ago
- ☆83Updated last year
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆30Updated 6 months ago