tiannuo-yang / SearchAgent-XLinks
A High-Efficiency System of Large Language Model Based Search Agents
☆41Updated last week
Alternatives and similar repositories for SearchAgent-X
Users that are interested in SearchAgent-X are comparing it to the libraries listed below
Sorting:
- ☆47Updated 5 months ago
- ☆47Updated 3 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆79Updated 6 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆54Updated 3 months ago
- ☆59Updated last week
- ☆83Updated 2 weeks ago
- ☆49Updated 3 weeks ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆54Updated 2 weeks ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆75Updated 2 weeks ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- On Memorization of Large Language Models in Logical Reasoning☆65Updated 2 months ago
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra…☆35Updated 2 weeks ago
- ☆68Updated 8 months ago
- ☆94Updated 6 months ago
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆32Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆94Updated 3 months ago
- ☆25Updated 3 months ago
- ☆102Updated 5 months ago
- ☆94Updated 5 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆55Updated 8 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆98Updated last month
- Simple extension on vLLM to help you speed up reasoning model without training.☆152Updated this week
- ☆104Updated last month
- Repo for "Z1: Efficient Test-time Scaling with Code"☆59Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆57Updated 3 weeks ago
- ☆18Updated last month
- PGRAG☆48Updated 10 months ago
- ☆45Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆111Updated 2 months ago