Liushiyu-0709 / BAPO-Reliable-SearchLinks
β21Updated last week
Alternatives and similar repositories for BAPO-Reliable-Search
Users that are interested in BAPO-Reliable-Search are comparing it to the libraries listed below
Sorting:
- β137Updated last month
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyondβ340Updated last week
- π§Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learningβ312Updated 3 weeks ago
- β303Updated 6 months ago
- Latest Advances on Long Chain-of-Thought Reasoningβ601Updated 6 months ago
- β57Updated 7 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β258Updated 5 months ago
- β421Updated 3 months ago
- A comprehensive collection of process reward models.β134Updated 3 months ago
- Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.β70Updated 3 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1β32Updated 11 months ago
- RAG methods, benchmarks, and toolkitsβ19Updated last year
- β182Updated last week
- Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.β45Updated 3 weeks ago
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generationβ56Updated 3 months ago
- β152Updated 8 months ago
- EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimizationβ55Updated 4 months ago
- β59Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ168Updated last year
- simpleR1: A Simple Framework for Training R1-like Modelsβ30Updated 5 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).β40Updated last year
- The official code of ARPO & AEPOβ872Updated 3 weeks ago
- β332Updated 8 months ago
- A Collection of Papers about Memory for Language Agentsβ289Updated last week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"β151Updated last year
- π Awesome Agentic Search is a curated list of papers, tools, and resources on agentic searchβwhere AI agents plan, search, and reason toβ¦β52Updated 5 months ago
- Controllable Text Generation for Large Language Models: A Surveyβ199Updated last year
- A list of awesome papers on LLM tool learning.β28Updated last year
- A curated list of personalized alignment resources (continually updated).β56Updated 3 months ago
- Search Self-Play: Pushing the Frontier of Agent Capability without Supervisionβ84Updated 3 weeks ago