ventr1c / Awesome-RL-based-Agentic-Search-PapersView external linksLinks
The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".
☆144Jan 31, 2026Updated 2 weeks ago
Alternatives and similar repositories for Awesome-RL-based-Agentic-Search-Papers
Users that are interested in Awesome-RL-based-Agentic-Search-Papers are comparing it to the libraries listed below
Sorting:
- ☆18Jun 10, 2025Updated 8 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆127Nov 19, 2025Updated 2 months ago
- Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆140May 23, 2025Updated 8 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆42Aug 25, 2025Updated 5 months ago
- ☆16Sep 17, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 4 months ago
- ☆23Jul 29, 2025Updated 6 months ago
- An adaptive sampling framework for Reinforce-style LLM post training.☆89Nov 29, 2025Updated 2 months ago
- ☆28May 24, 2025Updated 8 months ago
- INFTY Engine: An Optimization Toolkit to Support Continual AI☆567Sep 13, 2025Updated 5 months ago
- Code, notebooks, and other material for FuturePath AI's training course on Generative AI☆12Apr 24, 2025Updated 9 months ago
- MegaRAG: Multimodal Graph-based RAG☆33Sep 16, 2025Updated 5 months ago
- Beyond log-likelihood: exploring alternative objectives for supervised fine-tuning of language model post-training☆55Oct 4, 2025Updated 4 months ago
- ☆12Jan 10, 2025Updated last year
- ☆36Feb 21, 2025Updated 11 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆127Nov 3, 2025Updated 3 months ago
- ☆154Jan 2, 2024Updated 2 years ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 3 months ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆19Aug 1, 2025Updated 6 months ago
- CPG-SPMT: Control-oriented Parameter-Grouped Single Particle Model with Thermal effects☆39Jan 19, 2026Updated 3 weeks ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 2 months ago
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 8 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- ☆25Jun 18, 2025Updated 7 months ago
- ☆67Aug 14, 2025Updated 6 months ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 4 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 7 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆303Updated this week
- ☆223Nov 5, 2025Updated 3 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆44Jan 25, 2026Updated 3 weeks ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- ☆23Jul 2, 2025Updated 7 months ago
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆46Aug 22, 2025Updated 5 months ago
- ☆33Jul 15, 2025Updated 7 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆66Dec 8, 2025Updated 2 months ago
- Test-time Scaling for VAR models☆31Sep 19, 2025Updated 4 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆63Jun 13, 2025Updated 8 months ago