The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".
☆162Mar 6, 2026Updated 2 weeks ago
Alternatives and similar repositories for Awesome-RL-based-Agentic-Search-Papers
Users who are interested in Awesome-RL-based-Agentic-Search-Papers are comparing it to the repositories listed below
- ☆80Mar 5, 2026Updated 2 weeks ago
- Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆140May 23, 2025Updated 9 months ago
- INFTY Engine: An Optimization Toolkit to Support Continual AI☆567Sep 13, 2025Updated 6 months ago
- official implementation for paper titled "Training-free Horizon Extension for Autoregressive Video Generation"☆110Feb 17, 2026Updated last month
- Beyond log-likelihood: exploring alternative objectives for supervised fine-tuning of language model post-training☆55Oct 4, 2025Updated 5 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆43Aug 25, 2025Updated 6 months ago
- ☆232Nov 5, 2025Updated 4 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆328Updated this week
- ☆18Jun 10, 2025Updated 9 months ago
- An adaptive sampling framework for Reinforce-style LLM post training.☆92Nov 29, 2025Updated 3 months ago
- ☆16Sep 17, 2024Updated last year
- ☆26Mar 10, 2026Updated last week
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 3 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 5 months ago
- "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"☆594Nov 1, 2025Updated 4 months ago
- Gotta Hear Them All: Towards Sound Source Aware Audio Generation.☆67Nov 15, 2025Updated 4 months ago
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data☆17May 17, 2023Updated 2 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 8 months ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆318Mar 6, 2026Updated 2 weeks ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 5 months ago
- ☆26Sep 3, 2025Updated 6 months ago
- [WIP] A key value separation time series data storage engine inspired by Wisckey and TSM.☆15May 27, 2025Updated 9 months ago
- [EMNLP 2025] Official implementation: Agent-style vision question answer in Autonomous Driving!☆140Sep 27, 2025Updated 5 months ago
- ☆28May 24, 2025Updated 9 months ago
- ☆38Apr 21, 2025Updated 10 months ago
- WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web☆11Mar 8, 2021Updated 5 years ago
- ☆34Jul 23, 2024Updated last year
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 5 months ago
- ☆36Feb 21, 2025Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆21Aug 1, 2025Updated 7 months ago
- A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆64Jun 13, 2025Updated 9 months ago
- ☆15Aug 19, 2025Updated 7 months ago
- 🔥 A continuously updated collection of papers, datasets, and benchmarks on post-training and alignment for video generation.☆67Mar 5, 2026Updated 2 weeks ago
- ☆144Jun 20, 2025Updated 9 months ago
- ☆31May 8, 2025Updated 10 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆49Jan 25, 2026Updated last month
- This is a sample project for getting started with Unity and data visualization.☆11Jun 5, 2020Updated 5 years ago
- Ling-Coder-Lite is a MoE LLM provided and open-sourced by CodeFuse and InclusionAI.☆14Apr 22, 2025Updated 10 months ago
- ☆26Jun 17, 2022Updated 3 years ago