Ruiyang-061X / Awesome-Search-RLLinks
☆33Updated last week
Alternatives and similar repositories for Awesome-Search-RL
Users that are interested in Awesome-Search-RL are comparing it to the libraries listed below
Sorting:
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆93Updated this week
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆57Updated last month
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆68Updated last month
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆45Updated last week
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆56Updated 8 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆108Updated last week
- A Survey of Personalization: From RAG to Agent☆46Updated last month
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆82Updated this week
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆29Updated 4 months ago
- ☆100Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated 3 weeks ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆58Updated 3 months ago
- ☆48Updated 3 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆26Updated 2 weeks ago
- Test-time compute in information retrieval☆32Updated 2 months ago
- ☆43Updated 3 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆108Updated last month
- ☆56Updated 7 months ago
- Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, and external …☆33Updated 9 months ago
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆123Updated 8 months ago
- 珠算代码大模型(Abacus Code LLM)☆55Updated 8 months ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆65Updated 6 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆53Updated 3 weeks ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆39Updated 10 months ago
- ☆51Updated last month
- A framework for editing the CoTs for better factuality☆50Updated last year
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆46Updated 2 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆40Updated 4 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆60Updated last month
- A Comprehensive Survey on Long Context Language Modeling☆151Updated 2 weeks ago