pat-jj / s3
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
☆804Updated 2 months ago
Alternatives and similar repositories for s3
Users that are interested in s3 are comparing it to the libraries listed below
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆318Updated 5 months ago
- Pytorch Library for Relational Table Learning with LLMs.☆438Updated last week
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆837Updated 2 months ago
- [COLM’25] DeepRetrieval — 🔥 Training Search Agent by RLVR with Retrieval Outcome☆693Updated 3 months ago
- [NeurIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,234Updated 4 months ago
- [EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery☆292Updated 2 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"☆559Updated 5 months ago
- DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approac…☆412Updated 8 months ago
- [arXiv'25] EraRAG: Efficient and Incremental Retrieval-Augmented Generation for Growing Corpora☆167Updated 3 months ago
- ☆56Updated last week
- ScaleCUA provides open-source computer use agents that can operate in cross-platform environments (Windows, macOS, Ubuntu, Android).☆1,054Updated last week
- ☆254Updated 2 weeks ago
- [ICLR Workshop 2025] Official source code for the paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".☆164Updated 8 months ago
- Source code of LogicRAG at AAAI'26.☆166Updated last month
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…☆470Updated 2 months ago
- [NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasks☆516Updated 4 months ago
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆127Updated 2 months ago
- Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions☆200Updated last month
- ☆546Updated 4 months ago
- (ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators☆272Updated 3 months ago
- Tree Search for LLM Agent Reinforcement Learning☆267Updated 3 months ago
- ☆239Updated 2 weeks ago
- ☆332Updated 4 months ago
- A relation-free graph construction method for efficient GraphRAG.☆293Updated last week
- This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines s…☆661Updated this week
- [AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learni…☆43Updated 6 months ago
- Implementation of Controlled Self-Evolution for Algorithmic Code Optimization☆98Updated this week
- Official implementation of RARE: Retrieval-Augmented Reasoning Modeling☆186Updated 7 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆126Updated 2 months ago
- [NeurIPS 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models☆1,165Updated 3 months ago