THUDM/CaRR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THUDM/CaRR)

THUDM / CaRR

This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".

☆72

Alternatives and similar repositories for CaRR

Users that are interested in CaRR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kxfan2002 / Reagent
View on GitHub
Agent-RRM: Exploring Reasoning Reward Model for Agents
☆70Mar 17, 2026Updated 4 months ago
MiroMindAI / MiroEval
View on GitHub
MiroEval: A benchmark and evaluation framework for deep research agents — 100 tasks (70 text, 30 multimodal) assessed across synthesis qu…
☆46Jul 6, 2026Updated 2 weeks ago
hkust-nlp / deepsearch-tts
View on GitHub
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
☆21Oct 8, 2025Updated 9 months ago
RUCKBReasoning / SoAy
View on GitHub
Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models
☆27Jul 14, 2025Updated last year
zzfoutofspace / ATPO
View on GitHub
AT2PO: Agentic Turn-based Policy Optimization via Tree Search
☆22May 21, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Kwai-Klear / CE-GPPO
View on GitHub
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
☆16Jan 23, 2026Updated 5 months ago
THUDM / LongReward
View on GitHub
☆63Oct 29, 2024Updated last year
THU-KEG / LongTraceRL
View on GitHub
LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards
☆38Jun 1, 2026Updated last month
howard-yen / SLIM
View on GitHub
☆27Jun 22, 2026Updated 3 weeks ago
thunlp / LLM-generated-text-detection
View on GitHub
☆13Nov 7, 2023Updated 2 years ago
OpenMOSS / ABC-Bench
View on GitHub
ABC-Bench is a benchmark for Agentic Backend Coding. It evaluates whether code agents can explore real repositories, edit code, configure…
☆33Jan 20, 2026Updated 6 months ago
RedSearchAgent / REDSearcher
View on GitHub
REDSearch: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, …
☆128Feb 26, 2026Updated 4 months ago
VectorSpaceLab / agentic-search
View on GitHub
Advancing search on top of AI agents
☆31Jun 9, 2026Updated last month
THUDM / DeepDive
View on GitHub
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
☆333Jun 17, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Crazy-James26 / FlexLLM
View on GitHub
Composable HLS library for rapid development of LLM accelerators. FlexLLM enables spatial-temporal hybrid architectures, with parameteriz…
☆24May 31, 2026Updated last month
THU-KEG / R-Eval
View on GitHub
[KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
☆11Apr 9, 2024Updated 2 years ago
Search-Swarm / SearchSwarm
View on GitHub
☆83Jun 16, 2026Updated last month
OpenForecaster / futuresim
View on GitHub
☆51Jun 25, 2026Updated 3 weeks ago
Chengsong-Huang / RelayLLM
View on GitHub
☆40Jan 10, 2026Updated 6 months ago
NEUIR / ExpandR
View on GitHub
[EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"
☆40Aug 13, 2025Updated 11 months ago
facebookresearch / AdvancedIF
View on GitHub
This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO
☆36Nov 26, 2025Updated 7 months ago
answers111 / alpha-research
View on GitHub
Repo for "AlphaResearch: Accelerating New Algorithm Discovery with Language Models"
☆58Nov 12, 2025Updated 8 months ago
ZihaoHuang-notabot / ConceptMoE
View on GitHub
☆45Jan 30, 2026Updated 5 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yaof20 / verl
View on GitHub
verl: Volcano Engine Reinforcement Learning for LLMs
☆22Nov 6, 2025Updated 8 months ago
xiaofengShi / SPAR
View on GitHub
☆26Jul 23, 2025Updated 11 months ago
rlresearch / dr-tulu
View on GitHub
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
☆687Jun 17, 2026Updated last month
huggingface / finephrase
View on GitHub
Synthetic pretraining data by rephrasing the web
☆24Jun 5, 2026Updated last month
google-deepmind / proeval
View on GitHub
GenAI evaluation framework, optimized for 100x lower cost 🚀.
☆40Jun 16, 2026Updated last month
plageon / HierSearch
View on GitHub
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches
☆40Oct 9, 2025Updated 9 months ago
Multimodal-Commonsense-and-Task / Knowledge-Base-and-NLP
View on GitHub
☆13Nov 29, 2024Updated last year
Tongyi-Zhiwen / Qwen-Doc
View on GitHub
☆548May 25, 2026Updated last month
KMnO4-zx / nips25-all-papers
View on GitHub
nips25-all-papers
☆44Feb 26, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WxxShirley / Agent-STAR
View on GitHub
Official implementation for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe"
☆32May 12, 2026Updated 2 months ago
EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
View on GitHub
☆19May 17, 2025Updated last year
euReKa025 / AgentLongBench
View on GitHub
☆21Jan 29, 2026Updated 5 months ago
inclusionAI / DR-Venus
View on GitHub
☆93May 8, 2026Updated 2 months ago
OxRML / MADQA
View on GitHub
Multimodal Agentic Document QA benchmark (MADQA)
☆39Mar 13, 2026Updated 4 months ago
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
BaohaoLiao / SAGE
View on GitHub
Self-Hinting Language Models Enhance Reinforcement Learning
☆26Mar 28, 2026Updated 3 months ago