THUDM / CaRRLinks
This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".
☆42Updated last week
Alternatives and similar repositories for CaRR
Users that are interested in CaRR are comparing it to the libraries listed below
Sorting:
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆36Updated 3 months ago
- ☆19Updated 10 months ago
- RePo: Language Models with Context Re-Positioning☆21Updated 3 weeks ago
- LIMI: Less is More for Agency☆159Updated 3 months ago
- Official Project Page for Web World Models (https://arxiv.org/abs/2512.23676)☆74Updated 2 weeks ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 4 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Updated 3 months ago
- ☆20Updated 5 months ago
- ☆42Updated 7 months ago
- ☆29Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆259Updated this week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 4 months ago
- ☆94Updated this week
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆86Updated 9 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆38Updated 2 months ago
- This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.☆110Updated 3 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 4 months ago
- XmodelLM☆38Updated last year
- ☆26Updated 3 weeks ago
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆90Updated 3 weeks ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆113Updated 3 weeks ago
- ☆52Updated 7 months ago
- accompanying material for sleep-time compute paper☆118Updated 8 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- ☆67Updated 9 months ago
- ☆43Updated 2 months ago
- Official Repository of Native Parallel Reasoner☆98Updated this week
- MemEvolve & EvolveLab☆102Updated 3 weeks ago
- A method for steering llms to better follow instructions☆74Updated 5 months ago
- Lottery Ticket Adaptation☆39Updated last year