This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".
☆68Apr 8, 2026Updated 2 months ago
Alternatives and similar repositories for CaRR
Users that are interested in CaRR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Nov 5, 2024Updated last year
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆10Mar 1, 2025Updated last year
- [NeurIPS 2023] and [ICLR 2024] for robustness certification.☆10Nov 30, 2024Updated last year
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- ☆26Jun 2, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]☆63Jul 4, 2025Updated 11 months ago
- ☆69Updated this week
- ☆45Jan 30, 2026Updated 4 months ago
- A MCP Task Server☆11Mar 7, 2025Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆69Jan 28, 2026Updated 4 months ago
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆18Mar 7, 2025Updated last year
- ☆22Jul 23, 2025Updated 10 months ago
- Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur…☆17Jul 7, 2025Updated 11 months ago
- 复旦研究生抢 课脚本☆10Feb 14, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆71Dec 17, 2025Updated 5 months ago
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- [ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆32Dec 9, 2025Updated 6 months ago
- ☆15Nov 18, 2025Updated 6 months ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆26Dec 21, 2025Updated 5 months ago
- Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity"☆81Feb 1, 2026Updated 4 months ago
- External project in GitHub for marketing purposes. This repo will be used for code samples that accompany blog posts on https://stability…☆15May 13, 2025Updated last year
- FrontierSWE is an ultra long-horizon coding agent benchmark that tests implementation, performance eng and ML research☆130Apr 30, 2026Updated last month
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆26Jul 1, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Oct 18, 2023Updated 2 years ago
- ☆15Feb 2, 2025Updated last year
- Give Claude Code a cheap coworker. CLI tools that delegate bulk I/O to cheap LLMs (Kimi, DeepSeek, Ollama). Save 60-70% of your token bud…☆149May 4, 2026Updated last month
- MCP server for the Delinea Secret Server and Platform APIs☆46Jun 2, 2026Updated last week
- moodist☆28Apr 23, 2026Updated last month
- ☆36Jul 16, 2025Updated 10 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 6 months ago
- Apply Tensorflow Object Detection API to DeepFashion Datatset☆17May 20, 2018Updated 8 years ago
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling☆21Jul 7, 2025Updated 11 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆128Jan 22, 2026Updated 4 months ago
- ☆51Jan 8, 2026Updated 5 months ago
- Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows☆166Jun 2, 2026Updated last week
- An implementation of Deepmind's MuZero algorithm.☆16Aug 23, 2021Updated 4 years ago
- A curated list of papers on graph transfer learning (GTL).☆19Oct 23, 2023Updated 2 years ago
- Codes for WWW 2023 paper "TIGER: Temporal Interaction Graph Embedding with Restarts"☆12Feb 16, 2023Updated 3 years ago
- [System Prompt] Parent-Child Instruction Processing (PCIP) Framework with conversational learning, external knowledge integration, and dy…☆30Aug 19, 2025Updated 9 months ago