This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".
☆58Mar 14, 2026Updated this week
Alternatives and similar repositories for CaRR
Users that are interested in CaRR are comparing it to the libraries listed below
Sorting:
- ☆12Nov 5, 2024Updated last year
- A central repository for curating and managing diverse datasets used in healthcare applications.☆11Jun 8, 2024Updated last year
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆10Mar 1, 2025Updated last year
- ☆24Oct 3, 2025Updated 5 months ago
- [NeurIPS 2023] and [ICLR 2024] for robustness certification.☆10Nov 30, 2024Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Jul 4, 2025Updated 8 months ago
- Healthcare AI Model Evaluator (HAIME) empowers healthcare organizations to independently evaluate and customize AI solutions, addressing …☆45Feb 17, 2026Updated last month
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Mar 16, 2025Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆63Jan 28, 2026Updated last month
- Rethinking the Trust Region in LLM Reinforcement Learning☆45Mar 2, 2026Updated 2 weeks ago
- ☆20Jul 23, 2025Updated 7 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆79Mar 9, 2026Updated last week
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆17Mar 7, 2025Updated last year
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆88Jan 16, 2026Updated 2 months ago
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆63Dec 17, 2025Updated 3 months ago
- ☆32Nov 18, 2025Updated 4 months ago
- Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur…☆15Jul 7, 2025Updated 8 months ago
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- iMessage RAG MCP Server from Anthropic MCP Hackathon (NYC)☆14Mar 10, 2025Updated last year
- ☆15Dec 21, 2017Updated 8 years ago
- ☆15Nov 18, 2025Updated 4 months ago
- Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆33Dec 9, 2025Updated 3 months ago
- Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity"☆78Feb 1, 2026Updated last month
- ☆47Jan 8, 2026Updated 2 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆24Jul 1, 2025Updated 8 months ago
- ☆10Oct 18, 2023Updated 2 years ago
- SWE-Exp: Experience-Driven Software Issue Resolution☆38Oct 17, 2025Updated 5 months ago
- A high-performance PDF summarization tool powered by Google's Gemma 3 LLM. Features parallel processing, async operations, and intelligen…☆20Apr 12, 2025Updated 11 months ago
- MCP server for the Delinea Secret Server and Platform APIs☆43Mar 13, 2026Updated last week
- ☆15Feb 2, 2025Updated last year
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 3 months ago
- [SIGKDD 2024] Rethinking Fair Graph Neural Networks from Re-balancing☆10Jul 15, 2024Updated last year
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆20Mar 13, 2025Updated last year
- ☆75Updated this week
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 3 weeks ago
- This is an official implementation for "SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharynge…☆31Oct 4, 2025Updated 5 months ago
- Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows☆160Jan 19, 2026Updated 2 months ago
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆118Jan 22, 2026Updated last month
- Towards Medical Small Language Models with Self-Evolved \\ Slow Thinking☆87Nov 11, 2025Updated 4 months ago