This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".
☆68Apr 8, 2026Updated 2 months ago
Alternatives and similar repositories for CaRR
Users that are interested in CaRR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Nov 5, 2024Updated last year
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆10Mar 1, 2025Updated last year
- ☆27Jun 2, 2026Updated 3 weeks ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]☆64Jul 4, 2025Updated 11 months ago
- A MCP Task Server☆11Mar 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Rethinking the Trust Region in LLM Reinforcement Learning☆61Mar 2, 2026Updated 3 months ago
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆18Mar 7, 2025Updated last year
- ☆23Jul 23, 2025Updated 11 months ago
- ☆35Nov 18, 2025Updated 7 months ago
- [CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆92Jun 5, 2026Updated 3 weeks ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆84Mar 9, 2026Updated 3 months ago
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- [ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆33Dec 9, 2025Updated 6 months ago
- ☆15Nov 18, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆27Dec 21, 2025Updated 6 months ago
- External project in GitHub for marketing purposes. This repo will be used for code samples that accompany blog posts on https://stability…☆15May 13, 2025Updated last year
- FrontierSWE is an ultra long-horizon coding agent benchmark that tests implementation, performance eng and ML research☆166Updated this week
- [ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring …☆20Jun 27, 2025Updated last year
- A modern X11 server written from scratch in Rust.☆436Updated this week
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆27Jul 1, 2025Updated last year
- ☆10Oct 18, 2023Updated 2 years ago
- 地图足迹故事,微信小程序☆10May 5, 2022Updated 4 years ago
- Give Claude Code a cheap coworker. CLI tools that delegate bulk I/O to cheap LLMs (Kimi, DeepSeek, Ollama). Save 60-70% of your token bud…☆158Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 6 months ago
- MCP server for the Delinea Secret Server and Platform APIs☆46Updated this week
- ☆15Apr 8, 2024Updated 2 years ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆21Mar 13, 2025Updated last year
- moodist☆28Apr 23, 2026Updated 2 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 4 months ago
- ☆78May 31, 2026Updated last month
- ☆36Jul 16, 2025Updated 11 months ago
- Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)☆34Jun 8, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆52Jan 8, 2026Updated 5 months ago
- Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows☆166Jun 2, 2026Updated 3 weeks ago
- An implementation of Deepmind's MuZero algorithm.☆16Aug 23, 2021Updated 4 years ago
- A curated list of papers on graph transfer learning (GTL).☆19Oct 23, 2023Updated 2 years ago
- ✨ PyTorch implementation of "Cora: Correspondence-aware Image Editing Using Few-Step Diffusion", accepted at SIGGRAPH 2025.☆34Jun 3, 2025Updated last year
- Codes for WWW 2023 paper "TIGER: Temporal Interaction Graph Embedding with Restarts"☆12Feb 16, 2023Updated 3 years ago
- Reinforcement learning with Equinox☆21Mar 4, 2025Updated last year