☆90Mar 30, 2026Updated 2 months ago
Alternatives and similar repositories for AgentDebug
Users that are interested in AgentDebug are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆71Dec 30, 2025Updated 5 months ago
- Python client library for Graphlit Platform☆20Jun 1, 2026Updated last week
- Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation)…☆72Jun 11, 2025Updated 11 months ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆18Dec 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Aug 4, 2025Updated 10 months ago
- Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".☆40Jun 9, 2025Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 2 months ago
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆49Apr 22, 2026Updated last month
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆42Feb 18, 2026Updated 3 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆16Oct 2, 2025Updated 8 months ago
- A collection of interesting papers on Diffusion Models☆22Dec 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [WACV'23] Mixture Outlier Exposure for Out-of-Distribution Detection in Fine-grained Environments☆26Apr 12, 2023Updated 3 years ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated last year
- Accepted LLM Papers in NeurIPS 2024☆38Oct 13, 2024Updated last year
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆45Jan 8, 2026Updated 5 months ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated last year
- Safe Python Code Execution Environment for Language Models☆17May 19, 2026Updated 3 weeks ago
- ☆12Nov 5, 2024Updated last year
- ☆74Feb 20, 2023Updated 3 years ago
- TUI kanban board for orchestrating AI coding agents☆105Jan 28, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code of HierCDF @ SIGKDD2022☆12Aug 14, 2022Updated 3 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- [ICLR 2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"☆57Feb 4, 2026Updated 4 months ago
- ☆53Mar 3, 2026Updated 3 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆37Apr 17, 2026Updated last month
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆16Feb 24, 2025Updated last year
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution [ICSE 2026]☆30Nov 11, 2025Updated 6 months ago
- ☆11Nov 8, 2023Updated 2 years ago
- ☆16Aug 14, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Prompt-driven automation platform - Transform natural language into executable workflows☆34Jul 13, 2025Updated 10 months ago
- ☆38Jun 28, 2025Updated 11 months ago
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated 2 months ago
- [ICCV 2025] Official repo of "EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow"☆27Oct 16, 2025Updated 7 months ago
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- ☆75Oct 9, 2025Updated 8 months ago
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…☆11Apr 4, 2026Updated 2 months ago