β74Oct 1, 2025Updated 5 months ago
Alternatives and similar repositories for AgentDebug
Users that are interested in AgentDebug are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan Youβ62Dec 30, 2025Updated 2 months ago
- [ACL 2025 Main] (π Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probaβ¦β16Aug 15, 2025Updated 7 months ago
- β13Oct 19, 2023Updated 2 years ago
- Unifew: Unified Fewshot Learning Modelβ18Sep 10, 2021Updated 4 years ago
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"β14Aug 22, 2025Updated 6 months ago
- A collection of interesting papers on Diffusion Modelsβ16Dec 19, 2023Updated 2 years ago
- Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".β39Jun 9, 2025Updated 9 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).β48Oct 16, 2025Updated 5 months ago
- πCurated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.β13Feb 7, 2025Updated last year
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learningβ15Dec 12, 2023Updated 2 years ago
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that enaβ¦β42Feb 18, 2026Updated last month
- ReBase: Training Task Experts through Retrieval Based Distillationβ29Feb 5, 2025Updated last year
- β21Feb 28, 2025Updated last year
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"β15Oct 2, 2025Updated 5 months ago
- β28Nov 10, 2025Updated 4 months ago
- More reliable Video Understanding Evaluationβ14Sep 23, 2025Updated 5 months ago
- Solving Inequality Proofs with Large Language Models.β58Dec 15, 2025Updated 3 months ago
- Code for "Adversarial Constraint Learning for Structured Prediction"β14May 30, 2018Updated 7 years ago
- Accepted LLM Papers in NeurIPS 2024β37Oct 13, 2024Updated last year
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts frβ¦β43Jan 8, 2026Updated 2 months ago
- β46Jan 21, 2026Updated last month
- 3DGS with vulkan backendβ13Apr 19, 2025Updated 11 months ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoningβ13Jun 7, 2025Updated 9 months ago
- β40Mar 3, 2026Updated 2 weeks ago
- Official code of HierCDF @ SIGKDD2022β12Aug 14, 2022Updated 3 years ago
- β10Oct 11, 2022Updated 3 years ago
- Malicious Activity Detection System. Final Year Project. Deep Learning-based solution, which analyses Network Activity sequences to classβ¦β13Sep 27, 2024Updated last year
- β30Jun 28, 2025Updated 8 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transferβ30Mar 5, 2026Updated 2 weeks ago
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolutionβ25Nov 11, 2025Updated 4 months ago
- [IROS 2025] ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesisβ20May 17, 2025Updated 10 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.β29Feb 4, 2025Updated last year
- Evaluation Pipeline for medical tasks.β12Feb 13, 2026Updated last month
- Chain of Images for Intuitively Reasoningβ10Nov 29, 2023Updated 2 years ago
- [ICML 2024] RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Modelsβ12Jun 30, 2025Updated 8 months ago
- β13May 1, 2025Updated 10 months ago
- β14Jun 14, 2019Updated 6 years ago
- Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting β¦β47Aug 6, 2025Updated 7 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learningβ118Dec 30, 2025Updated 2 months ago