bingreeky / AgenTracerLinks
AgenTracer: A Lightweight Failure Attributor for Agentic Systems
☆60Updated last week
Alternatives and similar repositories for AgenTracer
Users that are interested in AgenTracer are comparing it to the libraries listed below
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆66Updated 6 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆88Updated 2 weeks ago
- ☆86Updated 3 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆79Updated 3 weeks ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆46Updated 4 months ago
- ☆158Updated 3 weeks ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆58Updated 5 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆131Updated 2 months ago
- ☆38Updated 3 months ago
- ☆118Updated this week
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆31Updated 6 months ago
- ☆51Updated 8 months ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆106Updated this week
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆118Updated this week
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆49Updated 3 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆150Updated 4 months ago
- ☆45Updated 3 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆196Updated 3 weeks ago
- ☆33Updated 5 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆65Updated 5 months ago
- The demo, code and data of FollowRAG☆75Updated 4 months ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆56Updated 3 months ago
- ☆165Updated last month
- ☆46Updated 3 weeks ago
- ☆104Updated 11 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆123Updated 7 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆138Updated last week
- ☆104Updated last month
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆116Updated 6 months ago
- ☆69Updated 5 months ago