☆71 · Oct 1, 2025 · Updated 4 months ago
Alternatives and similar repositories for AgentDebug
Users that are interested in AgentDebug are comparing it to the libraries listed below
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering" ☆15 · Oct 2, 2025 · Updated 4 months ago
- TUI kanban board for orchestrating AI coding agents ☆45 · Jan 28, 2026 · Updated 3 weeks ago
- Unifew: Unified Fewshot Learning Model ☆18 · Sep 10, 2021 · Updated 4 years ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box ☆18 · Dec 19, 2024 · Updated last year
- Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation)… ☆64 · Jun 11, 2025 · Updated 8 months ago
- PGRAG ☆53 · Jul 16, 2024 · Updated last year
- Process Orchestration Framework: A camunda 7 fork ☆21 · Updated this week
- The jiant toolkit for general-purpose text understanding models ☆22 · Oct 8, 2020 · Updated 5 years ago
- ☆28 · Nov 10, 2025 · Updated 3 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts ☆30 · Apr 30, 2025 · Updated 10 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL). ☆48 · Oct 16, 2025 · Updated 4 months ago
- Official Code Release for "Training a Generally Curious Agent" ☆45 · May 18, 2025 · Updated 9 months ago
- SDLC Copilot is an Agentic AI system designed to streamline and automate the Software Development Lifecycle (SDLC). From requirement gath… ☆23 · Jun 14, 2025 · Updated 8 months ago
- A music composer and player with MATLAB ☆11 · Mar 14, 2020 · Updated 5 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron … ☆33 · Jun 14, 2023 · Updated 2 years ago
- Methods and evaluation for aligning language models temporally ☆30 · Mar 2, 2024 · Updated last year
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr… ☆43 · Jan 8, 2026 · Updated last month
- This repository is the code implementing some classic algorithms in co-location pattern mining. ☆13 · May 9, 2014 · Updated 11 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025 ☆23 · Nov 13, 2025 · Updated 3 months ago
- Solving Inequality Proofs with Large Language Models. ☆57 · Dec 15, 2025 · Updated 2 months ago
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena… ☆42 · Feb 18, 2026 · Updated last week
- ☆10 · Oct 11, 2022 · Updated 3 years ago
- ☆14 · Mar 20, 2025 · Updated 11 months ago
- ☆13 · Oct 19, 2023 · Updated 2 years ago
- A collection of interesting papers on Diffusion Models ☆15 · Dec 19, 2023 · Updated 2 years ago
- A Grand Sumo prediction game ☆10 · Updated this week
- Evaluation Pipeline for medical tasks. ☆12 · Feb 13, 2026 · Updated 2 weeks ago
- ☆11 · Nov 8, 2023 · Updated 2 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023) ☆10 · Jul 6, 2023 · Updated 2 years ago
- ProxyExplainer for Graph Neural Networks ☆15 · Oct 24, 2024 · Updated last year
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆42 · Oct 28, 2024 · Updated last year
- ☆16 · Feb 22, 2025 · Updated last year
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks ☆11 · Jun 19, 2025 · Updated 8 months ago
- https://icml.cc/virtual/2023/poster/24354 ☆10 · Aug 15, 2023 · Updated 2 years ago
- ☆19 · Sep 4, 2025 · Updated 5 months ago
- Few-shot NLP benchmark for unified, rigorous eval ☆93 · Jul 12, 2022 · Updated 3 years ago
- ☆46 · Oct 28, 2025 · Updated 4 months ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training. ☆19 · Feb 7, 2025 · Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025) ☆11 · Apr 18, 2025 · Updated 10 months ago