Code4Agent / codeagent
☆21Updated last year
Alternatives and similar repositories for codeagent
Users that are interested in codeagent are comparing it to the libraries listed below
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding☆88Updated last year
- NaturalCodeBench (Findings of ACL 2024)☆69Updated last year
- ☆20Updated last year
- Please check the latest version of our released code at https://github.com/microsoft/TableProvider/tree/main.☆42Updated 4 months ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Updated last year
- An LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆16Updated last year
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"☆51Updated 2 months ago
- ☆104Updated last year
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆29Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆74Updated last year
- ☆30Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆29Updated last year
- ☆32Updated 8 months ago
- Source code for EMNLP 2023 paper "Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions".☆23Updated last year
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Updated last year
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year
- Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification☆16Updated 6 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆29Updated 9 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆90Updated last year
- Large language models for document ranking.☆70Updated 2 weeks ago
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆28Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆85Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆23Updated last year
- A Survey of Personalization: From RAG to Agent☆99Updated 5 months ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Updated 2 years ago
- Code/data for MARG (multi-agent review generation)☆59Updated 4 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Updated last year
- ☆18Updated last year
- OrcaLoca: An LLM Agent Framework for Software Issue Localization [ICML 25]☆33Updated 9 months ago