Code4Agent / codeagent
☆20Updated last year
Alternatives and similar repositories for codeagent
Users that are interested in codeagent are comparing it to the libraries listed below
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding☆86Updated last year
- ☆30Updated last year
- ☆19Updated last year
- NaturalCodeBench (Findings of ACL 2024)☆68Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆28Updated last year
- Please check the latest version of our released code at https://github.com/microsoft/TableProvider/tree/main.☆42Updated 3 months ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆23Updated last year
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Updated 2 years ago
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"☆49Updated 3 weeks ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆28Updated last year
- Contrastive Chain-of-Thought Prompting☆68Updated 2 years ago
- ☆32Updated 6 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆73Updated last year
- DataSciBench: An LLM Agent Benchmark for Data Science☆44Updated 3 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆83Updated last year
- ☆46Updated 6 months ago
- The official implementation of ACL'24 paper: Synergistic Interplay between Search and Large Language Models for Information Retrieval.☆36Updated last year
- ☆105Updated last year
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆72Updated last year
- ☆38Updated last year
- ☆18Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆29Updated last year
- ☆21Updated last year
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆23Updated 2 years ago
- Code for Robust Fine-tuning (RbFT)☆15Updated 10 months ago
- ☆82Updated last year
- Code/data for MARG (multi-agent review generation)☆59Updated 2 months ago