Code4Agent / codeagent
☆18Updated last year
Alternatives and similar repositories for codeagent
Users that are interested in codeagent are comparing it to the libraries listed below
- NaturalCodeBench (Findings of ACL 2024)☆67Updated 11 months ago
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding☆81Updated last year
- ☆88Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆72Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆22Updated 10 months ago
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆28Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆26Updated last year
- ☆27Updated last year
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆96Updated last year
- Please check the latest version of our released code at https://github.com/microsoft/TableProvider/tree/main.☆41Updated 3 weeks ago
- [NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"☆113Updated 2 years ago
- ☆29Updated 3 months ago
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"☆43Updated 11 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- ☆19Updated 11 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆123Updated 7 months ago
- A framework for editing the CoTs for better factuality☆51Updated last year
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆108Updated 11 months ago
- ☆18Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆87Updated last year
- ☆23Updated 9 months ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year
- Contrastive Chain-of-Thought Prompting☆68Updated last year
- Do Large Language Models Know What They Don’t Know?☆99Updated 10 months ago
- [EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Updated 10 months ago
- List of papers on Self-Correction of LLMs.☆76Updated 9 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Updated last year
- The instructions and demonstrations for building a GLM capable of formal logical reasoning☆54Updated last year
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Updated 2 years ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆28Updated last year