Code4Agent / codeagentLinks
☆13Updated last year
Alternatives and similar repositories for codeagent
Users that are interested in codeagent are comparing it to the libraries listed below
Sorting:
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆27Updated last month
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Updated 11 months ago
- Automatic Test Generator☆12Updated 3 months ago
- ☆28Updated last week
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆24Updated 2 weeks ago
- ☆47Updated last month
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆37Updated last year
- Can VLMs understand students' hand-drawn math work?☆13Updated this week
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆16Updated 2 weeks ago
- CodeUltraFeedback: aligning large language models to coding preferences☆71Updated last year
- ☆12Updated 2 months ago
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆30Updated 2 years ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆24Updated 3 months ago
- ☆25Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- ☆17Updated 9 months ago
- ☆46Updated last year
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated last year
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆28Updated 2 months ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆18Updated 2 months ago
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆93Updated this week
- The repository contains generative AI analytics platform application code.☆26Updated 2 months ago
- ☆19Updated this week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- Code for Benchmarking Language Model Agents for Data-Driven Science☆28Updated 8 months ago
- ☆22Updated 7 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆15Updated last week
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- ☆13Updated 2 years ago