X-PLUG / Multi-LLM-AgentLinks
☆233Updated last year
Alternatives and similar repositories for Multi-LLM-Agent
Users that are interested in Multi-LLM-Agent are comparing it to the libraries listed below
Sorting:
- ☆331Updated last year
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆165Updated 8 months ago
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆94Updated 2 years ago
- ☆147Updated last year
- ☆162Updated 11 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆232Updated 11 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆410Updated 5 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆400Updated 8 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆159Updated 7 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆238Updated 4 months ago
- 中文原生检索增强生成测评基准☆123Updated last year
- 大模型多维度中文对齐评测基准 (ACL 2024)☆423Updated last month
- ☆121Updated 2 years ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆195Updated last year
- ☆205Updated 8 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆284Updated 2 years ago
- ☆278Updated 6 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆301Updated last year
- LongQLoRA: Extent Context Length of LLMs Efficiently☆167Updated 2 years ago
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- Source code and demo for memory bank and SiliconFriend☆381Updated 2 years ago
- ☆251Updated 2 years ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆130Updated 9 months ago
- FlagEval is an evaluation toolkit for AI large foundation models.☆339Updated 8 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆162Updated 4 months ago
- SOTA Math Opensource LLM☆332Updated 2 years ago
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆350Updated 7 months ago
- ☆54Updated last year
- [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.☆409Updated last year
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…☆343Updated 5 months ago