X-PLUG / Multi-LLM-Agent
☆221Updated last year
Alternatives and similar repositories for Multi-LLM-Agent
Users who are interested in Multi-LLM-Agent are comparing it to the repositories listed below
- ☆142Updated 11 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆368Updated 8 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆224Updated 4 months ago
- SuperCLUE-Agent: A benchmark for evaluating core agent capabilities on native Chinese tasks☆88Updated last year
- ☆140Updated 4 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆142Updated 2 months ago
- ☆320Updated 11 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆178Updated 9 months ago
- A native Chinese benchmark for evaluating retrieval-augmented generation☆117Updated last year
- ☆193Updated this week
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆151Updated last week
- ☆240Updated this week
- ☆282Updated 10 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆329Updated last month
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆203Updated 8 months ago
- Generative Judge for Evaluating Alignment☆238Updated last year
- LongQLoRA: Extend Context Length of LLMs Efficiently☆165Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆243Updated 7 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆274Updated last year
- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)☆389Updated 9 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆261Updated last year
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆135Updated 4 months ago
- FireAct: Toward Language Agent Fine-tuning☆278Updated last year
- ☆169Updated last year
- FlagEval is an evaluation toolkit for AI large foundation models.☆336Updated last month
- A Toolkit for Table-based Question Answering☆112Updated last year
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆111Updated 2 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆541Updated last week
- ☆141Updated last year
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆409Updated last month