InternLM / Agent-FLANLinks
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
☆351Updated last year
Alternatives and similar repositories for Agent-FLAN
Users that are interested in Agent-FLAN are comparing it to the libraries listed below
Sorting:
- Enhance LLM agents with rich tool APIs☆394Updated 10 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆283Updated last year
- ☆49Updated 2 years ago
- ☆37Updated last year
- State-of-the-art bilingual open-sourced Math reasoning LLMs.☆518Updated 9 months ago
- InternEvo is a high-performance training system for giant models.☆38Updated last year
- PyTorch Sphinx Theme☆35Updated last year
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆398Updated last week
- ☆229Updated last year
- a-m-team's exploration in large language modeling☆178Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆354Updated 3 months ago
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆238Updated 5 months ago
- ☆298Updated last year
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆313Updated last month
- ☆293Updated last month
- ☆323Updated last year
- 大模型多维度中文对齐评测基准 (ACL 2024)☆401Updated 11 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆188Updated 4 months ago
- An automated pipeline for evaluating LLMs for role-playing.☆192Updated 10 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆140Updated 3 weeks ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆381Updated last month
- A live reading list for LLM-synthetic-data.☆343Updated this week
- AN O1 REPLICATION FOR CODING☆335Updated 7 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆265Updated last year
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆697Updated last week
- ☆544Updated 7 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆156Updated 2 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆146Updated 7 months ago
- ☆227Updated 2 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆603Updated 2 months ago