THUDM / AgentTuningLinks

AgentTuning: Enabling Generalized Agent Abilities for LLMs

☆1,454

Alternatives and similar repositories for AgentTuning

Users that are interested in AgentTuning are comparing it to the libraries listed below

Sorting:

KwaiKEG / KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
☆1,173Updated last year
THUDM / AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆2,704Updated 6 months ago
hiyouga / FastEdit
🩹Editing large language models within 10 seconds⚡
☆1,339Updated last year
microsoft / ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…
☆1,083Updated last year
Xwin-LM / Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
☆1,040Updated last year
WeOpenML / PandaLM
☆920Updated last year
thunlp / ToolLearningPapers
☆908Updated last year
open-compass / MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
☆767Updated last year
ruixiangcui / AGIEval
☆758Updated last year
InternLM / lagent
A lightweight framework for building LLM-based agents
☆2,173Updated last month
AGI-Edgerunners / Plan-and-Solve-Prompting
Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".
☆678Updated 2 years ago
tatsu-lab / alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆1,816Updated 7 months ago
OpenBMB / ProAgent
An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
☆852Updated last year
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆952Updated 9 months ago
AkariAsai / self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…
☆2,147Updated last year
OpenLemur / Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆552Updated last year
Link-AGI / AutoAgents
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
☆1,389Updated last year
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆988Updated last year
lupantech / chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
☆1,134Updated last year
FranxYao / chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,742Updated 11 months ago
ctlllll / LLM-ToolMaker
☆1,033Updated 2 years ago
Victorwz / LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
☆802Updated last year
openai / prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,032Updated 2 years ago
huchenxucs / ChatDB
The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory".
☆583Updated 2 years ago
dvlab-research / LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,674Updated 11 months ago
madaan / self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
☆719Updated 9 months ago
onejune2018 / Awesome-LLM-Eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…
☆555Updated 9 months ago
GAIR-NLP / factool
FacTool: Factuality Detection in Generative AI
☆886Updated 11 months ago
InteractiveNLP-Team / RoleLLM-public
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
☆501Updated 9 months ago
InternLM / InternLM-techreport
☆905Updated 2 years ago