THUDM / AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
☆1,362 Updated last year
Related projects
Alternatives and complementary repositories for AgentTuning
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆2,200 Updated 2 months ago
- A generalized information-seeking agent system with Large Language Models (LLMs). ☆1,095 Updated 4 months ago
- 🩹Editing large language models within 10 seconds⚡ ☆1,281 Updated last year
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,… ☆1,817 Updated 5 months ago
- ☆870 Updated 3 months ago
- ☆887 Updated 5 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,511 Updated 2 weeks ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit… ☆971 Updated 8 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆764 Updated 10 months ago
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models". ☆1,425 Updated 5 months ago
- [IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks. ☆1,205 Updated 6 months ago
- ☆707 Updated 4 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,000 Updated 9 months ago
- The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory". ☆534 Updated last year
- A lightweight framework for building LLM-based agents ☆1,846 Updated this week
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting ☆2,570 Updated 3 months ago
- List of language agents based on paper "Cognitive Architectures for Language Agents" ☆761 Updated 2 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆881 Updated 2 weeks ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆613 Updated last month
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models" ☆2,159 Updated 3 weeks ago
- Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models". ☆596 Updated last year
- MOSS-RLHF ☆1,290 Updated 8 months ago
- ☆1,263 Updated this week
- ☆2,571 Updated last week
- Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment ☆1,021 Updated 5 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆701 Updated this week
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs… ☆429 Updated 2 weeks ago
- [ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models ☆1,964 Updated 9 months ago
- RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models ☆463 Updated 3 weeks ago
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation ☆774 Updated 10 months ago