THUDM / AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
☆1,402Updated last year
Alternatives and similar repositories for AgentTuning:
Users that are interested in AgentTuning are comparing it to the libraries listed below
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,140Updated 9 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,453Updated last month
- 🩹Editing large language models within 10 seconds⚡☆1,317Updated last year
- ☆893Updated 8 months ago
- ☆910Updated 10 months ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆1,047Updated last year
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,695Updated 2 months ago
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".☆785Updated 11 months ago
- Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)☆2,652Updated 7 months ago
- ☆741Updated 9 months ago
- List of language agents based on paper "Cognitive Architectures for Language Agents"☆888Updated 2 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆924Updated 5 months ago
- Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment☆1,037Updated 9 months ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,009Updated 10 months ago
- Open Academic Research on Improving LLaMA to SOTA LLM☆1,617Updated last year
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,324Updated 3 months ago
- ☆903Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆801Updated 8 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI☆767Updated last year
- [NIPS2023] RRHF & Wombat☆804Updated last year
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them☆471Updated 9 months ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆542Updated last year
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All☆782Updated last year
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,697Updated 7 months ago
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".☆1,496Updated 9 months ago
- An Open-source Toolkit for LLM Development☆2,762Updated 2 months ago
- Codebase for Merging Language Models (ICML 2024)☆801Updated 10 months ago
- RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models☆489Updated 5 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆671Updated 5 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆612Updated last month