Gentopia-AI / Gentopia
Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.
☆317 Updated last year
Alternatives and similar repositories for Gentopia
Users who are interested in Gentopia are comparing it to the libraries listed below.
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆268 Updated last year
- Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate" ☆286 Updated 8 months ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration" ☆339 Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆550 Updated last year
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models" ☆241 Updated last year
- ☆182 Updated 4 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆153 Updated last year
- Gentopia Agent Zoo and Agent Benchmark ☆30 Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models ☆121 Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆358 Updated 9 months ago
- ☆172 Updated last year
- [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages" ☆318 Updated last year
- PaL: Program-Aided Language Models (ICML 2023) ☆499 Updated last year
- FireAct: Toward Language Agent Fine-tuning ☆279 Updated last year
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371) ☆294 Updated 9 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆220 Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral] ☆325 Updated last year
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆703 Updated 8 months ago
- Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models" ☆314 Updated last year
- Build, evaluate, understand, and fix LLM-based apps ☆489 Updated last year
- ☆361 Updated 2 years ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ☆309 Updated 8 months ago
- Data and Code for Program of Thoughts (TMLR 2023) ☆276 Updated last year
- Forward-Looking Active REtrieval-augmented generation (FLARE) ☆636 Updated last year
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research … ☆977 Updated 6 months ago
- All available datasets for Instruction Tuning of Large Language Models ☆252 Updated last year
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral) ☆262 Updated last year
- ☆298 Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav… ☆315 Updated last year
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT (API LLMs) and finally obtain good prompts… ☆194 Updated 10 months ago