Gentopia-AI / GentPool
Gentopia Agent Zoo and Agent Benchmark
☆28 Updated last year
Related projects
Alternatives and complementary repositories for GentPool
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models" ☆92 Updated last year
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents. ☆297 Updated 11 months ago
- ☆171 Updated 6 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆76 Updated 9 months ago
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen… ☆27 Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆145 Updated 8 months ago
- ☆24 Updated this week
- ☆38 Updated 4 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆39 Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 Updated last month
- FireAct: Toward Language Agent Fine-tuning ☆255 Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆107 Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆103 Updated 6 months ago
- ☆78 Updated 11 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆87 Updated last year
- ☆48 Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆49 Updated 8 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A… ☆40 Updated 9 months ago
- ☆56 Updated 9 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆50 Updated 6 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆48 Updated 5 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆41 Updated 9 months ago
- Evaluating tool-augmented LLMs in conversation settings ☆72 Updated 5 months ago
- ☆116 Updated 5 months ago
- [NeurIPS 2023] This is the code for the paper "Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias". ☆141 Updated last year
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆30 Updated 3 months ago
- ☆17 Updated 4 months ago
- ☆42 Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM ☆42 Updated 6 months ago