Gentopia-AI / GentPoolLinks
Gentopia Agent Zoo and Agent Benchmark
☆30Updated last year
Alternatives and similar repositories for GentPool
Users that are interested in GentPool are comparing it to the libraries listed below
Sorting:
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆151Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆56Updated 5 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆121Updated last year
- ☆52Updated last year
- ☆40Updated 10 months ago
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.☆316Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆96Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 5 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated last year
- Based on the tree of thoughts paper☆48Updated last year
- ☆82Updated last year
- Reasoning by Communicating with Agents☆28Updated last month
- Toy implementation of Strawberry☆31Updated 8 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆57Updated last year
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆80Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆67Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆46Updated last year
- Langchain Agent finetuning using 7B - LLAMA 2 , on hotpotQA (Retroformer framework)☆15Updated last year
- Metacognitive Prompting Improves Understanding in Large Language Models (NAACL 2024)☆34Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆105Updated 7 months ago
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆16Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆37Updated 3 months ago
- Code and Data for "Language Modeling with Editable External Knowledge"☆33Updated 11 months ago
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆30Updated 2 years ago
- Official repo of Respond-and-Respond: data, code, and evaluation☆103Updated 10 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 6 months ago