Ag2S1 / Sibyl-System
☆103 Updated 3 months ago
Related projects
Alternatives and complementary repositories for Sibyl-System
- ☆116 Updated 5 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning ☆178 Updated last month
- AWM: Agent Workflow Memory ☆205 Updated last month
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ☆170 Updated last month
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆80 Updated 2 months ago
- Environments, tools, and benchmarks for general computer agents ☆172 Updated 3 weeks ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 Updated last month
- Reformatted Alignment ☆112 Updated last month
- An implementation of Everything of Thoughts (XoT). ☆132 Updated 8 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆204 Updated this week
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models" ☆92 Updated last year
- Code for the paper 🌳 Tree Search for Language Model Agents ☆138 Updated 3 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆110 Updated 3 weeks ago
- A simple unified framework for evaluating LLMs ☆145 Updated last week
- Beating the GAIA benchmark with Transformers Agents. 🚀 ☆62 Updated 3 weeks ago
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen… ☆204 Updated 3 months ago
- FireAct: Toward Language Agent Fine-tuning ☆255 Updated last year
- ☆62 Updated 3 weeks ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆195 Updated 2 weeks ago
- An Analytical Evaluation Board of Multi-turn LLM Agents ☆250 Updated 6 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆118 Updated last month
- Official Repo for UGround ☆97 Updated last week
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging… ☆141 Updated last month
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀 ☆47 Updated 3 weeks ago
- ☆41 Updated 2 months ago
- ☆78 Updated 11 months ago
- ☆127 Updated 3 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆180 Updated 3 weeks ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe… ☆142 Updated 9 months ago
- ☆152 Updated 2 months ago