microsoft / Everything-of-Thoughts-XoT
An implemtation of Everyting of Thoughts (XoT).
☆114Updated 6 months ago
Related projects: ⓘ
- ☆111Updated 3 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents☆227Updated 3 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆162Updated 5 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆124Updated last month
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆239Updated this week
- ☆242Updated last week
- FireAct: Toward Language Agent Fine-tuning☆242Updated 10 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆151Updated 3 months ago
- ☆262Updated this week
- ☆166Updated 4 months ago
- KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆162Updated 3 months ago
- Benchmarks, environments, and toolkits for general computer agents☆154Updated this week
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆143Updated 5 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆87Updated 11 months ago
- ☆90Updated last month
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆96Updated 4 months ago
- Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"☆221Updated 5 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆84Updated 11 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆73Updated 2 months ago
- AWM: Agent Workflow Memory☆121Updated this week
- FuseAI Project☆75Updated 3 weeks ago
- augmented LLM with self reflection☆80Updated 9 months ago
- CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆167Updated this week
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆111Updated 2 months ago
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆274Updated this week
- Implementation of Google's SELF-DISCOVER☆267Updated last month
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆69Updated this week
- Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"☆108Updated 4 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆134Updated 6 months ago
- Expert Specialized Fine-Tuning☆129Updated last month