zorazrw / agent-workflow-memory
AWM: Agent Workflow Memory
☆203Updated last month
Related projects ⓘ
Alternatives and complementary repositories for agent-workflow-memory
- ☆310Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆327Updated 4 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆118Updated this week
- CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆187Updated this week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆147Updated this week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆169Updated last month
- Environments, tools, and benchmarks for general computer agents☆171Updated 2 weeks ago
- Official Repo for UGround☆93Updated this week
- An Analytical Evaluation Board of Multi-turn LLM Agents☆243Updated 5 months ago
- ☆116Updated 5 months ago
- ☆102Updated 2 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆138Updated 3 months ago
- A simple unified framework for evaluating LLMs☆138Updated this week
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆137Updated 3 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆169Updated last week
- Code and Data for Tau-Bench☆193Updated 2 weeks ago
- awesome synthetic (text) datasets☆239Updated last week
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆177Updated 3 weeks ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆447Updated 7 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆173Updated last week
- A compilation of the best multi-agent papers☆247Updated last week
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆106Updated 2 weeks ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆188Updated 2 months ago
- The official evaluation suite and dynamic data release for MixEval.☆222Updated last week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆332Updated 2 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated 6 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆109Updated 2 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆200Updated last month
- ☆102Updated 2 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆119Updated 2 weeks ago