allenai / lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
☆462 Updated 11 months ago
Alternatives and similar repositories for lumos:
Users who are interested in lumos are comparing it to the repositories listed below.
- AWM: Agent Workflow Memory ☆241 Updated 2 weeks ago
- Code for Quiet-STaR ☆713 Updated 5 months ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav… ☆312 Updated last year
- ☆362 Updated last month
- Implementation of Google's SELF-DISCOVER ☆289 Updated 6 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆336 Updated 8 months ago
- ☆305 Updated 8 months ago
- FireAct: Toward Language Agent Fine-tuning ☆265 Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral] ☆281 Updated 9 months ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆540 Updated last year
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ☆301 Updated 3 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents ☆178 Updated 6 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆222 Updated this week
- ☆496 Updated 3 months ago
- ☆268 Updated last year
- ☆120 Updated 8 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆100 Updated 5 months ago
- RewardBench: the first evaluation tool for reward models. ☆505 Updated this week
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆208 Updated 9 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆231 Updated 3 months ago
- ☆114 Updated 6 months ago
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts… ☆187 Updated 6 months ago
- Official repo for "Make Your LLM Fully Utilize the Context" ☆250 Updated 9 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆305 Updated 5 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆145 Updated 2 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆451 Updated 11 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ☆866 Updated last week
- Benchmarking LLMs with Challenging Tasks from Real Users ☆215 Updated 3 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆640 Updated 8 months ago
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents. ☆310 Updated last year