allenai / lumosLinks
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
☆472Updated last year
Alternatives and similar repositories for lumos
Users that are interested in lumos are comparing it to the libraries listed below
Sorting:
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆318Updated 2 years ago
- ☆415Updated last year
- ☆277Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆555Updated 2 years ago
- Implementation of Google's SELF-DISCOVER☆300Updated last year
- Code for Quiet-STaR☆742Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆236Updated last year
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆320Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆117Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆346Updated last year
- ☆122Updated last year
- AWM: Agent Workflow Memory☆359Updated 10 months ago
- This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.☆132Updated last year
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆584Updated 3 months ago
- ☆185Updated 10 months ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆804Updated last year
- ☆379Updated 2 years ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆368Updated last year
- ☆556Updated last year
- Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)☆379Updated last week
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆230Updated last year
- FireAct: Toward Language Agent Fine-tuning☆286Updated 2 years ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆156Updated 9 months ago
- ☆313Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆253Updated last year
- ☆320Updated last year
- An implemtation of Everyting of Thoughts (XoT).☆155Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆250Updated 9 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆216Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆321Updated 9 months ago