nexusflowai / NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.
☆313Updated last year
Alternatives and similar repositories for NexusRaven:
Users who are interested in NexusRaven are comparing it to the repositories listed below:
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆336Updated 8 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆463Updated 11 months ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆541Updated last year
- ☆404Updated last year
- ☆366Updated last month
- ☆268Updated last year
- Code for Quiet-STaR☆716Updated 6 months ago
- Run evaluation on LLMs using human-eval benchmark☆395Updated last year
- Implementation of Google's SELF-DISCOVER☆290Updated 6 months ago
- A library for easily merging multiple LLM experts and efficiently training the merged LLM.☆445Updated 6 months ago
- AWM: Agent Workflow Memory☆245Updated last month
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆282Updated 9 months ago
- ☆155Updated 6 months ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆620Updated 9 months ago
- ☆498Updated 3 months ago
- A bagel, with everything.☆316Updated 10 months ago
- ☆307Updated 5 months ago
- ☆157Updated 6 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆643Updated 9 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆303Updated 4 months ago
- ☆120Updated 8 months ago
- FireAct: Toward Language Agent Fine-tuning☆270Updated last year
- Open Source WizardCoder Dataset☆155Updated last year
- ☆353Updated last year
- Beating the GAIA benchmark with Transformers Agents. 🚀☆90Updated last week
- ☆506Updated 6 months ago
- Official repository for LongChat and LongEval☆517Updated 9 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆307Updated 5 months ago
- 🤠 Agent-as-a-Judge and DevAI dataset☆324Updated last month
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆345Updated last week