Evaluating tool-augmented LLMs in conversation settings
☆89May 31, 2024Updated last year
Alternatives and similar repositories for ToolTalk
Users that are interested in ToolTalk are comparing it to the libraries listed below
Sorting:
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- ☆18Sep 17, 2025Updated 5 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- Code of LeCoRE☆13Feb 15, 2023Updated 3 years ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆304Apr 3, 2024Updated last year
- Script for downloading GitHub.☆13Sep 24, 2020Updated 5 years ago
- This document attempts to capture useful patterns and warn about subtle gotchas when it comes to designing and evolving schemas for long-…☆13May 25, 2017Updated 8 years ago
- Semantic Functions for Semantic Link☆14Dec 3, 2025Updated 2 months ago
- ☆15Feb 28, 2024Updated 2 years ago
- public dataset for followup-query analysis, accepted by AAAI2019☆15Aug 22, 2019Updated 6 years ago
- m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks☆44Sep 26, 2024Updated last year
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- AIMLBot (Program#) is a small, fast, standards-compliant yet easily customizable .NET dll implementation of an AIML (Artificial Intellige…☆17Apr 11, 2017Updated 8 years ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Jun 25, 2024Updated last year
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 2 years ago
- AI Agent Tools library for Graphlit Platform☆21Jan 14, 2025Updated last year
- ☆22Jul 27, 2023Updated 2 years ago
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆19Dec 2, 2023Updated 2 years ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 5 months ago
- ☆917Jul 24, 2024Updated last year
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆57Feb 18, 2026Updated last week
- ☆48Jan 20, 2026Updated last month
- ☆640Nov 10, 2025Updated 3 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆217Apr 15, 2025Updated 10 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆270Apr 18, 2024Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Jul 21, 2024Updated last year
- Gogoanime and Anilist Scrapper free hosting on cloudflare with tutorial.☆14Mar 2, 2025Updated 11 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆113Jun 13, 2025Updated 8 months ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆20Dec 4, 2023Updated 2 years ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆228Jun 16, 2025Updated 8 months ago
- ☆28Nov 10, 2025Updated 3 months ago
- Neuron Activation☆26Nov 21, 2024Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆24Nov 6, 2024Updated last year
- ☆29May 4, 2024Updated last year
- A fast and reliable screenshot capture API, built on top of Selenium.☆25Jun 10, 2024Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Jul 28, 2025Updated 7 months ago