This repository contains the ToolSelect dataset, which was used to fine-tune Llama-2 70B for tool selection.
☆22 · Mar 11, 2024 · Updated 2 years ago
Alternatives and similar repositories for ToolVerifier
Users that are interested in ToolVerifier are comparing it to the libraries listed below.
- ☆18 · Feb 20, 2024 · Updated 2 years ago
- Provides a standard API service through an extension, using VSCode's official LM API ☆26 · Mar 30, 2026 · Updated last month
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Jul 3, 2024 · Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆71 · Aug 5, 2025 · Updated 9 months ago
- ☆21 · Jun 4, 2024 · Updated last year
- Use strategy in stock transaction for high revenue. ☆10 · Dec 24, 2015 · Updated 10 years ago
- ☆32 · May 8, 2025 · Updated last year
- A framework for few-shot evaluation of autoregressive language models. ☆13 · Feb 14, 2024 · Updated 2 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models" ☆14 · Mar 25, 2025 · Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆54 · Jun 6, 2025 · Updated 11 months ago
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs ☆14 · Nov 18, 2023 · Updated 2 years ago
- Reasoning-based Evaluation and Ranking of Translations. ☆20 · Jul 18, 2025 · Updated 10 months ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub! ☆10 · Apr 4, 2024 · Updated 2 years ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆66 · Oct 18, 2024 · Updated last year
- The medical question entailment data introduced in the AMIA 2016 Paper (Recognizing Question Entailment for Medical Question Answering) ☆14 · May 13, 2026 · Updated last week
- Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation" ☆10 · Mar 7, 2022 · Updated 4 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation". ☆10 · Apr 30, 2023 · Updated 3 years ago
- ☆31 · Jun 5, 2025 · Updated 11 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆33 · Feb 27, 2025 · Updated last year
- ☆18 · Mar 19, 2023 · Updated 3 years ago
- Repository for the Exposing Outlier Exposure paper ☆12 · Aug 20, 2024 · Updated last year
- ALBench Leaderboard for active learning in object detection ☆15 · Jan 13, 2023 · Updated 3 years ago
- Web-grounded natural language instructions ☆18 · Nov 25, 2024 · Updated last year
- ☆50 · Mar 14, 2024 · Updated 2 years ago
- A passion project on my favorite e-commerce site that scrapes product data and builds a recommendation engine ☆10 · May 2, 2023 · Updated 3 years ago
- Text classification with Sparse Composite Document Vectors. ☆61 · Jun 29, 2020 · Updated 5 years ago
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents ☆17 · Oct 12, 2024 · Updated last year
- Android app for Zhihu Daily ☆15 · May 28, 2017 · Updated 8 years ago
- A manually annotated Chinese disaster dataset; will be updated continuously. ☆16 · May 12, 2019 · Updated 7 years ago
- ☆15 · Oct 20, 2023 · Updated 2 years ago
- Class Prior Estimation in Active Positive and Unlabeled Learning ☆16 · Mar 24, 2021 · Updated 5 years ago
- R-LPIPS [ICML W 2023] ☆17 · Nov 14, 2023 · Updated 2 years ago
- A pytorch reimplementation of KL-Loss (CVPR'2019) ☆15 · Oct 15, 2023 · Updated 2 years ago
- [ICASSP '26] This is the code repo for our paper: LegalΔ: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thou… ☆30 · Aug 20, 2025 · Updated 9 months ago
- Cerule - A Tiny Mighty Vision Model ☆70 · Nov 9, 2025 · Updated 6 months ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation. ☆59 · Jul 31, 2024 · Updated last year
- ☆18 · May 30, 2023 · Updated 2 years ago
- pytorch reimplementation for Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain ☆11 · Oct 30, 2022 · Updated 3 years ago
- Companion code to https://arxiv.org/abs/2402.15491 ☆22 · Sep 18, 2025 · Updated 8 months ago