This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.
☆22Mar 11, 2024Updated 2 years ago
Alternatives and similar repositories for ToolVerifier
Users that are interested in ToolVerifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Jun 4, 2024Updated last year
- ☆31May 8, 2025Updated 10 months ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- ☆33Aug 26, 2025Updated 6 months ago
- A Python package for extracting confidence scores from LLM models outputs, particularly using log probabilities.☆19Sep 15, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆13Feb 14, 2024Updated 2 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated 11 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆54Jun 6, 2025Updated 9 months ago
- A fast, lightweight Go-based CLI tool to detect and manage processes using network ports—featuring project awareness, Docker support, and…☆36Jun 5, 2025Updated 9 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated last year
- Copy the style from one image to another☆41Apr 19, 2024Updated last year
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆66Oct 18, 2024Updated last year
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 8 months ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 2 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 9 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Feb 27, 2025Updated last year
- ☆18Mar 19, 2023Updated 3 years ago
- Repository for the Exposing Outlier Exposure paper☆12Aug 20, 2024Updated last year
- insight data engineering fellow project☆16Nov 14, 2016Updated 9 years ago
- ALBench Leaderboard for active learning in object detection☆15Jan 13, 2023Updated 3 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year
- A passion project on my favorite e-commerce site that scrapes product data and builds a recommendation engine☆10May 2, 2023Updated 2 years ago
- CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models☆15Oct 14, 2024Updated last year
- Android app for Zhihu Daily☆15May 28, 2017Updated 8 years ago
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- ☆12Apr 17, 2024Updated last year
- Class Prior Estimation in Active Positive and Unlabeled Learning☆16Mar 24, 2021Updated 4 years ago
- R-LPIPS [ICML W 2023]☆17Nov 14, 2023Updated 2 years ago
- A pytorch reimplementation of KL-Loss (CVPR'2019)☆15Oct 15, 2023Updated 2 years ago
- "자연어처리 알고리즘을 활용한 느린학습자 교육 컨텐츠 제작" 프로젝트 "애움길" 팀입니다. 데이터 수집(크롤링)/EDA/Preprocessing, 쉬운말 생성요약 AI 모델링(NLP - KoBERT, KoBART), 프로토타입 제작을 진행했습니다…☆13Mar 24, 2022Updated 4 years ago
- This is a new version of Learning Active Learning which uses reinforcement learning☆13May 17, 2022Updated 3 years ago
- ☆16Mar 6, 2025Updated last year
- pytorch reimplementation for Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain☆11Oct 30, 2022Updated 3 years ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 6 months ago
- Print an image of a cat to the iTerm2 terminal☆14Feb 7, 2017Updated 9 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Exploring the Efficacy of Idiomify: How Effective is GPT-3 for Teaching Idioms to EFL Writers?☆16Aug 9, 2022Updated 3 years ago