SqueezeAILab / Tool2VecLinks
Efficient and Scalable Estimation of Tool Representations in Vector Space
☆23Updated 9 months ago
Alternatives and similar repositories for Tool2Vec
Users that are interested in Tool2Vec are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆22Updated 2 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Official Repository for Task-Circuit Quantization☆20Updated 3 weeks ago
- Self-host LLMs with LMDeploy and BentoML☆20Updated 2 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- Reasoning by Communicating with Agents☆29Updated last month
- ☆36Updated last month
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆74Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆42Updated this week
- ☆15Updated 2 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 9 months ago
- ☆80Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆41Updated 6 months ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆28Updated 7 months ago
- Compression for Foundation Models☆31Updated 3 months ago
- ☆47Updated 2 weeks ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Updated last year
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆40Updated 2 weeks ago
- ☆51Updated 7 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆124Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 8 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- A repository for research on medium sized language models.☆76Updated last year
- The repository contains generative AI analytics platform application code.☆26Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆20Updated 6 months ago