fairyshine / Chain-of-Tools
The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".
☆70 Updated last month
Alternatives and similar repositories for Chain-of-Tools
Users who are interested in Chain-of-Tools compare it to the repositories listed below.
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆90 Updated 2 months ago
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation. ☆65 Updated 2 weeks ago
- ☆65 Updated 2 months ago
- Toy O ☆16 Updated 7 months ago
- ☆27 Updated last month
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM. ☆46 Updated 3 weeks ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models ☆30 Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning ☆50 Updated last month
- ☆91 Updated last month
- Agentic Knowledgeable Self-awareness ☆56 Updated last month
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆53 Updated last month
- Official code repository for Sketch-of-Thought (SoT) ☆112 Updated last week
- Complex Function Calling Benchmark. ☆100 Updated 3 months ago
- ☆61 Updated 10 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 Updated last year
- ☆45 Updated last month
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆132 Updated last month
- Code for ExploreTom ☆83 Updated 5 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆137 Updated 11 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆97 Updated 6 months ago
- ☆24 Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆53 Updated this week
- 1-Click is all you need. ☆61 Updated last year
- ☆201 Updated 2 months ago
- ☆45 Updated 7 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆65 Updated 10 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆32 Updated 3 months ago
- ☆50 Updated 3 months ago
- accompanying material for sleep-time compute paper ☆83 Updated 2 weeks ago
- Build complex LLM Applications with Python Dictionary ☆40 Updated 7 months ago