fairyshine / Seal-ToolsLinks

The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark.

☆52

Alternatives and similar repositories for Seal-Tools

Users that are interested in Seal-Tools are comparing it to the libraries listed below

Sorting:

Junjie-Ye / ToolEyes
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆68Updated 2 months ago
GAIR-NLP / OPO
☆50Updated last year
MadeAgents / Hammer
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
☆92Updated last month
Zheng0428 / COIG-Kun
☆36Updated 10 months ago
GAIR-NLP / ReAlign
Reformatted Alignment
☆113Updated 10 months ago
thu-coai / ComplexBench
Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)
☆89Updated 5 months ago
meowpass / FollowComplexInstruction
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆50Updated last year
FlagOpen / Infinity-Instruct
☆48Updated last year
thu-coai / CritiqueLLM
☆144Updated last year
ZitongYang / Synthetic_Continued_Pretraining
Code implementation of synthetic continued pretraining
☆123Updated 6 months ago
swt-user / DMPO
☆43Updated 9 months ago
SqueezeAILab / LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
☆186Updated last year
weizhepei / InstructRAG
[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
☆112Updated 5 months ago
RUCAIBox / SimpleDeepSearcher
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
☆91Updated 2 months ago
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆103Updated 7 months ago
tianyi-lab / Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆166Updated last month
THUDM / LongReward
☆56Updated 9 months ago
JoeYing1019 / UltraTool
[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
☆57Updated last year
yongchao98 / PROMST
Automatic prompt optimization framework for multi-step agent tasks.
☆32Updated 8 months ago
HowieHwong / MetaTool
[ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
☆90Updated last year
csitfun / LogiQA2.0
Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks
☆98Updated last year
nick7nlp / Counting-Stars
Counting-Stars (★)
☆83Updated last month
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51Updated last month
facebookresearch / FnCTOD
Official code for the publication "Large Language Models as Zero-shot Dialogue State Tracker through Function Calling" https//arxiv.org/a…
☆63Updated 11 months ago
pldlgb / nuggets
☆84Updated last year
OFA-Sys / DiverseEvol
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
☆83Updated last year
PALIN2018 / BrowseComp-ZH
☆86Updated 2 months ago
NumberChiffre / mcts-llm
☆95Updated 7 months ago
ACEBench / ACEBench
☆71Updated last week
yegcjs / mixinglaws
☆103Updated 2 weeks ago