fairyshine / Chain-of-ToolsView external linksLinks
The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".
☆87Mar 25, 2025Updated 10 months ago
Alternatives and similar repositories for Chain-of-Tools
Users that are interested in Chain-of-Tools are comparing it to the libraries listed below
Sorting:
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆25Aug 26, 2025Updated 5 months ago
- ☆22Sep 2, 2025Updated 5 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 4 months ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Jan 11, 2025Updated last year
- ScaledMCP is a horizontally scalabled MCP and A2A Server. You know, for AI.☆44Aug 11, 2025Updated 6 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆14Oct 4, 2024Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 6 months ago
- Task management for AI agents☆15Jun 25, 2025Updated 7 months ago
- Change Point Detection in Time Series☆14Mar 15, 2023Updated 2 years ago
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…☆15Mar 16, 2024Updated last year
- ☆61Sep 18, 2025Updated 5 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Apr 11, 2025Updated 10 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- Work in progress! I don't recommend looking at the code right now.☆24Dec 3, 2025Updated 2 months ago
- [NeurIPS 2025 D&B Spotlight] CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays☆31Oct 23, 2025Updated 3 months ago
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- Example autonomous project that searches HN create todos☆18Jan 4, 2025Updated last year
- WebSage is an AI Engine that extracts content from any URL, generates summaries, and enables interaction using AI models. Choose between …☆16Feb 25, 2025Updated 11 months ago
- ☆43May 29, 2025Updated 8 months ago
- ☆17Aug 7, 2024Updated last year
- LangGraph-powered ReAct agent with Model Context Protocol (MCP) integration. A Streamlit web interface for dynamically configuring, deplo…☆684Apr 14, 2025Updated 10 months ago
- An interactive integration of yFiles for HTML with LlamaIndex to visualize the knowledge graph used for query resolution.☆50Mar 10, 2025Updated 11 months ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆45Jan 22, 2026Updated 3 weeks ago
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- ☆32Oct 13, 2025Updated 4 months ago
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated last week
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆989Sep 26, 2025Updated 4 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆96May 16, 2025Updated 9 months ago
- AI Agent which deep crawls a company website and generates a comprehensive PDF report.☆22Feb 14, 2025Updated last year
- ☆36Oct 9, 2025Updated 4 months ago
- Comprehensive metrics, insights, and visualization for Agno and Crew AI applications☆26May 21, 2025Updated 8 months ago