shibing624 / open-o1
open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains
☆115Updated 3 months ago
Alternatives and similar repositories for open-o1:
Users who are interested in open-o1 are comparing it to the libraries listed below.
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆215Updated last week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆282Updated last week
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆133Updated last month
- ☆94Updated 4 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆138Updated last month
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability☆147Updated 2 weeks ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆236Updated last week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆157Updated this week
- ☆146Updated last month
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆47Updated 5 months ago
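- The simplest reproduction of R1 results on small models, explaining the most important essence of o1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.☆44Updated 2 months ago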
- ☆91Updated last year
- ☆36Updated 7 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆93Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆101Updated last month
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…☆269Updated 8 months ago
- A lightweight script for converting HTML pages to Markdown format, with support for code blocks☆79Updated last year
- ☆82Updated 5 months ago
- GLM Series Edge Models☆136Updated 2 months ago
- Uses the latest graphrag interface, with a local ollama instance providing the LLM interface. Supports installation via pip.☆147Updated 6 months ago
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆126Updated 3 months ago
- This is InfiniRetri, a tool that enhances the ability of Transformer-based LLMs (Large Language Models) to handle long context.☆85Updated 3 weeks ago
- ☆39Updated 11 months ago
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆126Updated 3 months ago
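- Use free LLM APIs together with your private-domain data to generate SFT training data (entirely for free). Supports the training data formats of tools such as llamafactory. Synthetic data.☆155Updated 5 months ago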
- ☆92Updated 2 months ago
- ☆63Updated 7 months ago
- 🌐 WebWalker: Benchmarking LLMs in Web Traversal☆384Updated 2 weeks ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 7 months ago
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆65Updated 9 months ago