shibing624 / open-o1
open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains
☆115Updated 5 months ago
Alternatives and similar repositories for open-o1
Users who are interested in open-o1 are comparing it to the libraries listed below:
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆146Updated 2 months ago
- ☆94Updated 6 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆224Updated last week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆169Updated this week
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆142Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆114Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆420Updated last month
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆165Updated 2 months ago
- Search, organize, discover anything!☆48Updated last year
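- The simplest reproduction of R1 results on small models, explaining the most important essence of o1-like models and DeepSeek R1. Think is all your need. Backed by experiments: for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.☆45Updated 4 months ago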
- Imitate OpenAI with Local Models☆87Updated 9 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆226Updated 4 months ago
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…☆284Updated 10 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆195Updated last month
- Enjoy easier conversations with LLM☆37Updated 2 months ago
- FuseAI Project☆87Updated 4 months ago
- Repo for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆66Updated 10 months ago
- The evaluation benchmark on MCP servers☆115Updated 2 weeks ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆76Updated 2 weeks ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆248Updated 3 weeks ago
- ☆102Updated 6 months ago
- ☆222Updated last year
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆106Updated 2 weeks ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆156Updated this week
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆127Updated 5 months ago
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆452Updated last month
- ☆91Updated last year
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆191Updated this week
- Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"☆32Updated 7 months ago
- LongQLoRA: Extend Context Length of LLMs Efficiently☆166Updated last year