shibing624 / open-o1
open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains
☆115Updated 4 months ago
Alternatives and similar repositories for open-o1
Users that are interested in open-o1 are comparing it to the libraries listed below
Sorting:
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆166Updated 2 weeks ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆68Updated 3 weeks ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆219Updated 2 weeks ago
- ☆94Updated 5 months ago
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆126Updated 4 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆142Updated last month
- GLM Series Edge Models☆139Updated 2 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 3 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆230Updated 8 months ago
- [ICML 2025] | From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation☆90Updated last week
- Imitate OpenAI with Local Models☆88Updated 8 months ago
- Official code repository for Sketch-of-Thought (SoT)☆112Updated last week
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆152Updated this week
- Qwen GRPO Graph Extraction RL Finetune☆48Updated last month
- 我们是第一个完全可商用的角色大模型。☆40Updated 9 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆223Updated 4 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆142Updated 2 months ago
- Search, organize, discover anything!☆48Updated last year
- Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆19Updated 7 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 4 months ago
- ☆94Updated 5 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆363Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆297Updated 6 months ago
- ☆40Updated last year
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆107Updated last month
- Auto Thinking Mode switch for Qwen3 in Open webui☆60Updated last week
- ☆36Updated 8 months ago
- ☆151Updated 2 weeks ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆94Updated 3 months ago