Ai-trainee / o1-flowLinks
Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆19Updated 9 months ago
Alternatives and similar repositories for o1-flow
Users that are interested in o1-flow are comparing it to the libraries listed below
Sorting:
- GLM Series Edge Models☆142Updated last week
- Enjoy easier conversations with LLM☆37Updated 3 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆73Updated 11 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆143Updated 2 months ago
- ☆91Updated last year
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆147Updated 3 months ago
- A small open source 3D agent simulator based on LLM.☆66Updated 6 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆172Updated this week
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆213Updated 4 months ago
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆22Updated 7 months ago
- 使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子 任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略☆29Updated last month
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆199Updated last week
- ☆41Updated 7 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆122Updated 3 weeks ago
- ☆94Updated 6 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 5 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated 2 weeks ago
- An Open Math Pre-trainng Dataset with 370B Tokens.☆89Updated 2 months ago
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆167Updated 6 months ago
- Awesome Code Action - DeepWebSearch AgentKit App. Build with 🤗 Hugging Face smolagents framework☆72Updated this week
- ☆35Updated 6 months ago
- ☆144Updated 5 months ago
- Qwen GRPO Graph Extraction RL Finetune☆49Updated 2 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 8 months ago
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆59Updated 3 months ago
- ☆86Updated last month
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 6 months ago
- 我们是第一个完全可商用的角色大模型。☆40Updated 10 months ago
- ☆95Updated 6 months ago