Ai-trainee / o1-flow
Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆19Updated 8 months ago
Alternatives and similar repositories for o1-flow
Users who are interested in o1-flow are comparing it to the libraries listed below
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆142Updated 2 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆145Updated 2 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆72Updated 10 months ago
- GLM Series Edge Models☆139Updated 3 months ago
- ☆139Updated 4 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆169Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆74Updated last week
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆168Updated 2 weeks ago
- Mixture-of-Experts (MoE) Language Model☆188Updated 8 months ago
- ☆80Updated this week
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆22Updated 6 months ago
- Enjoy easier conversations with LLM☆36Updated 2 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆201Updated 3 months ago
- ☆91Updated last year
- ☆29Updated 9 months ago
- connecting humans and agents☆84Updated 5 months ago
- ☆175Updated last month
- ☆41Updated 7 months ago
- Official code for Dynamic Parametric RAG.☆123Updated last week
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 11 months ago
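- Uses langchain for task planning and builds conversational scenario resources for each subtask; an MCTS task executor lets each subtask draw on the resources in its context and explore through self-reflection to find its own best answer to the problem. This approach relies on the model's alignment preferences; for each preference we designed an engineering framework that implements a sampling strategy over self-assigned rewards for different answers.☆29Updated 3 weeks ago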
- ☆131Updated this week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆401Updated last month
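- Uses free LLM APIs together with your private-domain data to generate SFT training data (entirely free of charge); supports the training data formats of tools such as llamafactory. Synthetic data☆161Updated 6 months ago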
- The evaluation benchmark on MCP servers☆113Updated last week
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆134Updated 3 weeks ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆222Updated this week
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 5 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆202Updated last month
- ☆94Updated 5 months ago