Ai-trainee / o1-flow
Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆19Updated 5 months ago
Alternatives and similar repositories for o1-flow:
Users that are interested in o1-flow are comparing it to the libraries listed below
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆130Updated 2 weeks ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆104Updated this week
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆164Updated 3 weeks ago
- GLM Series Edge Models☆130Updated 3 weeks ago
- Mixture-of-Experts (MoE) Language Model☆185Updated 6 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆65Updated 8 months ago
- ☆180Updated 2 weeks ago
- 使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略☆29Updated this week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆133Updated 2 weeks ago
- connecting humans and agents☆76Updated 3 months ago
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格 式synthetic data☆147Updated 3 months ago
- ☆105Updated last year
- ☆52Updated this week
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆197Updated 2 months ago
- ☆103Updated last month
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆195Updated 2 weeks ago
- Imitate OpenAI with Local Models☆87Updated 6 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 9 months ago
- A LLM-based Agent that predict its tasks proactively.☆319Updated this week
- ☆28Updated 6 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆218Updated 4 months ago
- ☆212Updated 10 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆224Updated 3 weeks ago
- ☆87Updated 11 months ago
- ☆40Updated 4 months ago
- ☆141Updated 8 months ago
- ☆225Updated 10 months ago