Ai-trainee / o1-flow
Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆19Updated 6 months ago
Alternatives and similar repositories for o1-flow:
Users who are interested in o1-flow are comparing it to the libraries listed below
- ☆214Updated 11 months ago
- GLM Series Edge Models☆132Updated last month
- ☆88Updated 11 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆129Updated 9 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆133Updated this week
- connecting humans and agents☆78Updated 3 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆173Updated last month
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆119Updated last week
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆131Updated 3 weeks ago
- An open-source LLM based on an MoE structure.☆58Updated 8 months ago
- Imitate OpenAI with Local Models☆88Updated 6 months ago
- Mixture-of-Experts (MoE) Language Model☆185Updated 6 months ago
- ☆181Updated last month
- ☆29Updated 6 months ago
- ☆32Updated 3 months ago
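- Uses LangChain for task planning and builds conversation-scenario resources for each subtask. An MCTS task executor lets each subtask draw on the resources in its context and explore through self-reflection to find its best answer to the problem. This approach depends on the model's alignment preferences; for each preference we designed an engineering framework that implements a sampling strategy over the model's own rewards for different answers.☆29Updated 2 weeks ago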
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆274Updated this week
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- ☆110Updated 2 months ago
- Delta-CoMe achieves near-lossless 1-bit compression; accepted at NeurIPS 2024.☆54Updated 4 months ago
- ☆142Updated 8 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆211Updated 2 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆215Updated 2 months ago
- SUS-Chat: Instruction tuning done right☆48Updated last year
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent☆115Updated 3 months ago
- HuixiangDou2: A Robustly Optimized GraphRAG Approach☆92Updated this week
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆63Updated last month
- ☆55Updated this week
- SuperCLUE-Agent: A benchmark for evaluating core agent capabilities on native Chinese tasks☆83Updated last year
- ☆40Updated 4 months ago