Ai-trainee / o1-flow
Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆19Updated 6 months ago
Alternatives and similar repositories for o1-flow:
Users that are interested in o1-flow are comparing it to the libraries listed below
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆130Updated last month
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆135Updated 3 weeks ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆62Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆200Updated this week
- ☆125Updated 2 months ago
- ☆70Updated this week
- ☆50Updated 2 months ago
- ☆29Updated 7 months ago
- ☆90Updated last year
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆198Updated last month
- ☆87Updated 2 months ago
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆186Updated last month
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆22Updated 5 months ago
- GLM Series Edge Models☆134Updated last month
- connecting humans and agents☆80Updated 4 months ago
- ☆41Updated 5 months ago
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆122Updated 2 weeks ago
- ☆218Updated 11 months ago
- ☆137Updated last month
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 6 months ago
- ☆314Updated 7 months ago
- ☆47Updated 4 months ago
- ☆94Updated 4 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆221Updated 3 months ago
- Reformatted Alignment☆115Updated 6 months ago
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆187Updated 7 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆236Updated 2 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆214Updated this week
- ☆143Updated 9 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆72Updated 9 months ago