Gen-Verse / ScoreFlow
Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"
☆59Updated last month
Alternatives and similar repositories for ScoreFlow:
Users that are interested in ScoreFlow are comparing it to the libraries listed below
- connecting humans and agents☆80Updated 3 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆63Updated last month
- ☆44Updated 3 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆127Updated 2 weeks ago
- ☆94Updated 3 months ago
- ☆56Updated 6 months ago
- ☆47Updated last month
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆86Updated 5 months ago
- ☆82Updated last month
- Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completio…☆83Updated last week
- ☆102Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆75Updated last week
- ☆91Updated 3 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆136Updated this week
- ☆32Updated 3 months ago
- ☆124Updated 3 weeks ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆67Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆83Updated last week
- ☆88Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆40Updated last month
- MPO: Boosting LLM Agents with Meta Plan Optimization☆40Updated 3 weeks ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆112Updated last week
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆84Updated last month
- ☆16Updated 5 months ago
- Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning☆32Updated last month
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆48Updated last month
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆211Updated 2 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 6 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆38Updated 2 months ago