Gen-Verse / ScoreFlow
Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"
☆67Updated 2 months ago
Alternatives and similar repositories for ScoreFlow:
Users that are interested in ScoreFlow are comparing it to the libraries listed below
- ☆94Updated 5 months ago
- connecting humans and agents☆83Updated 5 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- ☆102Updated 5 months ago
- ☆47Updated 4 months ago
- ☆54Updated 2 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆89Updated 2 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆50Updated 2 months ago
- ☆64Updated 7 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆44Updated 4 months ago
- Reformatted Alignment☆115Updated 7 months ago
- ☆92Updated 3 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆128Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆96Updated 6 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆103Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆141Updated 2 weeks ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆53Updated last month
- An Open Math Pre-trainng Dataset with 370B Tokens.☆80Updated last month
- ☆38Updated 4 months ago
- ☆42Updated 6 months ago
- DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking☆45Updated 2 months ago
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆29Updated 3 months ago
- Agentic Knowledgeable Self-awareness☆56Updated 3 weeks ago
- ☆149Updated last week
- ☆56Updated 5 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆137Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- Awesome Agent Training☆96Updated this week
- ☆94Updated 4 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆201Updated 3 weeks ago