sail-sg / FlowReasonerLinks
☆142Updated 7 months ago
Alternatives and similar repositories for FlowReasoner
Users that are interested in FlowReasoner are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆254Updated 7 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆136Updated last year
- ☆226Updated 9 months ago
- SSRL: Self-Search Reinforcement Learning☆158Updated 3 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆115Updated 6 months ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆82Updated 10 months ago
- Process Reward Models That Think☆63Updated 2 weeks ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆211Updated 2 months ago
- Demystifying Reinforcement Learning in Agentic Reasoning☆126Updated last month
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated this week
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆85Updated last week
- A Comprehensive Library for Memory of LLM-based Agents.☆92Updated 7 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆93Updated 6 months ago
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆56Updated 7 months ago
- ☆104Updated 2 months ago
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆88Updated 3 months ago
- ☆72Updated 6 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆230Updated 2 weeks ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆70Updated 6 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆74Updated 5 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆92Updated 2 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆47Updated 2 months ago
- Official Code Release for "Training a Generally Curious Agent"☆39Updated 6 months ago
- ☆70Updated last month
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆54Updated 2 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆139Updated 2 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆109Updated 6 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆291Updated 2 months ago
- [arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆14Updated 8 months ago
- A-MEM: Agentic Memory for LLM Agents☆187Updated 3 weeks ago