sail-sg / FlowReasoner
☆128 · Updated 3 months ago
Alternatives and similar repositories for FlowReasoner
Users who are interested in FlowReasoner are comparing it to the repositories listed below.
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆99 · Updated last month
- ☆212 · Updated 5 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models ☆39 · Updated 3 months ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? ☆79 · Updated 6 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆154 · Updated last month
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning" ☆54 · Updated 3 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆69 · Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆120 · Updated 9 months ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆233 · Updated 3 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆32 · Updated 3 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆71 · Updated last month
- ☆114 · Updated 6 months ago
- ☆189 · Updated 2 months ago
- ☆61 · Updated 2 weeks ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ☆65 · Updated 5 months ago
- Code for the paper: "Learning to Reason without External Rewards" ☆344 · Updated 3 weeks ago
- The code of RouterDC ☆65 · Updated 3 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆77 · Updated last month
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search ☆105 · Updated 2 months ago
- A Comprehensive Library for Memory of LLM-based Agents. ☆56 · Updated 2 months ago
- Process Reward Models That Think ☆47 · Updated last month
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆91 · Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆67 · Updated 2 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref… ☆63 · Updated 5 months ago
- ☆47 · Updated 5 months ago
- Large language models for document ranking. ☆64 · Updated 2 months ago
- ☆126 · Updated 2 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples ☆103 · Updated last week
- Official code repository for Sketch-of-Thought (SoT) ☆125 · Updated 3 months ago
- [Preprint 2025] Thinkless: LLM Learns When to Think ☆215 · Updated last month