sail-sg / FlowReasoner
☆108Updated last week
Alternatives and similar repositories for FlowReasoner
Users that are interested in FlowReasoner are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆193Updated last week
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated 2 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆132Updated last month
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆52Updated 2 months ago
- Official code repository for Sketch-of-Thought (SoT)☆112Updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆97Updated 6 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- ☆46Updated last week
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆53Updated 2 weeks ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆74Updated 2 months ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆143Updated last week
- ☆65Updated 2 weeks ago
- ☆201Updated 2 months ago
- Process Reward Models That Think☆32Updated this week
- Large language models for document ranking.☆52Updated this week
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆20Updated 2 months ago
- [ICML'25] Multi-agent Architecture Search via Agentic Supernet☆52Updated 2 weeks ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆53Updated last month
- The code of RouterDC☆62Updated last month
- Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)☆87Updated 2 weeks ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆36Updated 9 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆56Updated 2 months ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆78Updated 3 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆92Updated 2 months ago
- The official repo for the code and data of paper SMART☆26Updated 2 months ago
- ☆93Updated 3 months ago
- ☆110Updated 3 months ago
- ☆77Updated 6 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆81Updated last month
- ☆97Updated this week