☆145May 6, 2025Updated last year
Alternatives and similar repositories for FlowReasoner
Users that are interested in FlowReasoner are comparing it to the libraries listed below.
- [NeurIPS 2025] An official source code for paper "GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning".☆123Feb 22, 2026Updated 3 months ago
- [IEEE T-PAMI] Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Larg…☆236May 31, 2026Updated last week
- ☆17Apr 9, 2025Updated last year
- ☆19May 17, 2025Updated last year
- ☆11Aug 10, 2024Updated last year
- [ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".☆173May 19, 2025Updated last year
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆25Oct 22, 2025Updated 7 months ago
- ☆89Oct 28, 2024Updated last year
- ☆35Apr 22, 2025Updated last year
- MAPO: MIXED ADVANTAGE POLICY OPTIMIZATION☆39Sep 24, 2025Updated 8 months ago
- ☆18Mar 2, 2026Updated 3 months ago
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆20Oct 13, 2025Updated 7 months ago
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"☆16Dec 4, 2024Updated last year
- [Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances☆20Jan 15, 2026Updated 4 months ago
- Defeating the Training-Inference Mismatch via FP16☆194Nov 14, 2025Updated 6 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆274Nov 13, 2025Updated 6 months ago
- ☆80Nov 19, 2024Updated last year
- ☆21Apr 16, 2025Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆55Jul 15, 2025Updated 10 months ago
- ☆10Feb 22, 2023Updated 3 years ago
- Benchmark for Heterogeneous Federated Learning by MARS Group at Wuhan University, led by Prof. Mang Ye.☆19May 29, 2023Updated 3 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 8 months ago
- Generate Python Package with Simple Prompts☆75Nov 22, 2024Updated last year
- AWM: Agent Workflow Memory☆442Dec 22, 2025Updated 5 months ago
- ☆25Jan 17, 2025Updated last year
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆20Jul 27, 2025Updated 10 months ago
- On Policy Distillation Built on top of Verl☆69May 25, 2026Updated 2 weeks ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆98May 22, 2025Updated last year
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆65Jan 5, 2026Updated 5 months ago
- Codes for "MixupE: Understanding and Improving Mixup from Directional Derivative Perspective" UAI 2023 Oral☆30Aug 30, 2023Updated 2 years ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Apr 20, 2025Updated last year
- [NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning☆55Apr 16, 2026Updated last month
- ☆31May 30, 2025Updated last year
- ☆101Jun 23, 2025Updated 11 months ago
- ☆19Aug 4, 2025Updated 10 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated last year
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆34Oct 26, 2025Updated 7 months ago
- moodist☆28Apr 23, 2026Updated last month