☆144May 6, 2025Updated 9 months ago
Alternatives and similar repositories for FlowReasoner
Users that are interested in FlowReasoner are comparing it to the libraries listed below
- [NeurIPS 2025] An official source code for paper "GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning".☆117Feb 22, 2026Updated last week
- ☆19May 17, 2025Updated 9 months ago
- ☆17Apr 9, 2025Updated 10 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆236Jun 14, 2025Updated 8 months ago
- Large language models for document ranking.☆71Jan 13, 2026Updated last month
- rmp data ranking☆13Nov 4, 2025Updated 3 months ago
- Adversarial attack comparative assessment Large Language Model☆13May 21, 2025Updated 9 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆48Sep 18, 2025Updated 5 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 7 months ago
- Defeating the Training-Inference Mismatch via FP16☆182Nov 14, 2025Updated 3 months ago
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆29Oct 26, 2025Updated 4 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 10 months ago
- ☆20Apr 16, 2025Updated 10 months ago
- [ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".☆168May 19, 2025Updated 9 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- ☆23Jan 17, 2025Updated last year
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆23Oct 22, 2025Updated 4 months ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- ☆87Oct 28, 2024Updated last year
- MAPO: MIXED ADVANTAGE POLICY OPTIMIZATION☆38Sep 24, 2025Updated 5 months ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- ☆53Feb 11, 2025Updated last year
- ☆98Jun 23, 2025Updated 8 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 9 months ago
- ☆14Mar 20, 2025Updated 11 months ago
- ☆25Aug 19, 2025Updated 6 months ago
- ☆16Feb 22, 2025Updated last year
- ☆14Apr 14, 2025Updated 10 months ago
- ☆74Jun 28, 2025Updated 8 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆59Jan 5, 2026Updated last month
- AWM: Agent Workflow Memory☆397Dec 22, 2025Updated 2 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- [BIB 2023] scDFC: A deep fusion clustering method for single-cell RNA-seq data☆10Nov 27, 2025Updated 3 months ago
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆20Jun 16, 2025Updated 8 months ago
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆17Oct 13, 2025Updated 4 months ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- ☆11Feb 22, 2023Updated 3 years ago
- ☆15Jan 12, 2026Updated last month