sail-sg/FlowReasoner

☆145

Alternatives and similar repositories for FlowReasoner

Users who are interested in FlowReasoner are comparing it to the repositories listed below.

yueliu1999 / GuardReasoner-VL
[NeurIPS 2025] Official source code for the paper "GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning".
☆123 · Feb 22, 2026 · Updated 5 months ago
shuzhangzhong / HybriMoE-Preview
☆17 · Apr 9, 2025 · Updated last year
EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
☆19 · May 17, 2025 · Updated last year
SnoopX-AI / Awesome-Weak-to-Strong-Generalization
☆11 · Aug 10, 2024 · Updated last year
Hongcheng-Gao / HAVEN
Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".
☆25 · Oct 22, 2025 · Updated 8 months ago
rainavyas / attack-comparative-assessment
Adversarial attack on comparative assessment with Large Language Models
☆13 · May 21, 2025 · Updated last year
liuqi6777 / llm4ranking
Large language models for document ranking.
☆75 · May 20, 2026 · Updated 2 months ago
TuringEyeTest / TuringEyeTest
Pixels, Patterns, but no Poetry: To See the World like Humans
☆18 · Aug 11, 2025 · Updated 11 months ago
MinorJerry / OpenWebVoyager
☆89 · Oct 28, 2024 · Updated last year
LirongWu / Homophily-Enhanced-Self-supervision
Code for TNNLS paper "Homophily-Enhanced Self-supervision for Graph Structure Learning: Insights and Directions"
☆15 · Feb 27, 2024 · Updated 2 years ago
hkust-nlp / model-task-align-rl
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18 · Feb 9, 2026 · Updated 5 months ago
sail-sg / tty-use
☆15 · Oct 13, 2025 · Updated 9 months ago
sail-sg / ActivePRM
☆21 · Apr 16, 2025 · Updated last year
WestlakeAI / DrugGPT
☆10 · Feb 22, 2023 · Updated 3 years ago
thu-wyz / inference_scaling
☆80 · Nov 19, 2024 · Updated last year
DayuHuu / scDFC
[BIB 2023] scDFC: A deep fusion clustering method for single-cell RNA-seq data
☆10 · Nov 27, 2025 · Updated 7 months ago
amazon-science / factual-confidence-of-llms
Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"
☆17 · Dec 4, 2024 · Updated last year
AndrewWTY / UNIT
☆33 · Jun 24, 2025 · Updated last year
sail-sg / feedback-conditional-policy
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65 · Jan 5, 2026 · Updated 6 months ago
sail-sg / Meta-Unlearning
☆35 · Apr 22, 2025 · Updated last year
Linzwcs / AFT
☆13 · Jan 22, 2025 · Updated last year
XuankunRong / SafeGRPO
[CVPR'26] SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
☆21 · Feb 19, 2026 · Updated 5 months ago
zhiyuanhubj / LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
☆79 · Oct 16, 2024 · Updated last year
siyan-zhao / ICL_decision_boundary
Official code for the paper "Probing the Decision Boundaries of In-context Learning in Large Language Models". https://arxiv.org/abs/2406.11233…
☆20 · Jul 27, 2025 · Updated 11 months ago
GitsSaikat / PyGen
Generate Python Package with Simple Prompts
☆75 · Nov 22, 2024 · Updated last year
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 · Aug 8, 2024 · Updated last year
WenkeHuang / MAPO
MAPO: Mixed Advantage Policy Optimization
☆35 · Sep 24, 2025 · Updated 9 months ago
XuankunRong / BYE
[NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning
☆20 · Oct 13, 2025 · Updated 9 months ago
Gen-Verse / ScoreFlow
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization
☆97 · May 22, 2025 · Updated last year
facebookresearch / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆35 · Apr 20, 2025 · Updated last year
sail-sg / Precision-RL
Defeating the Training-Inference Mismatch via FP16
☆197 · Nov 14, 2025 · Updated 8 months ago
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆54 · Jul 15, 2025 · Updated last year
zjunlp / unlearn
[ACL 2025] Knowledge Unlearning for Large Language Models
☆49 · Sep 18, 2025 · Updated 10 months ago
TianyuanYang / KAN4Rec
Implementation of Kolmogorov-Arnold Network (KAN) for Recommendations
☆29 · May 15, 2024 · Updated 2 years ago
mandyyyyii / east
☆19 · Aug 4, 2025 · Updated 11 months ago
sail-sg / dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆47 · Apr 15, 2025 · Updated last year
TencentARC / SEED-Bench-R1
☆100 · Jun 23, 2025 · Updated last year
anpaure / cp_eval
Tiny evaluation of leading LLMs on competitive programming problems
☆14 · Apr 10, 2026 · Updated 3 months ago
mwatkins1970 / SAE_Feature_Interpretability_Tool
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…
☆19 · Oct 4, 2024 · Updated last year