VLM-RL/Ocean-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/VLM-RL/Ocean-R1)

VLM-RL / Ocean-R1

☆25

Alternatives and similar repositories for Ocean-R1

Users that are interested in Ocean-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fengzi258 / Ocean-R1
View on GitHub
☆29Mar 12, 2025Updated last year
BaichuanSEED / BaichuanSEED.github.io
View on GitHub
Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…
☆18Aug 28, 2024Updated last year
zezeze97 / DFE-GPS
View on GitHub
☆14Jul 15, 2025Updated 10 months ago
voidism / L2KD
View on GitHub
Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123
☆12Jul 13, 2021Updated 4 years ago
GAIR-NLP / LIMR
View on GitHub
☆219Feb 20, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
silverriver / OOD4NLU
View on GitHub
Code for paper "Out-of-domain detection for natural language understanding in dialog systems"
☆10May 27, 2022Updated 3 years ago
MiFei / Continual-Learning-for-NLG
View on GitHub
☆14Sep 22, 2020Updated 5 years ago
chenxy99 / SD-FSIC
View on GitHub
Official code for the paper "Self-Distillation for Few-Shot Image Captioning"
☆18Mar 15, 2021Updated 5 years ago
Victorwz / MLM_Filter
View on GitHub
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
☆71Apr 14, 2025Updated last year
tanhuajie / Reason-RFT
View on GitHub
[NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.
☆287Oct 5, 2025Updated 7 months ago
si0wang / ThinkLite-VL
View on GitHub
☆106Jun 10, 2025Updated 11 months ago
RLHFlow / GVM
View on GitHub
☆16Jul 29, 2025Updated 9 months ago
wizard-III / Archer2.0
View on GitHub
Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…
☆31Oct 10, 2025Updated 7 months ago
PKU-Baichuan-MLSystemLab / SysBench
View on GitHub
SysBench: Can Large Language Models Follow System Messages?
☆40Sep 4, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pengyuLPY / Adversarial-Pose-Regression-Network-for-Pose-Invariant-Face-Recognitions
View on GitHub
☆10Apr 9, 2021Updated 5 years ago
PKU-Baichuan-MLSystemLab / CFBench
View on GitHub
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
☆52Aug 26, 2024Updated last year
GauravGajbhiye / SCAMET_RSIC
View on GitHub
This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.
☆13Aug 10, 2023Updated 2 years ago
Levi-ZJY / SAN
View on GitHub
SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition
☆10Apr 8, 2024Updated 2 years ago
MikeWangWZHL / PAPO
View on GitHub
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆136Feb 4, 2026Updated 3 months ago
EDM-Research / VATr-pp
View on GitHub
☆18Jul 9, 2024Updated last year
uclanlp / OpenVLThinker
View on GitHub
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆149Apr 15, 2026Updated last month
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
Planet-AI-GmbH / tfaip-hybrid-ctc-s2s
View on GitHub
Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"
☆17Oct 13, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TyroneLi / ESOL_WSSS
View on GitHub
☆14Jan 4, 2023Updated 3 years ago
1KE-JI / UPFT
View on GitHub
Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…
☆21Jun 13, 2025Updated 11 months ago
FormalGeo / FormalGeo
View on GitHub
Formal representation and solving for Euclidean plane geometry problems.
☆39May 7, 2026Updated 2 weeks ago
sangbobo / WeChatKeyTips
View on GitHub
微信消息关键字提醒，获取微信消息，并匹配消息关键字，向邮箱发送带有关键字的消息
☆13May 10, 2017Updated 9 years ago
aquastripe / DenseCLIP
View on GitHub
An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP"
☆24Jan 27, 2022Updated 4 years ago
neopen / story-shot-agent
View on GitHub
剧本分镜智能体（PenShot）：电影/动漫/短剧/小说/剧本→分镜→片段→prompt | 基于 LangGraph+LLM，自动解析任意格式剧本，生成 Sora/Veo/Runway 等模型可用的连贯text-to-video提示词。保持角色/剧情跨片段一致，支持 MC…
☆69May 15, 2026Updated last week
conceptmath / conceptmath
View on GitHub
[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …
☆25May 29, 2024Updated last year
XenoZLH / Shuffle-R1
View on GitHub
Official code repository of Shuffle-R1
☆26Feb 23, 2026Updated 2 months ago
zhanggefan / mmdet3d-gaussian
View on GitHub
☆17Nov 23, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
GuangyanS / Sys2-LLaVA
View on GitHub
☆31Feb 10, 2025Updated last year
PKU-Baichuan-MLSystemLab / PAS
View on GitHub
☆53Sep 11, 2024Updated last year
HansRen1024 / C-OF
View on GitHub
Official implementation for paper "A Real-Time and Long-Term Face Tracking Method Using Convolutional Neural Network and Optical Flow for…
☆20Aug 1, 2022Updated 3 years ago
eeric / insightface
View on GitHub
Face Analysis Project on MXNet
☆11Dec 1, 2021Updated 4 years ago
vvvvvjdy / D-OPSD
View on GitHub
Official Repo of "D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models"
☆137May 13, 2026Updated last week
ljwztc / MedChain
View on GitHub
The repository for "MedChain: Bridging the Gap Between LLM Agents and Real-World Clinical Decision Making"
☆52Apr 8, 2026Updated last month
wizard-III / ArcherCodeR
View on GitHub
ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement …
☆44Aug 6, 2025Updated 9 months ago