groundlight/r1_vlm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/groundlight/r1_vlm)

groundlight / r1_vlm

Build your own visual reasoning model

☆421

Alternatives and similar repositories for r1_vlm

Users that are interested in r1_vlm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

brendanhogan / DeepSeekRL-Extended
View on GitHub
Exploring Applications of GRPO
☆252Aug 25, 2025Updated 11 months ago
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,403Updated this week
groundlight / framegrab
View on GitHub
Library to easily grab frames from cameras or streams
☆25Mar 17, 2026Updated 4 months ago
minosvasilias / simple_grpo
View on GitHub
Simple GRPO scripts and configurations.
☆59Feb 6, 2025Updated last year
nano-R1 / resources
View on GitHub
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Mar 16, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
HarleyCoops / smolThinker-.5B
View on GitHub
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14Updated this week
tyler-romero / microR1
View on GitHub
Simple repository for training small reasoning models
☆51Feb 17, 2026Updated 5 months ago
tom-doerr / simpledspy
View on GitHub
☆114May 9, 2026Updated 2 months ago
s-smits / grpo-optuna
View on GitHub
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆60Oct 18, 2025Updated 9 months ago
uclanlp / OpenVLThinker
View on GitHub
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆155May 25, 2026Updated 2 months ago
StarsfieldAI / R1-V
View on GitHub
Witness the aha moment of VLM with less than $3.
☆4,065May 19, 2025Updated last year
McGill-NLP / nano-aha-moment
View on GitHub
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆626Oct 7, 2025Updated 9 months ago
xjdr-alt / simple_transformer
View on GitHub
Simple Transformer in Jax
☆143Jun 22, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bespokelabsai / verifiers
View on GitHub
Verifiers for LLM Reinforcement Learning
☆81Jul 17, 2026Updated last week
haizelabs / j1-micro
View on GitHub
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆105Jul 19, 2025Updated last year
doomslide / autoloom
View on GitHub
Approximating the joint distribution of language models via MCTS
☆22Nov 3, 2024Updated last year
tokenbender / avataRL
View on GitHub
rl from zero pretrain, can it be done? yes.
☆295Sep 28, 2025Updated 9 months ago
PrimeIntellect-ai / prime-rl
View on GitHub
Agentic RL Training at Scale
☆1,725Updated this week
collinear-ai / spider
View on GitHub
Streamline on-policy/off-policy distillation workflows in a few lines of code
☆107Updated this week
sail-sg / understand-r1-zero
View on GitHub
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,268Aug 27, 2025Updated 11 months ago
xjdr-alt / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆3,433Nov 13, 2024Updated last year
ivanleomk / modal-grpo
View on GitHub
☆19Mar 16, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MetaStone-AI / XBai-o4
View on GitHub
[ICLR2026] Test-Time Scaling with Reflective Generative Model
☆299Jan 28, 2026Updated 5 months ago
Alex-Gurung / ReasoningNCP
View on GitHub
Official repo for Learning to Reason for Long-Form Story Generation
☆78Apr 19, 2025Updated last year
TencentARC / SEED-Bench-R1
View on GitHub
☆100Jun 23, 2025Updated last year
sail-sg / oat
View on GitHub
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆666Jan 29, 2026Updated 5 months ago
Jiayi-Pan / TinyZero
View on GitHub
Minimal reproduction of DeepSeek R1-Zero
☆13,204Feb 27, 2026Updated 5 months ago
JacksonCakes / vision-r1
View on GitHub
☆13Mar 23, 2025Updated last year
brendanhogan / picoDeepResearch
View on GitHub
☆69May 23, 2025Updated last year
om-ai-lab / VLM-R1
View on GitHub
Solve Visual Understanding with Reinforced VLMs
☆6,015Jul 7, 2026Updated 2 weeks ago
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 6 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
lamm-mit / PRefLexOR
View on GitHub
Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning
☆244Feb 24, 2025Updated last year
EvolvingLMMs-Lab / open-r1-multimodal
View on GitHub
A fork to add multimodal model training to open-r1
☆1,594Feb 8, 2025Updated last year
microsoft / ArchScale
View on GitHub
Simple & Scalable Pretraining for Neural Architecture Research
☆340Mar 31, 2026Updated 3 months ago
MoonshotAI / Moonlight
View on GitHub
Muon is Scalable for LLM Training
☆1,514Aug 3, 2025Updated 11 months ago
kubernetes-bad / reward-composer
View on GitHub
Lego for GRPO
☆30May 27, 2025Updated last year
the-laughing-monkey / agent-rl
View on GitHub
Scripts for training Qwen 2.5 VL with ms-swift and GRPO
☆12Feb 27, 2025Updated last year
SakanaAI / self-adaptive-llms
View on GitHub
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,221Jan 30, 2025Updated last year