chenllliang/G1

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

☆103

Alternatives and similar repositories for G1

Users who are interested in G1 are comparing it to the libraries listed below.

yunfeixie233 / ViGaL
☆70 · Feb 4, 2026 · Updated 5 months ago
Visual-Agent / DeepEyes
☆1,250 · Nov 20, 2025 · Updated 8 months ago
Yifan-Song793 / InfoCL
Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe…
☆14 · Aug 13, 2024 · Updated last year
chenllliang / DnD-Transformer
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆80 · Dec 10, 2024 · Updated last year
RM-R1-UIUC / RM-R1
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167 · Jun 26, 2025 · Updated last year
FranxYao / Retrieval-Head-with-Flash-Attention
Efficient retrieval head analysis with triton flash attention that supports topK probability
☆13 · Jun 15, 2024 · Updated 2 years ago
HJYao00 / MMReason
[ICCV 2025] MMReason, MLLMs, step by step, reasoning benchmark, AGI
☆15 · Apr 25, 2026 · Updated 2 months ago
Vinoground / Vinoground
☆13 · Apr 13, 2026 · Updated 3 months ago
hqhQAQ / Syn-GRPO
[preprint] Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning
☆24 · Nov 25, 2025 · Updated 7 months ago
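depixels / Advanced-LLM-Systems-Learning
This project aims to systematically study and document core knowledge of large language model (LLM) systems, focusing on the principles, mainstream frameworks, and engineering practice of distributed training, inference optimization, and reinforcement learning (RLHF) alignment.
☆15 · Apr 9, 2026 · Updated 3 months ago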
StarsfieldAI / R1-V
Witness the aha moment of VLM with less than $3.
☆4,065 · May 19, 2025 · Updated last year
zhaochen0110 / OpenThinkIMG
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
☆399 · Jun 1, 2025 · Updated last year
vl-rewardbench / VL_RewardBench
☆29 · Jul 23, 2025 · Updated last year
snap-research / VIMI
☆13 · Jul 10, 2024 · Updated 2 years ago
ModalMinds / MM-PRM
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision
☆30 · May 26, 2025 · Updated last year
dongyh20 / Insight-V
[CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
☆240 · Nov 7, 2025 · Updated 8 months ago
real-absolute-AI / SynthRL
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
☆70 · Jul 24, 2025 · Updated last year
pkunlp-icler / IKE
☆25 · Feb 27, 2023 · Updated 3 years ago
Yifan-Song793 / GoodBadGreedy
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆31 · Jul 17, 2024 · Updated 2 years ago
TencentARC / pi-Tuning
Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
☆33 · Jul 21, 2023 · Updated 3 years ago
chenllliang / MMEvalPro
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
☆25 · Sep 26, 2024 · Updated last year
CSfufu / Revisual-R1
[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, mul…
☆212 · Dec 10, 2025 · Updated 7 months ago
JIA-Lab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆162 · May 29, 2025 · Updated last year
chenllliang / DreamEngine
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆123 · Mar 4, 2025 · Updated last year
pkunlp-icler / SCL-RAI
Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022
☆11 · Aug 20, 2022 · Updated 3 years ago
hiyouga / EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆5,081 · Jul 15, 2026 · Updated last week
ziyadsheeba / qfat
[NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer
☆11 · Mar 3, 2026 · Updated 4 months ago
kokolerk / TON
[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆58 · Sep 29, 2025 · Updated 9 months ago
chenllliang / ATP-AMR
Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022
☆15 · Mar 31, 2023 · Updated 3 years ago
MoonshotAI / WorldVQA
☆119 · Feb 4, 2026 · Updated 5 months ago
InternScience / SimChart9K
The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.
☆26 · Feb 22, 2024 · Updated 2 years ago
Mini-o3 / Mini-o3
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
☆422 · Jan 29, 2026 · Updated 5 months ago
kxfan2002 / SophiaVL-R1
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
☆94 · Aug 8, 2025 · Updated 11 months ago
Qwen-Applications / GD2PO
☆20 · Jun 16, 2026 · Updated last month
segev-shlomov / ST-WebAgentBench
A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios
☆25 · Mar 12, 2026 · Updated 4 months ago
ModalMinds / gym-v
A unified framework for vision-language environments with Gymnasium-compatible interface
☆35 · Mar 17, 2026 · Updated 4 months ago
ModalMinds / MM-EUREKA
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
☆770 · Sep 7, 2025 · Updated 10 months ago
mit-han-lab / vila-u
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
☆425 · Apr 25, 2025 · Updated last year
Bostoncake / C-VisDiT
The official implementation of our ICCV 2023 publication, C-VisDiT
☆10 · Oct 23, 2024 · Updated last year