zli12321/Vision-SR1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zli12321/Vision-SR1)

zli12321 / Vision-SR1

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

☆174

Alternatives and similar repositories for Vision-SR1

Users that are interested in Vision-SR1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zli12321 / VideoHallu
View on GitHub
Synthetic Video hallucination and Mitigation
☆23Sep 21, 2025Updated 9 months ago
zli12321 / LHTB
View on GitHub
Long Horizon Terminal Benchmark with Dense Reward Grading
☆48Updated this week
zli12321 / MM-Zero
View on GitHub
Self-evolving vision language models from zero data
☆77Mar 14, 2026Updated 4 months ago
Hongyang-Du / VideoGPA
View on GitHub
[ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.
☆69Jun 6, 2026Updated last month
si0wang / ViCrit
View on GitHub
☆24Jun 18, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zli12321 / free-form-grpo
View on GitHub
grpo to train long form QA and instructions with long-form reward model
☆17Jul 17, 2025Updated 11 months ago
Pay20Y / PIMNet
View on GitHub
☆16Jan 30, 2022Updated 4 years ago
wuxiyang1996 / COS-PLAY
View on GitHub
COS-PLAY: Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Game Play
☆28Updated this week
sailing-lab / sr2am
View on GitHub
SR²AM: Efficient Agentic Reasoning Through Self-Regulated Simulative Planning
☆21May 22, 2026Updated last month
Koreyoshi01 / VISD
View on GitHub
This repository is the official implementation for VISD.
☆20May 17, 2026Updated last month
showlab / AUI
View on GitHub
Computer-Use Agents as Judges for Generative UI
☆44Nov 27, 2025Updated 7 months ago
zai-org / UI2Code_N
View on GitHub
☆77May 2, 2026Updated 2 months ago
yunlong10 / CAT-V
View on GitHub
[AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…
☆67Jan 27, 2026Updated 5 months ago
meituan / MemOCR
View on GitHub
MemOCR: an OCR-driven visual memory agent.
☆33May 17, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
adithya-s-k / MoLE
View on GitHub
Mixture of Lora Experts
☆11Apr 7, 2024Updated 2 years ago
HongbangYuan / OmniReward
View on GitHub
☆47Dec 16, 2025Updated 6 months ago
snumprlab / isr-dpo
View on GitHub
Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)
☆23Nov 25, 2025Updated 7 months ago
zjr2000 / REVERIE
View on GitHub
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
☆20Jul 17, 2024Updated last year
hqhQAQ / Hint-GRPO
View on GitHub
[ICCV 2025] Boosting MLLM Reasoning with Text-Debiased Hint-GRPO
☆48Jul 1, 2025Updated last year
qiujihao19 / Artemis
View on GitHub
[NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos
☆27Apr 8, 2025Updated last year
ICYPOLE / Fudan-Course-Search
View on GitHub
复旦研究生抢课脚本
☆10Feb 14, 2022Updated 4 years ago
agents-x-project / PyVision-RL
View on GitHub
[ICML 2026] Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."
☆69Feb 25, 2026Updated 4 months ago
NLie2 / what_features_jailbreak_LLMs
View on GitHub
☆18Mar 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
allenai / understanding_mcqa
View on GitHub
Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"
☆15Aug 2, 2025Updated 11 months ago
dongwonjo / FastKV
View on GitHub
[ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…
☆32Apr 14, 2026Updated 3 months ago
inclusionAI / M2-Reasoning
View on GitHub
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
☆47Jul 17, 2025Updated 11 months ago
Pay20Y / GCAN
View on GitHub
☆26Feb 2, 2023Updated 3 years ago
diaoquesang / Code-in-Paper-Guide
View on GitHub
🌟 手把手教你在论文中插入代码链接
☆25Aug 2, 2025Updated 11 months ago
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago
JingMog / THOR
View on GitHub
[ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".
☆33Feb 26, 2026Updated 4 months ago
lyan62 / vlm-info-loss
View on GitHub
☆22Sep 16, 2025Updated 9 months ago
mlvlab / VidChain
View on GitHub
Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…
☆25Jan 26, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ncTimTang / AKS
View on GitHub
[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding
☆227Dec 19, 2025Updated 6 months ago
hwanyu112 / VIBE-Benchmark
View on GitHub
☆27Feb 3, 2026Updated 5 months ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
View on GitHub
[ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
☆160Jun 8, 2026Updated last month
wuxiyang1996 / Heterogeneous_Highway_Env
View on GitHub
Heterogeneous Multi-agent Version of Highway-env
☆18Jun 28, 2023Updated 3 years ago
si0wang / ThinkLite-VL
View on GitHub
☆105Jun 10, 2025Updated last year
tanganke / weight-ensembling_MoE
View on GitHub
Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"
☆32Jun 7, 2024Updated 2 years ago
yule-BUAA / MergeLLM
View on GitHub
Codes for Merging Large Language Models
☆37Aug 7, 2024Updated last year