PRIS-CV/GRPO-for-Llava

GRPO Algorithm for Llava Architecture (Based on Verl)

☆49

Alternatives and similar repositories for GRPO-for-Llava

Users who are interested in GRPO-for-Llava are comparing it to the repositories listed below.

PRIS-CV / FakeReasoning
[TIP 2026] Toward Generalizable Forgery Detection and Reasoning.
☆22 · Apr 20, 2026 · Updated 2 months ago
vimar-gu / SSD
[AAAI2024] Summarizing Stream Data for Memory-Restricted Online Continual Learning
☆21 · Apr 30, 2024 · Updated 2 years ago
hyungjin-chung / VPS
☆16 · Sep 11, 2025 · Updated 9 months ago
tpoisonooo / open-r1
Fully open reproduction of DeepSeek-R1
☆11 · Mar 24, 2025 · Updated last year
PRIS-CV / EAFT
EAFT (Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo
☆106 · Jan 15, 2026 · Updated 5 months ago
RUC-NLPIR / EnvScaler
The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
☆170 · Feb 12, 2026 · Updated 4 months ago
HKUST-LongGroup / Relation-R1
[AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension
☆20 · Mar 6, 2026 · Updated 4 months ago
zjr2000 / REVERIE
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
☆20 · Jul 17, 2024 · Updated last year
showlab / Edit2Perceive
[CVPR 2026] Official Implementation of Edit2Perceive
☆45 · Feb 21, 2026 · Updated 4 months ago
QuentinFitteRey / VLMSAM
Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grai…
☆31 · Jun 4, 2025 · Updated last year
andrearosasco / DistilledReplay
Code for the publication "Distilled Replay: Overcoming Forgetting through Synthetic Examples"
☆12 · Apr 1, 2021 · Updated 5 years ago
kim-sanghwan / ANCL
The code implementation of the <Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning> in The Co…
☆14 · May 25, 2023 · Updated 3 years ago
pranoyr / scene-graph-vit
Implementation of the Paper Scene-Graph ViT
☆10 · Dec 20, 2024 · Updated last year
Yifanfanfanfan / Reverse-Engineering-of-Imperceptible-Adversarial-Image-Perturbations
☆11 · Mar 31, 2022 · Updated 4 years ago
LeonDiao0427 / SEAS
We release our code and data for SEAS in this repository.
☆21 · Dec 23, 2024 · Updated last year
guanwei49 / EMIT
EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO
☆26 · Jan 24, 2026 · Updated 5 months ago
innovator-zero / SAK
[ICLR2025] Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
☆14 · Apr 8, 2025 · Updated last year
HotTricker / TransitLM
TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation
☆126 · May 30, 2026 · Updated last month
sosppxo / MDIN
[MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation
☆43 · Dec 15, 2024 · Updated last year
Mishne-Lab / IGNR
Implementation of Implicit Graphon Neural Representation
☆14 · Sep 1, 2023 · Updated 2 years ago
Simon98-AI / Vedas
☆56 · May 13, 2026 · Updated last month
DavisPL / PCCC
Proof-carrying code completions in Dafny
☆11 · Apr 4, 2025 · Updated last year
kkaiwwana / MVPbev
[ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllabili…
☆20 · Sep 6, 2025 · Updated 10 months ago
zhjohnchan / SK-VG
[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.
☆34 · Jul 12, 2023 · Updated 2 years ago
kaviezhang / MeshMamba
☆19 · Oct 23, 2025 · Updated 8 months ago
Ufarufa / dl_and_rl_resource
Deep Learning And Reinforcement Learning
☆11 · Mar 28, 2017 · Updated 9 years ago
JiangpengHe / CL-LoRA
☆38 · Jul 14, 2025 · Updated 11 months ago
YuxiXie / V-DPO
Preference Learning for LLaVA
☆59 · Nov 9, 2024 · Updated last year
aba122 / Q-Hawkeye
☆61 · Feb 9, 2026 · Updated 5 months ago
ml-postech / GM-VAE
Official PyTorch implementation of "Hyperbolic VAE via Latent Gaussian Distributions"
☆25 · Oct 26, 2023 · Updated 2 years ago
ml-postech / reverse-gnn
☆11 · Jun 14, 2024 · Updated 2 years ago
minglllli / CLS-RL
[NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning
☆88 · Sep 19, 2025 · Updated 9 months ago
DoHunLee1 / VideoGuide
[CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide"
☆29 · May 27, 2025 · Updated last year
liaoq / pnas2019
☆11 · Nov 27, 2019 · Updated 6 years ago
kid-yang233 / robots
The homework of robos learning base.
☆11 · May 23, 2023 · Updated 3 years ago
hzhao98 / GDCL
Graph Debiased Contrastive Learning with Joint Representation Clustering
☆26 · May 10, 2023 · Updated 3 years ago
ml-postech / SpReME
☆11 · Mar 15, 2023 · Updated 3 years ago
ml-postech / SSAD
☆12 · Feb 26, 2024 · Updated 2 years ago
medunigraz / pyCEPS
pyCEPS provides an interface to import, visualize and translate clinical mapping data
☆14 · Nov 25, 2025 · Updated 7 months ago