PRIS-CV / GRPO-for-LlavaView external linksLinks
GRPO Algorithm for Llava Architecture (Based on Verl)
☆47May 9, 2025Updated 9 months ago
Alternatives and similar repositories for GRPO-for-Llava
Users that are interested in GRPO-for-Llava are comparing it to the libraries listed below
Sorting:
- ☆20Jun 13, 2025Updated 8 months ago
- FakeReasoning: Towards Generalizable Forgery Detection and Reasoning.☆14Aug 28, 2025Updated 5 months ago
- ☆18Apr 20, 2025Updated 9 months ago
- ☆26Jul 14, 2025Updated 7 months ago
- The code implementation of the <Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning> in The Co…☆14May 25, 2023Updated 2 years ago
- [ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllabili…☆20Sep 6, 2025Updated 5 months ago
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].☆16Apr 13, 2023Updated 2 years ago
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆40Jan 28, 2026Updated 2 weeks ago
- Official PyTorch implementation of "Hyperbolic VAE via Latent Gaussian Distributions"☆23Oct 26, 2023Updated 2 years ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆40Jul 18, 2025Updated 6 months ago
- Graph Debiased Contrastive Learning with Joint Representation Clustering☆25May 10, 2023Updated 2 years ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆33Jul 12, 2023Updated 2 years ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆78Sep 19, 2025Updated 4 months ago
- Geometric Adversarial Attacks and Defenses on 3D Point Clouds (3DV 2021)☆27Jun 25, 2023Updated 2 years ago
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…☆12Apr 7, 2025Updated 10 months ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection☆12Feb 6, 2024Updated 2 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆35Dec 21, 2023Updated 2 years ago
- 字节跳动瓜最终真实情况,用事实说话,正义会迟到但不会缺席!☆23Oct 18, 2024Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆36Apr 21, 2024Updated last year
- NegCLIP.☆38Feb 6, 2023Updated 3 years ago
- Implementation of the CVPR2025 paper LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty.☆16Sep 10, 2025Updated 5 months ago
- [NeurIPS 2023] Official Implementation of A Generic Active Learning Baseline for LiDAR Semantic Segmentation☆32Apr 26, 2024Updated last year
- 松灵Piper机械臂适配新版Lerobot☆20Jul 22, 2025Updated 6 months ago
- For paper《Gaussian Transformer: A Lightweight Approach for Natural Language Inference》☆28Feb 23, 2020Updated 5 years ago
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆50Mar 13, 2025Updated 11 months ago
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆40Jul 29, 2023Updated 2 years ago
- ☆32Sep 24, 2023Updated 2 years ago
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification☆10May 30, 2024Updated last year
- Proof-carrying code completions in Dafny☆11Apr 4, 2025Updated 10 months ago
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation☆42Dec 15, 2024Updated last year
- Documentation at☆14Mar 27, 2025Updated 10 months ago
- ☆11Mar 31, 2022Updated 3 years ago
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆19Jan 24, 2026Updated 3 weeks ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- The homework of robos learning base.☆11May 23, 2023Updated 2 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆89Sep 30, 2021Updated 4 years ago
- Visual self-questioning for large vision-language assistant.☆45Jul 23, 2025Updated 6 months ago
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year