VoyageWang/VG-Refiner

The repository of VG-Refiner paper

☆20

Alternatives and similar repositories for VG-Refiner

Users that are interested in VG-Refiner are comparing it to the libraries listed below.

VoyageWang / IteRPrimE
The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…
☆20 · Apr 6, 2025 · Updated last year
ZoengHN / Embed-RL
☆46 · Jun 23, 2026 · Updated last month
ChrisDong-THU / GaussianToken
Official PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting
☆108 · Apr 3, 2025 · Updated last year
zhang9302002 / ThinkingWithVideos
The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"
☆102 · Oct 15, 2025 · Updated 9 months ago
Yxxxb / LAVT-RS
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆26 · Jan 21, 2025 · Updated last year
Zhao-Jianing-SUDA / Hawkeye
The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…
☆13 · Oct 14, 2024 · Updated last year
shiyi-zh0408 / Meta-CoT
[CVPR 2026] Official code of the paper "Meta-CoT: Enhancing Granularity and Generalization in Image Editing"
☆79 · May 6, 2026 · Updated 2 months ago
yongliu20 / Awesome-Unified-Understanding-and-Generation
☆52 · Aug 22, 2025 · Updated 11 months ago
spacetools / SpaceTools
code release
☆38 · Jun 22, 2026 · Updated last month
EternalEvan / DPMesh
The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery", CVPR 2024
☆45 · Jun 4, 2024 · Updated 2 years ago
shiyi-zh0408 / NAE_CVPR2024
[CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
☆43 · May 16, 2024 · Updated 2 years ago
ChangyuanWang17 / QVLM
[NeurIPS'24] Efficient and accurate memory saving method towards W4A4 large multi-modal models.
☆102 · Jan 3, 2025 · Updated last year
CodeDance-VL / CodeDance
☆32 · Mar 17, 2026 · Updated 4 months ago
yongliu20 / SCAN
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆77 · Sep 23, 2024 · Updated last year
mbzuai-oryx / Video-R2
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
☆19 · Jan 21, 2026 · Updated 6 months ago
DeepExperience / HyperEyes
HyperEyes is a parallel multimodal search agent that fuses visual grounding and retrieval into a single atomic action, enabling concurren…
☆70 · May 23, 2026 · Updated 2 months ago
SuleBai / SC-CLIP
[TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
☆73 · Mar 27, 2026 · Updated 4 months ago
Dai-Wenxun / MotionLCM
[ECCV 2024] MotionLCM: This repo is the official implementation of "MotionLCM: Real-time Controllable Motion Generation via Latent Cons…
☆462 · Feb 24, 2025 · Updated last year
Jixuan-Fan / Momentum-GS
[ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction
☆173 · Dec 15, 2025 · Updated 7 months ago
HumanMLLM / LOVE-R1
Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"
☆24 · Nov 1, 2025 · Updated 9 months ago
EvolvingLMMs-Lab / ParaVT
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
☆55 · Jun 2, 2026 · Updated last month
Lucanyc / VISTA-Gym
☆27 · Mar 17, 2026 · Updated 4 months ago
weitongseu / PU-Learning
This repo lists some research and applications in PU learning.
☆12 · Mar 12, 2020 · Updated 6 years ago
EternalEvan / FlowIE
[CVPR 2024 oral] This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"
☆153 · Jan 13, 2025 · Updated last year
alibaba / ReWatch-R1
[ICLR 2026] ReWatch-R1: Boosting Complex Video Reasoning in Large Vision-Language Models through Agentic Data Synthesis
☆30 · Mar 27, 2026 · Updated 4 months ago
irfl-dataset / IRFL
IRFL: Image Recognition of Figurative Language
☆12 · Nov 30, 2023 · Updated 2 years ago
RammusLeo / ScoreHOI
Official repository of ScoreHOI (ICCV 2025)
☆16 · Dec 21, 2025 · Updated 7 months ago
shiyi-zh0408 / LOGO
[CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
☆48 · Apr 9, 2024 · Updated 2 years ago
Tianlu-Zhang / Awesome-RGB-Event-Tracking
A curated list of RGB-Event (RGB-E) Tracking papers, datasets, and projects.
☆19 · May 15, 2024 · Updated 2 years ago
RenlyH / CodeV
[CVPR 2026 Oral] Code with Image
☆31 · Dec 5, 2025 · Updated 7 months ago
jokersio-tsy / CroSel
[CVPR 24] This is the official implementation of our paper: "CroSel: Cross Selection of Confident Pseudo Labels for Partial-Label Learning".
☆15 · Apr 27, 2025 · Updated last year
mybabyyh / Preim3D
☆23 · Sep 6, 2023 · Updated 2 years ago
As-Time-Goes-By / OmniSegNet
☆19 · Apr 11, 2026 · Updated 3 months ago
WHB139426 / TAB-Agent
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding
☆26 · Apr 5, 2026 · Updated 3 months ago
WNJXYK / DeCoOp
☆16 · Jun 4, 2024 · Updated 2 years ago
RobertLuo1 / CoHD
The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆27 · Aug 17, 2025 · Updated 11 months ago
XingruiWang / DynSuperCLEVR
A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…
☆20 · Apr 23, 2025 · Updated last year
AMAP-ML / UniVG-R1
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
☆167 · Jun 2, 2025 · Updated last year
ZJU-REAL / SpatialEvo
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments
☆80 · Apr 16, 2026 · Updated 3 months ago