[SIGGRAPH2025] Generative Video Matting
☆57Aug 12, 2025Updated 6 months ago
Alternatives and similar repositories for GVM
Users who are interested in GVM are comparing it to the repositories listed below
- ☆14Dec 20, 2022Updated 3 years ago
- ☆11Mar 11, 2025Updated 11 months ago
- Official implementation of [CVPR 2025] RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance☆21Sep 9, 2025Updated 5 months ago
- Flux training codes (lora) for UniTEX☆23Jun 8, 2025Updated 8 months ago
- [ACMMM 2025] Official implementation of the paper "Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image…☆18Jul 29, 2025Updated 7 months ago
- ☆19Jan 23, 2023Updated 3 years ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆18Apr 2, 2025Updated 11 months ago
- ☆17May 14, 2022Updated 3 years ago
- Repository for the Sports Shape and Pose 3D (SSP-3D) dataset.☆45Jul 25, 2024Updated last year
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- GLoSS model of the human skeleton☆26Jun 8, 2022Updated 3 years ago
- Open-Vocabulary Panoptic Segmentation☆27Jun 15, 2025Updated 8 months ago
- [ICCV 2025 Highlight] ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness☆129Sep 26, 2025Updated 5 months ago
- MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics (NeurIPS 2025)☆79Dec 30, 2025Updated 2 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 6 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆25Feb 2, 2025Updated last year
- ☆27Oct 5, 2023Updated 2 years ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Jul 24, 2025Updated 7 months ago
- Synthetic Humans for Action Recognition, IJCV 2021☆72Jun 9, 2021Updated 4 years ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- [ECCV2022] Unstructured Feature Decoupling for Vehicle Re-Identification (UFDN)☆27Oct 13, 2022Updated 3 years ago
- ☆36May 20, 2025Updated 9 months ago
- Official implementation of "Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers" (NeurIPS 2025)☆68Sep 23, 2025Updated 5 months ago
- Codes of CVPR2022 paper: Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction☆32Aug 23, 2022Updated 3 years ago
- ☆72Feb 14, 2023Updated 3 years ago
- [CVPR23] PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation☆39Jul 7, 2023Updated 2 years ago
- A tool that removes objects from video with a simple painting using FGT and Siam Mask models.☆37Jun 22, 2023Updated 2 years ago
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆45Jul 1, 2025Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated last year
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"☆38May 21, 2025Updated 9 months ago
- ☆19Sep 29, 2025Updated 5 months ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- [SIGGRAPH 2025] AutoKeyframe: Autoregressive Keyframe Generation for Human Motion Synthesis and Editing☆49May 9, 2025Updated 9 months ago
- [CVPR 2025] Official code repository for "MaSS13K: A Matting-level Semantic Segmentation Benchmark"☆50Jun 12, 2025Updated 8 months ago
- UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation☆135Nov 19, 2025Updated 3 months ago