pansanity666 / INO_VOSLinks
The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'.
☆20Updated 2 years ago
Alternatives and similar repositories for INO_VOS
Users that are interested in INO_VOS are comparing it to the libraries listed below
Sorting:
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated 2 years ago
- ICCV2023-Diffusion-Papers☆108Updated 2 years ago
- Official code for ICCV 2023 paper: "TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering".☆67Updated last year
- Open-vocabulary Object Segmentation with Diffusion Models☆181Updated 2 years ago
- ☆29Updated 7 months ago
- [CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos☆24Updated 2 years ago
- The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization☆54Updated 2 weeks ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year
- ☆39Updated last year
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆53Updated 8 months ago
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation☆58Updated 2 years ago
- Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery“☆37Updated 2 years ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆96Updated last year
- Code release for LayoutDiffuse☆57Updated 2 years ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 2 years ago
- ☆31Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 3 years ago
- Unofficial implementation of DragDiffusion☆37Updated 2 years ago
- Official implementation for "Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training" https://arxiv.org/abs/…☆72Updated 10 months ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆27Updated last year
- ☆58Updated 2 years ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆69Updated last year
- A PyTorch implementation of TVC☆24Updated last year
- [WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆80Updated last year
- ☆79Updated 5 months ago
- Curated list of recent visual autoregressive (VAR) modeling works☆30Updated 7 months ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆73Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆135Updated last year
- Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight)☆43Updated 2 years ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆77Updated 2 years ago