pansanity666 / INO_VOSLinks
The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'.
☆20Updated 2 years ago
Alternatives and similar repositories for INO_VOS
Users that are interested in INO_VOS are comparing it to the libraries listed below
Sorting:
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated 2 years ago
- ☆31Updated 10 months ago
- ICCV2023-Diffusion-Papers☆108Updated 2 years ago
- [CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos☆24Updated 2 years ago
- ☆39Updated 2 years ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆23Updated last year
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆44Updated 3 weeks ago
- Official code for ICCV 2023 paper: "TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering".☆67Updated 2 years ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆28Updated 2 years ago
- Official implementation for "Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training" https://arxiv.org/abs/…☆74Updated last year
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation☆59Updated 2 years ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 3 years ago
- Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery“☆37Updated 2 years ago
- ☆32Updated last year
- ☆58Updated 2 years ago
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆57Updated 11 months ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 3 years ago
- Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight)☆43Updated 2 years ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Updated 2 months ago
- Open-vocabulary Object Segmentation with Diffusion Models☆182Updated 2 years ago
- Code release for LayoutDiffuse☆57Updated 2 years ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Updated last year
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Updated last year
- Unified layout planning and image generation, ICCV2025☆40Updated last week
- ☆82Updated 8 months ago
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆66Updated 2 years ago
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆17Updated 3 years ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆69Updated last year
- Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023.☆54Updated 2 years ago
- [ICCV 2021] Crossover Learning for Fast Online Video Instance Segmentation☆85Updated 3 years ago