The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'.
☆20Feb 22, 2023Updated 3 years ago
Alternatives and similar repositories for INO_VOS
Users that are interested in INO_VOS are comparing it to the libraries listed below
Sorting:
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Aug 19, 2023Updated 2 years ago
- Official code for ICCV 2023 paper: "TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering".☆67Jan 11, 2024Updated 2 years ago
- Official implementation of “JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery“☆37Aug 21, 2023Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Sep 5, 2022Updated 3 years ago
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation☆59Jun 21, 2023Updated 2 years ago
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes☆21May 23, 2023Updated 2 years ago
- The official implementation of VidFace☆12Aug 27, 2024Updated last year
- [NeurIPS 23] Official repository for NeurIPS 2023 paper "Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction"☆112Sep 21, 2025Updated 5 months ago
- DMAOT ranked 1st in the VOTS 2023 challenge.☆16Dec 21, 2023Updated 2 years ago
- ☆47Mar 24, 2024Updated last year
- [CVPR2024] CapHuman: Capture Your Moments in Parallel Universes☆100Nov 20, 2024Updated last year
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval.☆134May 4, 2022Updated 3 years ago
- ☆32Mar 1, 2024Updated 2 years ago
- [CVPR2022] Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis☆100Jun 23, 2022Updated 3 years ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Oct 17, 2023Updated 2 years ago
- [TIP 2022] Towards Better Accuracy-efficiency Trade-offs: Divide and Co-training. Plus, an image classification toolbox includes ResNet, …☆107Aug 14, 2022Updated 3 years ago
- The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization☆157Jan 29, 2026Updated last month
- Code for Point-Calibrated Spectral Neural Operators☆20Oct 15, 2024Updated last year
- ☆14Dec 11, 2024Updated last year
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆138Mar 16, 2023Updated 2 years ago
- (AAAI2024) Controllable 3D Face Generation with Conditional Style Code Diffusion☆38Apr 17, 2024Updated last year
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- 🌟 Code for ACL 2023 paper "GloFE: Gloss-Free End-to-End Sign Language Translation" (Oral)☆38Nov 30, 2023Updated 2 years ago
- (VillagerAgent ACL 2024) A Graph based Minecraft multi agents framework☆84Updated this week
- Market-1501 dataset with super-resolution quality☆20May 12, 2022Updated 3 years ago
- 3D Gaussian Splatting☆18Mar 25, 2024Updated last year
- ICLR 2023 - FedFA: Federated Feature Augmentation☆59Mar 28, 2023Updated 2 years ago
- Codes for the EMNLP2021 paper: Benchmarking Commonsense Knowledge Base Population (https://aclanthology.org/2021.emnlp-main.705.pdf). An …☆26Feb 14, 2024Updated 2 years ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆94Apr 27, 2023Updated 2 years ago
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing☆30Mar 29, 2024Updated last year
- A simple, fast, efficient and end-to-end 3D object detector without NMS.☆30Nov 30, 2021Updated 4 years ago
- The official implementation of "Human101: Training 100+FPS Human Gaussians in 100s from 1 View".☆111Dec 27, 2023Updated 2 years ago
- 📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.☆28Jul 2, 2025Updated 8 months ago
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …☆29Feb 23, 2025Updated last year
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.☆85Jun 20, 2025Updated 8 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆123Apr 12, 2024Updated last year
- In our implementation of Qwen-Image-Edit, we employ block causal attention to improve inference speed.☆37Feb 16, 2026Updated 2 weeks ago
- ☆17Aug 1, 2025Updated 7 months ago