☆94Feb 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for VideoITG
Users that are interested in VideoITG are comparing it to the libraries listed below
Sorting:
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- Code release for "MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos"(CVPR2023)☆14Dec 14, 2023Updated 2 years ago
- [NeurlPS' 25] InstructRestore: Region-Customized Image Restoration with Human Instructions☆49Oct 23, 2025Updated 4 months ago
- Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis (CVPR 2023)☆18Dec 13, 2024Updated last year
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)☆24Sep 24, 2023Updated 2 years ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Jul 28, 2025Updated 7 months ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Jul 19, 2024Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Nov 25, 2024Updated last year
- Official code for our Paper "SSL: A Self-similarity Loss for Improving Generative Image Super-resolution" in ACMMM 2024☆50Jun 1, 2025Updated 9 months ago
- Official PyTorch Code for our ICCV25 paper- Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution☆85Aug 6, 2025Updated 6 months ago
- [ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation☆53Mar 28, 2025Updated 11 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆82May 20, 2025Updated 9 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆31Jul 18, 2024Updated last year
- The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"☆49Sep 28, 2025Updated 5 months ago
- Toward Generalizing Visual Brain Decoding to Unseen Subjects☆28May 14, 2025Updated 9 months ago
- ECCV24, NeurIPS24, Benchmarking Generalized Out-of-Distribution Detection with Vision-Language Models☆29Jan 25, 2026Updated last month
- Official pytorch implementation of DynaMask: Dynamic Mask Selection for Instance Segmentation (CVPR 2023)☆11Feb 28, 2024Updated 2 years ago
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"☆29Jul 7, 2025Updated 7 months ago
- Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"☆66Sep 15, 2025Updated 5 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- Project page of "GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors"☆23Jul 1, 2024Updated last year
- Code release for "BoxVIS: Video Instance Segmentation with Box Annotation"☆12Dec 22, 2023Updated 2 years ago
- ☆82Oct 13, 2025Updated 4 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆91Jul 4, 2024Updated last year
- Online Detection of Action Start in Untrimmed, Streaming Videos☆12Sep 1, 2018Updated 7 years ago
- [CVPR2025] Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data☆67Apr 24, 2025Updated 10 months ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- Official pytorch implementation of SIM: Semantic-aware Instance Mask Generation for Box-Supervised Instance Segmentation (CVPR 2023)☆38Jul 31, 2023Updated 2 years ago
- [CVPR 2025] Official code repository for "MaSS13K: A Matting-level Semantic Segmentation Benchmark"☆50Jun 12, 2025Updated 8 months ago
- [ICLR 2026] FOCUS: Efficient Keyframe Selection for Long Video Understanding☆40Feb 3, 2026Updated 3 weeks ago
- Code and Data for Real-time Human-Centric Segmentation for Complex Video Scenes☆17Feb 8, 2024Updated 2 years ago
- A curated list of resources for video super-resolution using diffusion models.☆181Updated this week
- Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".☆54Feb 21, 2026Updated last week
- The official codes of our CVPR-2023 paper: Sharpness-Aware Gradient Matching for Domain Generalization☆79May 31, 2023Updated 2 years ago
- Quick Long Video Understanding [TMLR2025]☆76Oct 27, 2025Updated 4 months ago
- [IROS 2024] HPHS: Hierarchical Planning based on Hybrid Frontier Sampling for Unknown Environments Exploration☆82Apr 28, 2025Updated 10 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆36Nov 21, 2025Updated 3 months ago
- DefFiller Mask-conditioned Generation with Diffusion Prior for Saliency-based Steel Surface Defect Detection☆30Apr 28, 2025Updated 10 months ago
- spatio-temporal tasks☆16Jul 15, 2024Updated last year