NVlabs/VideoITG

[CVPR 2026 Highlight] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding

☆126

Alternatives and similar repositories for VideoITG

Users who are interested in VideoITG are comparing it to the repositories listed below.

ZhuWenjie98 / DDE
(ECCV2026) Dual Distribution Estimation for Zero-shot Noisy Test-Time Adaptation with VLMs
☆15 · Jul 2, 2026 · Updated 3 weeks ago
csslc / Self-Transcendence
[ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…
☆37 · Jul 3, 2026 · Updated 2 weeks ago
iGuoYanjun / Memorize-When-Needed
☆23 · Jun 29, 2026 · Updated 3 weeks ago
PolyU-VCLab / DepthMaster
DepthMaster: Unified Monocular Depth Estimation for Perspective and Panoramic Images
☆25 · Jun 13, 2026 · Updated last month
langmanbusi / CoCoEdit
[ICML 2026] Official PyTorch implementation of paper “CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Lea…
☆26 · Jun 14, 2026 · Updated last month
Liangsanzhu / Photo3D
Photo3D: Advancing Photorealistic 3D Generation through Structure‑Aligned Detail Enhancement
☆22 · Mar 18, 2026 · Updated 4 months ago
lslrh / SyncNoise
SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing
☆19 · Dec 28, 2024 · Updated last year
YBZh / LAPT
ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models
☆18 · Aug 9, 2024 · Updated last year
MinghanLi / FiVE-Bench
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
☆38 · Apr 2, 2026 · Updated 3 months ago
xiechenxi99 / DNAEdit_code
[NeurIPS 2025 Spotlight] Official implementation for DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing
☆32 · Jan 23, 2026 · Updated 5 months ago
Wang-pengfei / GGSD
Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024
☆31 · Jul 19, 2024 · Updated 2 years ago
Joyies / GDPO
Official code for GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution
☆66 · Jun 2, 2026 · Updated last month
mt-cly / ViP3DEdit
[AAAI26] ViP3DE: Fast Multi-view Consistent 3D Editing with Video Priors
☆22 · Mar 5, 2026 · Updated 4 months ago
leeruibin / MfM
[ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks
☆32 · Feb 5, 2026 · Updated 5 months ago
Wang-pengfei / One2Scene
[ICLR 2026] - One2Scene
☆49 · May 25, 2026 · Updated last month
shuaizhengliu / InstructRestore
[NeurIPS'25] InstructRestore: Region-Customized Image Restoration with Human Instructions
☆53 · Oct 23, 2025 · Updated 9 months ago
zjr2000 / SPES
Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"
☆23 · May 8, 2026 · Updated 2 months ago
Joyies / TVT
[ICCV2025] Official code for Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training
☆127 · Jan 6, 2026 · Updated 6 months ago
langmanbusi / InsViE
Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”
☆34 · Apr 3, 2026 · Updated 3 months ago
skyhehe123 / spconv
☆12 · Jul 18, 2024 · Updated 2 years ago
gwenzhang / GGA
[ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…
☆36 · Jul 26, 2024 · Updated last year
PolyU-VCLab / WRC
Weighted Reverse Convolution for Feature Upsampling
☆24 · May 24, 2026 · Updated last month
ChrisDud0257 / SSL
Official code for our Paper "SSL: A Self-similarity Loss for Improving Generative Image Super-resolution" in ACMMM 2024
☆51 · Jun 6, 2026 · Updated last month
leeruibin / hybrid-forcing
☆32 · Apr 29, 2026 · Updated 2 months ago
MinghanLi / MDQE_CVPR2023
Code release for "MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos" (CVPR2023)
☆15 · Dec 14, 2023 · Updated 2 years ago
ChrisDud0257 / AFINE
Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"
☆68 · Sep 15, 2025 · Updated 10 months ago
theEricMa / ScaleDreamer
[ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
☆53 · Mar 28, 2025 · Updated last year
PolyU-VCLab / GGT-100K
GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration
☆66 · Jun 1, 2026 · Updated last month
SHI-Labs / Slow-Fast-Video-Multimodal-LLM
☆29 · Apr 8, 2025 · Updated last year
cswry / DP2O-SR
[NeurIPS 2025] DP²O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution
☆83 · Dec 20, 2025 · Updated 7 months ago
gwenzhang / BEVDilation
[AAAI'26] BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection
☆42 · Dec 3, 2025 · Updated 7 months ago
Multimedia-Analytics-Laboratory / dpdmd
[ICML 2026] The official code of Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
☆87 · Jun 2, 2026 · Updated last month
lslrh / DynaMask
Official PyTorch implementation of DynaMask: Dynamic Mask Selection for Instance Segmentation (CVPR 2023)
☆11 · Feb 28, 2024 · Updated 2 years ago
Xiangtaokong / NSARM
NSARM: Next-Scale Autoregressive Modeling for Robust Real-World Image Super-Resolution
☆27 · Oct 17, 2025 · Updated 9 months ago
ShawnChenn / FlexibleReflectionRemoval
[AAAI'25] Flexible Image Reflection Removal with Sparse Human Guidance
☆12 · Jul 7, 2025 · Updated last year
mt-cly / FPR
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)
☆24 · Sep 24, 2023 · Updated 2 years ago
skyhehe123 / ScatterFormer
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)
☆80 · May 20, 2025 · Updated last year
SupstarZh / VividDreamer
[ECCV2024] VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
☆10 · Jul 4, 2024 · Updated 2 years ago
A113N-W3I / MICo-150K
Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".
☆111 · Apr 21, 2026 · Updated 3 months ago