[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).
☆201Apr 28, 2026Updated last week
Alternatives and similar repositories for videomt
Users that are interested in videomt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026 Oral] "MARCO: Navigating the Unseen Space of Semantic Correspondence"☆77Apr 21, 2026Updated 2 weeks ago
- ☆44May 10, 2025Updated 11 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆72Apr 28, 2026Updated last week
- OmniShotCut is a sensitive and more informative SoTA on Shot Boundary Detection task.☆114Updated this week
- ☆196Mar 11, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2024] Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations☆24Jan 20, 2025Updated last year
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆68Mar 18, 2026Updated last month
- [CVPR 2026] Offical implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆83Mar 3, 2026Updated 2 months ago
- Animate Any Character in Any World☆97Mar 10, 2026Updated last month
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆87Updated this week
- PISCO: Precise Video Instance Insertion with Sparse Control☆59Feb 13, 2026Updated 2 months ago
- Simple app for getting iPhone camera without controls to be streamed to OBS☆18May 6, 2022Updated 4 years ago
- Official repository for ICCV23 paper "Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition"☆24Nov 9, 2023Updated 2 years ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆145Feb 24, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [CVPR 2025] ZeroMSF: Zero-shot Monocular Scene Flow Estimation in the Wild☆42Sep 16, 2025Updated 7 months ago
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆65Jan 27, 2026Updated 3 months ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆200Apr 13, 2026Updated 3 weeks ago
- ☆69Nov 9, 2025Updated 5 months ago
- [ICLR 26] 1K resolution vision transformers pretrained on 1B human images.☆539Apr 28, 2026Updated last week
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆59Apr 28, 2026Updated last week
- A local-first, high-performance desktop asset manager for AI image generations. Features universal metadata parsing (ComfyUI/A1111), inst…☆79Mar 18, 2026Updated last month
- ☆11Feb 9, 2024Updated 2 years ago
- ☆103Mar 24, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Jun 26, 2022Updated 3 years ago
- [CVPR 2026 Oral] Proxy-GS: Unified Occlusion Priors for Training and Inference in Structured 3D Gaussian Splatting☆89Apr 9, 2026Updated 3 weeks ago
- [ICCV 2025] DONUT: A Decoder-Only Model for Trajectory Prediction☆46Mar 23, 2026Updated last month
- [SIGGRAGH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆90Aug 18, 2025Updated 8 months ago
- [CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".☆436Mar 19, 2026Updated last month
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 3 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆81Mar 27, 2026Updated last month
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆26Feb 11, 2026Updated 2 months ago
- ☆102Apr 15, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆25Nov 13, 2025Updated 5 months ago
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated 3 months ago
- [CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning☆52Mar 26, 2026Updated last month
- Real-Time Physical Action-Conditioned Video Generation☆196Mar 6, 2026Updated 2 months ago
- Official Pytorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"☆355Mar 9, 2026Updated last month
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆180Dec 11, 2025Updated 4 months ago
- $ curl -sL sub.sh | bash☆17Aug 7, 2023Updated 2 years ago