[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).
☆238Jun 8, 2026Updated last week
Alternatives and similar repositories for videomt
Users that are interested in videomt are comparing it to the libraries listed below.
- [CVPR 2026 Oral] "MARCO: Navigating the Unseen Space of Semantic Correspondence"☆139Apr 21, 2026Updated last month
- ☆46May 10, 2025Updated last year
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆75Apr 28, 2026Updated last month
- ☆25Sep 8, 2025Updated 9 months ago
- ☆202Mar 11, 2026Updated 3 months ago
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆76Mar 18, 2026Updated 3 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆77Aug 2, 2025Updated 10 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆75Feb 26, 2026Updated 3 months ago
- [CVPR 2026] Official implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆95Jun 7, 2026Updated last week
- [arXiv 2512.17796] Animate Any Character in Any World☆96Mar 10, 2026Updated 3 months ago
- OmniShotCut is a sensitive and more informative SoTA on Shot Boundary Detection task.☆225Jun 1, 2026Updated 2 weeks ago
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆91Apr 30, 2026Updated last month
- Official repository for ICCV23 paper "Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition"☆24Nov 9, 2023Updated 2 years ago
- Real-Time Physical Action-Conditioned Video Generation☆208Mar 6, 2026Updated 3 months ago
- ☆165Jun 8, 2026Updated last week
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- 1K resolution vision transformers pretrained on 1B human images.☆796May 24, 2026Updated 3 weeks ago
- PISCO: Precise Video Instance Insertion with Sparse Control☆62Feb 13, 2026Updated 4 months ago
- Reflection Removal through Efficient Adaptation of Diffusion Transformers☆128Apr 21, 2026Updated last month
- [CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".☆454Mar 19, 2026Updated 2 months ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆146Feb 24, 2026Updated 3 months ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆211Apr 13, 2026Updated 2 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆178Feb 4, 2026Updated 4 months ago
- ☆73Nov 9, 2025Updated 7 months ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆66Apr 28, 2026Updated last month
- Efficient data storage format optimized for random-access reads, especially in machine learning workflows☆24Feb 25, 2026Updated 3 months ago
- [AAAI 2026] SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries☆57Jan 14, 2026Updated 5 months ago
- ☆37Jul 18, 2025Updated 11 months ago
- ☆11Feb 9, 2024Updated 2 years ago
- One-shot and Few-shot 3D Editing without Per-Scene Optimization☆174Aug 21, 2025Updated 9 months ago
- ☆17Dec 9, 2024Updated last year
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 8 months ago
- Physically-Plausible Image Signal Processing (PPISP) for Radiance Field Reconstruction☆409Jun 11, 2026Updated last week
- ☆112Updated this week
- ☆10Jun 26, 2022Updated 3 years ago
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis)☆119Aug 15, 2025Updated 10 months ago
- [ICCV 2025] DONUT: A Decoder-Only Model for Trajectory Prediction☆49Mar 23, 2026Updated 2 months ago
- [SIGGRAPH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆94Aug 18, 2025Updated 10 months ago
- A box containing all necessary components to play with ultrasound.☆10Nov 8, 2020Updated 5 years ago