[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).
☆210May 13, 2026Updated last week
Alternatives and similar repositories for videomt
Users that are interested in videomt are comparing it to the libraries listed below.
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆72Apr 28, 2026Updated 3 weeks ago
- ☆25Sep 8, 2025Updated 8 months ago
- ☆200Mar 11, 2026Updated 2 months ago
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆71Mar 18, 2026Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆73Feb 26, 2026Updated 3 months ago
- [CVPR 2026] Official implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆88May 11, 2026Updated 2 weeks ago
- OmniShotCut is a sensitive and more informative SoTA on the Shot Boundary Detection task.☆198May 4, 2026Updated 3 weeks ago
- [arXiv 2025.12] Animate Any Character in Any World☆97Mar 10, 2026Updated 2 months ago
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆92Apr 30, 2026Updated 3 weeks ago
- PISCO: Precise Video Instance Insertion with Sparse Control☆62Feb 13, 2026Updated 3 months ago
- Official repository for ICCV23 paper "Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition"☆24Nov 9, 2023Updated 2 years ago
- ☆151Mar 7, 2026Updated 2 months ago
- 1K resolution vision transformers pretrained on 1B human images.☆717May 15, 2026Updated last week
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆76Aug 2, 2025Updated 9 months ago
- ☆14Mar 11, 2022Updated 4 years ago
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆129Mar 10, 2025Updated last year
- Reflection Removal through Efficient Adaptation of Diffusion Transformers☆126Apr 21, 2026Updated last month
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆145Feb 24, 2026Updated 3 months ago
- [CVPR 2025] ZeroMSF: Zero-shot Monocular Scene Flow Estimation in the Wild☆42Sep 16, 2025Updated 8 months ago
- [AAAI 26 Demo] Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆66Jan 27, 2026Updated 3 months ago
- ☆70Nov 9, 2025Updated 6 months ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆65Apr 28, 2026Updated 3 weeks ago
- A local-first, high-performance desktop asset manager for AI image generations. Features universal metadata parsing (ComfyUI/A1111), inst…☆80Mar 18, 2026Updated 2 months ago
- Efficient data storage format optimized for random-access reads, especially in machine learning workflows☆24Feb 25, 2026Updated 3 months ago
- [AAAI 2026] SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries☆50Jan 14, 2026Updated 4 months ago
- [GCPR 2023] UGainS: Uncertainty Guided Anomaly Instance Segmentation☆16Jul 31, 2024Updated last year
- ☆11Feb 9, 2024Updated 2 years ago
- Physically-Plausible Image Signal Processing (PPISP) for Radiance Field Reconstruction☆395Apr 7, 2026Updated last month
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 7 months ago
- [CVPR 2026 Oral, Award Candidate] Proxy-GS: Unified Occlusion Priors for Training and Inference in Structured 3D Gaussian Splatting☆105Updated this week
- [SIGGRAPH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆93Aug 18, 2025Updated 9 months ago
- [CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".☆447Mar 19, 2026Updated 2 months ago
- A box containing all necessary components to play with ultrasound.☆10Nov 8, 2020Updated 5 years ago
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 3 months ago
- Style transfer using WCT transforms☆19Nov 5, 2019Updated 6 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆27Feb 11, 2026Updated 3 months ago
- 🐼 open source agent engineering platform: traces, evals, and metrics to debug and improve your AI agents. Integrates with LangGraph, Cre…☆152Updated this week
- Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025☆25Nov 13, 2025Updated 6 months ago
- ☆11Jun 17, 2025Updated 11 months ago