iva-mzsun / MOSOLinks

☆35

Alternatives and similar repositories for MOSO

Users that are interested in MOSO are comparing it to the libraries listed below

Sorting:

nku-zhichengzhang / ExtDM
[CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"
☆54Updated 5 months ago
Kay1794 / MMVP-motion-matrix-based-video-prediction
This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023)
☆42Updated 2 years ago
exisas / LGC-VD
☆38Updated last year
MCG-NJU / TemporalPerceiver
[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
☆37Updated 2 years ago
buxiangzhiren / VD-IT
Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024
☆45Updated last year
XiYe20 / NPVP
[CVPRW'23] "A unified model for continuous conditional video prediction". Xi Ye, Guillaume-Alexandre Bilodeau.
☆14Updated last year
FuchenUSTC / DTF
☆16Updated 3 years ago
tsujuifu / pytorch_tvc
A PyTorch implementation of TVC
☆24Updated 2 years ago
Adonis-galaxy / DepthCLIP
Official implementation of "Can Language Understand Depth?"
☆83Updated 3 years ago
guikunchen / FEC
[CVPR'24] Neural Clustering based Visual Representation Learning
☆45Updated 2 months ago
eccv24 / paper-template
ECCV 2024 paper template
☆56Updated last year
pixeli99 / TrackDiffusion
[WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
☆79Updated last year
hotfinda / VideoMambaPro
Improving Mamaba performance on Video Understanding task
☆39Updated last year
NiFangBaAGe / DATTT
[CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
☆28Updated 7 months ago
yuzhms / Streaming-Video-Model
[CVPR2023] Code for "Streaming Video Model"
☆79Updated 2 years ago
shvdiwnkozbw / SSL-UVOS
[ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
☆34Updated 9 months ago
renwang435 / video-ttt-release
Test-Time Training on Video Streams
☆65Updated 2 years ago
neu-vi / SportsSloMo
SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876…
☆76Updated last year
showlab / sparseformer
(ICLR 2024, CVPR 2024) SparseFormer
☆75Updated last year
OpenGVLab / MUTR
「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation
☆82Updated 6 months ago
mt-cly / SimCMF
SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality
☆35Updated last year
MCG-NJU / ViT-TAD
[CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
☆12Updated last year
MKFMIKU / vidm
[AAAI23 Oral] Official implementations of Video Implicit Diffusion Models
☆68Updated 2 years ago
park-jungin / DualPath
☆49Updated 3 years ago
Ingrid725 / LaPE
☆19Updated last year
x360dataset / x360dataset-kit
☆32Updated 5 months ago
aim-uofa / GenDeF
☆39Updated 2 years ago
jbistanbul / MiniROAD
☆40Updated last year
Yui010206 / VEGGIE-VidEdit
[ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
☆28Updated 4 months ago
flyinglynx / CapeFormer
Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023.
☆54Updated 2 years ago