pixeli99 / TrackDiffusionLinks
[WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
☆80Updated last year
Alternatives and similar repositories for TrackDiffusion
Users that are interested in TrackDiffusion are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆90Updated 3 weeks ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆66Updated last month
- ReNeg: Learning Negative Embedding with Reward Guidance☆33Updated 7 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆113Updated 2 weeks ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆169Updated 5 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆30Updated 2 months ago
- ☆39Updated last year
- ☆44Updated 10 months ago
- A list of works on video generation towards world model☆161Updated last week
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆110Updated last year
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆20Updated last week
- ☆83Updated last year
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"☆96Updated last month
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model☆21Updated 9 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆35Updated 5 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆52Updated 4 months ago
- [CVPR2025] Official PyTorch implementation of "Optical-Flow Guided Prompt Optimization for Coherent Video Generation (Motion Prompt)"☆22Updated 4 months ago
- ☆124Updated last year
- Fast and general video object segmentation evaluation.☆33Updated last year
- ☆188Updated 2 months ago
- Curated list of recent visual autoregressive (VAR) modeling works☆29Updated 4 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆76Updated 5 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆121Updated 3 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆21Updated 3 months ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆94Updated 2 months ago
- [CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆34Updated 4 months ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆101Updated 4 months ago
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆78Updated last year
- DVIS: Decoupled Video Instance Segmentation Framework☆152Updated last year
- ☆58Updated 2 years ago