pixeli99 / TrackDiffusionLinks
[WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
☆80Updated last year
Alternatives and similar repositories for TrackDiffusion
Users that are interested in TrackDiffusion are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆90Updated 6 months ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆65Updated 3 weeks ago
- ReNeg: Learning Negative Embedding with Reward Guidance☆33Updated 6 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆30Updated last month
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆168Updated 4 months ago
- ☆44Updated 9 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".☆44Updated 3 weeks ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆89Updated last month
- A list of works on video generation towards world model☆157Updated this week
- Curated list of recent visual autoregressive (VAR) modeling works☆29Updated 4 months ago
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"☆68Updated 2 weeks ago
- ☆135Updated 2 weeks ago
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model☆21Updated 8 months ago
- ☆39Updated last year
- Fast and general video object segmentation evaluation.☆32Updated last year
- [CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆32Updated 3 months ago
- ☆58Updated last year
- [CVPR2025] Official PyTorch implementation of "Optical-Flow Guided Prompt Optimization for Coherent Video Generation (Motion Prompt)"☆21Updated 4 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆141Updated last month
- ☆120Updated last year
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆110Updated last year
- [ICLR'25] Reconstructive Visual Instruction Tuning☆97Updated 3 months ago
- ☆37Updated last month
- ☆29Updated last week
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆139Updated last week
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆17Updated 2 months ago
- ☆33Updated last week
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆94Updated last year
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆76Updated 4 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆178Updated this week