pixeli99 / TrackDiffusion
[WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
☆77Updated 10 months ago
Alternatives and similar repositories for TrackDiffusion:
Users that are interested in TrackDiffusion are comparing it to the libraries listed below
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆85Updated 3 months ago
- ReNeg: Learning Negative Embedding with Reward Guidance☆31Updated 4 months ago
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model☆20Updated 6 months ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆163Updated 2 months ago
- ☆39Updated last year
- ☆57Updated last month
- ☆41Updated 7 months ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆15Updated this week
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆29Updated last year
- ☆117Updated 10 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆49Updated last year
- ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation☆76Updated last year
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆40Updated 3 weeks ago
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode…☆13Updated 2 months ago
- ☆33Updated 6 months ago
- Curated list of recent visual autoregressive (VAR) modeling works☆30Updated last month
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆76Updated 2 weeks ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆10Updated 10 months ago
- ☆58Updated last year
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆110Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆74Updated 5 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆75Updated last year
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆96Updated last year
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆77Updated 3 weeks ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆86Updated 2 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".☆20Updated last month
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆32Updated last month
- [ICLR'25] Reconstructive Visual Instruction Tuning☆81Updated 3 weeks ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆42Updated 3 months ago
- ☆26Updated last month