pixeli99 / TrackDiffusion
Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
☆76 Updated 7 months ago
Alternatives and similar repositories for TrackDiffusion:
Users interested in TrackDiffusion are comparing it to the repositories listed below
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607) ☆74 Updated last month
- DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention ☆118 Updated 2 months ago
- ReNeg: Learning Negative Embedding with Reward Guidance ☆27 Updated last month
- Official repository of the CVPR 2024 paper "RMem: Restricted Memory Banks Improve Video Object Segmentation" ☆39 Updated 2 weeks ago
- Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation" ☆44 Updated 3 weeks ago
- Liquid: Language Models are Scalable Multi-modal Generators ☆63 Updated 2 months ago
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model ☆20 Updated 3 months ago
- Open implementation of "RandAR" ☆53 Updated last month
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆33 Updated 2 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models ☆188 Updated 3 weeks ago
- Sora Generates Videos with Stunning Geometrical Consistency ☆47 Updated 10 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention ☆106 Updated 7 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction" ☆81 Updated 11 months ago
- ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation ☆73 Updated last year
- [ICCV 2023] Universal Video Segmentation for VSS, VPS and VIS ☆111 Updated 10 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆77 Updated last week
- 「ECCV 2024」 PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation ☆20 Updated 7 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation ☆46 Updated 6 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆94 Updated 10 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation ☆74 Updated 7 months ago
- State-of-the-art open-vocabulary detector on COCO/LVIS/V3Det ☆29 Updated 9 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 7 months ago
- [arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆232 Updated last month
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller ☆33 Updated this week