MCG-NJU / SPLAM
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
☆18Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for SPLAM
- Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆66Updated 4 months ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆16Updated 3 months ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆20Updated last year
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆32Updated 2 years ago
- ☆57Updated last year
- Sora Generates Videos with Stunning Geometrical Consistency☆47Updated 8 months ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆65Updated 3 weeks ago
- ☆38Updated 11 months ago
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆110Updated 8 months ago
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆25Updated last month
- [CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos☆11Updated 6 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆45Updated 4 months ago
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆69Updated 4 months ago
- [ECCV 2024] PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation☆18Updated 4 months ago
- Official repository of paper: "FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation"☆24Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆63Updated 2 weeks ago
- PyTorch implementation of ARKitTrack for CVPR'2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Hao…☆44Updated last year
- Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023.☆49Updated last year
- Official implementation of "Can Language Understand Depth?"☆77Updated 2 years ago
- ☆58Updated last year
- ☆58Updated 4 months ago
- Official implementation for the CVPR 2024 paper CAMEL☆15Updated 5 months ago
- state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆25Updated 7 months ago
- [ECCV 2022] 🎵PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation☆57Updated last year
- SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality☆22Updated 2 months ago
- ☆39Updated 5 months ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Updated last year
- [BMVC 2024] Official implementation of Align-DETR☆49Updated 4 months ago