zeyuanyin / SATA_fork
This is a fork of the SATA repo (CVPR 2025), which is invisible.
☆23 · Updated 6 months ago
Alternatives and similar repositories for SATA_fork
Users who are interested in SATA_fork are comparing it to the repositories listed below.
- ☆257 · Updated 2 years ago
- Official PyTorch Code for Anchor Token Guided Prompt Learning Methods: [ICCV 2025] ATPrompt and [Arxiv 2511.21188] AnchorOPT ☆121 · Updated last month
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction" ☆55 · Updated 7 months ago
- Official repository of MLLA (NeurIPS 2024) ☆370 · Updated 6 months ago
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks". ☆391 · Updated 7 months ago
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion. ☆51 · Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention ☆115 · Updated last year
- Unified the Anonymous and Camera Ready Version, hope everyone can get an ACCEPT ☆251 · Updated 6 months ago
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking ☆115 · Updated 8 months ago
- [NeurIPS 2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model ☆105 · Updated last year
- [CVPR 2024] iKUN: Speak to Trackers without Retraining ☆143 · Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers". ☆40 · Updated 6 months ago
- A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024) ☆344 · Updated 10 months ago
- Code of paper 'Stochastic Layer-Wise Shuffle for Improving Vision Mamba Training' ☆21 · Updated 7 months ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling" ☆23 · Updated 11 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation ☆147 · Updated last year
- Code for the paper "Conditional Representation Learning for Customized Tasks" (NeurIPS 2025 Spotlight) ☆42 · Updated 3 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter" ☆53 · Updated 4 months ago
- ☆83 · Updated 9 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis ☆229 · Updated last year
- DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data ☆38 · Updated last month
- (ICCV 2025) This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detect… ☆158 · Updated 6 months ago
- ☆694 · Updated 2 months ago
- Official repository of FLatten Transformer (ICCV2023) ☆446 · Updated last year
- A Comprehensive Survey on Knowledge Distillation ☆60 · Updated last month
- ☆138 · Updated last year
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models. ☆347 · Updated 3 weeks ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts" ☆88 · Updated 7 months ago
- vHeat: Building Vision Models upon Heat Conduction ☆269 · Updated 7 months ago
- A Fine-grained Benchmark for Video Captioning and Retrieval ☆25 · Updated 6 months ago