Devin100086 / 2025-ZJUSE-SummerCamp-AI-Camp
本项目主要是2025届浙江大学软件学院夏令营(AI营)的考核项目
☆10Updated last month
Alternatives and similar repositories for 2025-ZJUSE-SummerCamp-AI-Camp:
Users that are interested in 2025-ZJUSE-SummerCamp-AI-Camp are comparing it to the libraries listed below
- A collection of vision foundation models unifying understanding and generation.☆51Updated 3 months ago
- This is a repository to collect training-free algorithms for visual generation and manipulation☆32Updated this week
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆80Updated 2 weeks ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆118Updated last month
- Official implementation for P2SAM (ACM MM 2024)☆9Updated 4 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆173Updated this week
- ☆80Updated last month
- Official implementation of NeurIPS'24 paper Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features☆25Updated last month
- a brief repo about paper research☆15Updated 7 months ago
- A Collection of AIGC Research Groups☆73Updated last month
- This is a repo to track the latest autoregressive visual generation papers.☆280Updated this week
- Empowering Unified MLLM with Multi-granular Visual Generation☆119Updated 3 months ago
- [CVPR2025] Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters☆27Updated last month
- ☆113Updated 2 months ago
- ☆20Updated last month
- ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning☆28Updated 3 weeks ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆42Updated 2 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Updated 10 months ago
- This is the official implementation for ControlVAR.☆103Updated 4 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆73Updated 2 weeks ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆154Updated last week
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆45Updated 3 weeks ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆20Updated 2 months ago
- a collection of awesome autoregressive visual generation models☆72Updated last week
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆316Updated last month
- Implements VAR+CLIP for text-to-image (T2I) generation☆135Updated 3 months ago
- SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context☆5Updated 4 months ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆72Updated last year
- ☆20Updated 3 months ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆36Updated 3 months ago