AnonymousDUTAI / SREKCARC-IA-TUDLinks
☆20Updated 11 months ago
Alternatives and similar repositories for SREKCARC-IA-TUD
Users that are interested in SREKCARC-IA-TUD are comparing it to the libraries listed below
Sorting:
- A collection of vision foundation models unifying understanding and generation.☆57Updated 8 months ago
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆286Updated last month
- A tiny paper rating web☆39Updated 5 months ago
- The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs☆102Updated 2 months ago
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆17Updated 2 months ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆121Updated 3 weeks ago
- ☆59Updated last month
- Fundamentals of Digital Media Technology(04713901) | Peking University ECE Course Materials☆18Updated 3 years ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆279Updated 3 weeks ago
- ☆31Updated 2 months ago
- Official implementation of MC-LLaVA.☆139Updated 2 weeks ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆64Updated last month
- Collection of Highlight papers☆41Updated last year
- Official Code for PosterGen☆71Updated this week
- [TMLR 2025] Efficient Diffusion Models: A Survey☆100Updated 2 months ago
- ☆48Updated last week
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆87Updated last week
- Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆77Updated 3 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆117Updated 10 months ago
- Survey: https://arxiv.org/pdf/2507.20198☆121Updated last week
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆43Updated last week
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆144Updated 2 weeks ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆190Updated 2 weeks ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆158Updated 3 months ago
- ☆49Updated 2 weeks ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning☆91Updated this week
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆123Updated 4 months ago
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆52Updated last month
- Official Release of ACM TOG 2025 paper -- GS-ROR☆23Updated 3 weeks ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆76Updated last month