arctanxarc / UniCTokens
A framework for unified personalized models that achieves mutual enhancement between personalized understanding and generation. It demonstrates the potential of cross-task information transfer in personalized scenarios, paving the way for the development of general unified models.
☆121 · Updated last week
Alternatives and similar repositories for UniCTokens
Users who are interested in UniCTokens are comparing it to the libraries listed below.
- Official implementation of MC-LLaVA. ☆140 · Updated last month
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models. ☆307 · Updated last week
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆99 · Updated 3 weeks ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List ☆155 · Updated 6 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation ☆218 · Updated last month
- Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing' ☆57 · Updated 3 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆152 · Updated last week
- ☆122 · Updated 6 months ago
- A collection of vision foundation models unifying understanding and generation. ☆55 · Updated 9 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning". ☆57 · Updated 3 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing