shxie2020 / Awesome-UGVFMView external linksLinks
A collection of vision foundation models unifying understanding and generation.
☆59Jan 2, 2025Updated last year
Alternatives and similar repositories for Awesome-UGVFM
Users that are interested in Awesome-UGVFM are comparing it to the libraries listed below
Sorting:
- Awesome autoregressive vision foundation models☆25Dec 24, 2024Updated last year
- [ICLR 2023] ReScore: Boosting Causal Discovery via Adaptive Sample Reweighting☆11Mar 11, 2023Updated 2 years ago
- A simple Computer Vision Framework, mainly based on PyTorch. Including distributed training, logging and so on.☆12Dec 2, 2023Updated 2 years ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 8 months ago
- ☆14Oct 10, 2022Updated 3 years ago
- Descrição diário da toda minha trajetória de estudos☆15Jan 30, 2025Updated last year
- Code for paper: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models☆18Jun 6, 2024Updated last year
- [ICLR 2024] This is the official implementation of our paper "Semantic Flow: Learning Semantic Fields of Dynamic Scenes from Monocular Vi…☆13Sep 28, 2024Updated last year
- ☆16Apr 4, 2025Updated 10 months ago
- ☆41Jan 4, 2026Updated last month
- ☆19May 19, 2024Updated last year
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆104Jul 9, 2025Updated 7 months ago
- [Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey☆477Jan 17, 2025Updated last year
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆39Jul 22, 2025Updated 6 months ago
- Awesome Unified Multimodal Models☆1,108Feb 6, 2026Updated last week
- [TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"☆147Nov 14, 2024Updated last year
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆172Dec 17, 2025Updated 2 months ago
- Replication in Visual Diffusion Models: A Survey and Outlook☆31Aug 2, 2024Updated last year
- [KDD 2023 (Oral)] Discovering Dynamic Causal Space for DAG Structure Learning☆24Jun 15, 2023Updated 2 years ago
- ☆26Mar 20, 2023Updated 2 years ago
- Quantum zero-day exploit Hunting for vulnerabilities as small as a quantum particle☆13Jun 13, 2025Updated 8 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 5 months ago
- [CVPR 2024] Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior☆179May 22, 2024Updated last year
- ☆32Jul 29, 2025Updated 6 months ago
- ☆179Feb 21, 2025Updated 11 months ago
- Let's finetune video generation models!☆539Sep 15, 2025Updated 5 months ago
- ☆19Dec 9, 2024Updated last year
- Cẩm nang Lập trình Thi đấu☆14Jan 22, 2026Updated 3 weeks ago
- Lightweight CLI note tracker in Golang with SQLite. A changelog for your mind all within the terminal!☆11Jan 16, 2025Updated last year
- Resources, notes, and projects from Google's 5-Day Generative AI Intensive Course☆12Nov 29, 2024Updated last year
- Chatbot de WhatsApp para fluxo de conversa☆12Dec 29, 2024Updated last year
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency"☆43Dec 26, 2025Updated last month
- ☆11Nov 23, 2024Updated last year
- ☆51Aug 22, 2025Updated 5 months ago
- Cheapo is a fun and interactive economy bot for Discord servers. Earn coins, climb the leaderboard, and unlock awesome rewards.☆16Aug 27, 2025Updated 5 months ago
- ☆16Dec 25, 2024Updated last year
- 记录推荐系统相关的面试题、优化经验☆36Jun 2, 2025Updated 8 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆508Nov 14, 2025Updated 3 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Aug 25, 2024Updated last year