☆24May 23, 2025Updated last year
Alternatives and similar repositories for Generalization_unified_VLM
Users that are interested in Generalization_unified_VLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Sep 22, 2025Updated 9 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆191May 21, 2025Updated last year
- ☆15Apr 25, 2025Updated last year
- ☆16Sep 17, 2024Updated last year
- [ACM MM2025] The official repository for the RealSyn dataset☆39Dec 14, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆40May 9, 2026Updated last month
- Official repository for the paper ''ambigram generation by a diffusion model''.☆17Aug 9, 2023Updated 2 years ago
- Image Super-Resolution Using Very Deep Residual Channel Attention Networks☆15Nov 29, 2021Updated 4 years ago
- The code repository of UniRL☆52May 30, 2025Updated last year
- On Path to Multimodal Generalist: General-Level and General-Bench☆21Jul 11, 2025Updated 11 months ago
- ☆13Sep 29, 2024Updated last year
- HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing [ICCV 2025]☆32Feb 9, 2026Updated 4 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆105Apr 23, 2025Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated 2 years ago
- ☆64Apr 16, 2026Updated 2 months ago
- ☆45Jan 4, 2026Updated 5 months ago
- (ICCV 2025) "Principal Components" Enable A New Language of Images☆86Jun 4, 2026Updated 3 weeks ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆63Dec 26, 2025Updated 6 months ago
- ☆19Jun 29, 2025Updated last year
- ☆19Aug 7, 2025Updated 10 months ago
- A Public repository for the COMeT model☆13Jul 25, 2024Updated last year
- ☆15Jan 1, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2024] 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis.☆22Apr 23, 2025Updated last year
- Airlight estimation according to "Blind Dehazing Using Internal Patch Recurrence"☆16Oct 22, 2018Updated 7 years ago
- End2End Virtual Try-on with Visual Reference, CVPR2026☆68Apr 18, 2026Updated 2 months ago
- Official repository for "On the Multi-modal Vulnerability of Diffusion Models"☆17Jul 15, 2024Updated last year
- ☆27Apr 25, 2025Updated last year
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 9 months ago
- [ICLR 26] DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆41Aug 3, 2025Updated 10 months ago
- BetterNet is a state-of-the-art deep learning model for accurate and efficient polyp segmentation in medical images. It combines Efficien…☆14May 8, 2024Updated 2 years ago
- [NeurIPS 2025] E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization☆39Nov 3, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Self-supervised Learning and Adaptation for Single Image Dehazing (IJCAI-ECAI 2022 long presentation)☆24Dec 13, 2022Updated 3 years ago
- Official implementation of “Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models” (AAAI 2026).☆37Mar 22, 2026Updated 3 months ago
- [NeurIPS 2024] Efficient Large Multi-modal Models via Visual Context Compression☆66Feb 19, 2025Updated last year
- ☆19Oct 20, 2024Updated last year
- Official codes for GRA (Accepted by ICCV2023)☆17Jul 18, 2023Updated 2 years ago
- Modality Gap Theory☆74May 16, 2026Updated last month
- [Under Review] Super4DR: 4D Radar-centric Self-supervised Odometry and Gaussian-based Map Optimization☆34Dec 11, 2025Updated 6 months ago