InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, and image editing into a single framework.
☆257Mar 21, 2026Updated 2 weeks ago
Alternatives and similar repositories for InternVL-U
Users that are interested in InternVL-U are comparing it to the libraries listed below.
- The official repo of CrossEarth-SAR, a SAR-centric, billion-scale geospatial foundation model for cross-domain semantic segmentation☆36Mar 18, 2026Updated 3 weeks ago
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆41Updated this week
- ☆16Dec 25, 2025Updated 3 months ago
- Official code implementation of "OmniPSD: Layered PSD Generation with Diffusion Transformer"☆95Dec 13, 2025Updated 3 months ago
- (CVPR 2026) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject composition …☆28Jan 14, 2026Updated 2 months ago
- ☆52Feb 9, 2026Updated 2 months ago
- [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space☆26Mar 15, 2026Updated 3 weeks ago
- Millions-Level Face/Human-Scene Image-Text Datasets☆24Jun 9, 2025Updated 10 months ago
- Evaluation codes and data for GenEval2☆61Jan 8, 2026Updated 3 months ago
- ☆43Sep 1, 2025Updated 7 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆55Dec 25, 2025Updated 3 months ago
- ☆10Jun 12, 2021Updated 4 years ago
- Datasets and Code for: https://arxiv.org/pdf/2305.14914.pdf☆35Aug 12, 2024Updated last year
- ☆39May 20, 2025Updated 10 months ago
- Globally Consistent Probabilistic Human Motion Estimation☆23Feb 28, 2023Updated 3 years ago
- ☆50Aug 27, 2025Updated 7 months ago
- ☆16Oct 12, 2025Updated 5 months ago
- Update the latest text-related papers from top conferences☆27Mar 12, 2025Updated last year
- RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.☆13May 26, 2023Updated 2 years ago
- ☆10Jul 5, 2024Updated last year
- ☆10Jul 12, 2022Updated 3 years ago
- Vision-Language Dataset for Remote Sensing☆42May 27, 2025Updated 10 months ago
- [CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing"☆89Mar 19, 2026Updated 3 weeks ago
- Pytorch Preprocessing and Training for Open X-Embodiment☆26Jul 13, 2024Updated last year
- [CVPR'25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal…☆22Jun 6, 2025Updated 10 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs☆35Dec 15, 2025Updated 3 months ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆45Dec 20, 2023Updated 2 years ago
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆18Jun 3, 2024Updated last year
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 5 months ago
- The official repo of continuous speculative decoding☆32Mar 28, 2025Updated last year
- Tutorial codes for KITTI360 Dataset.☆10Aug 24, 2022Updated 3 years ago
- [ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction☆65Sep 3, 2025Updated 7 months ago
- Unlocking Iterative Reasoning for Any Image Editor☆105Jan 18, 2026Updated 2 months ago
- This is the official repository of UltraHR-100K.☆46Nov 21, 2025Updated 4 months ago
- [ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal …☆121Jan 24, 2026Updated 2 months ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 3 months ago
- Aerial Detection Toolbox☆11Jan 18, 2023Updated 3 years ago
- A mini Photoshop software with C++, OpenCV and Qt☆10Jun 6, 2021Updated 4 years ago