OpenEnvision-Lab/Awesome-Multimodal-Modeling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenEnvision-Lab/Awesome-Multimodal-Modeling)

OpenEnvision-Lab / Awesome-Multimodal-Modeling

Awesome Multimodal Modeling [Covers MLLM, UMM, and NMM]

☆161

Alternatives and similar repositories for Awesome-Multimodal-Modeling

Users that are interested in Awesome-Multimodal-Modeling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Mwxinnn / UniAS
View on GitHub
The official repo for ”[WACV2025] Towards Accurate Unified Anomaly Segmentation“
☆15Apr 14, 2025Updated 11 months ago
hustvl / Spa3R
View on GitHub
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
☆47Mar 25, 2026Updated 2 weeks ago
yangsizhe / MoVie
View on GitHub
[NeurIPS 2023] MoVie: Visual Model-Based Policy Adaptation for View Generalization
☆11Sep 22, 2023Updated 2 years ago
xie-lab-ml / Meissonic-Inference
View on GitHub
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16Nov 21, 2024Updated last year
linxin0 / RSCP2GAN
View on GitHub
☆18Aug 7, 2025Updated 8 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
yesheng-THU / GFGE
View on GitHub
GFGE
☆15Sep 7, 2022Updated 3 years ago
princeton-computational-imaging / Neural-Volume-Super-Resolution
View on GitHub
☆17Jan 28, 2024Updated 2 years ago
GradientSpaces / HouseTour
View on GitHub
[ICCV 2025] HouseTour: A Virtual Real Estate A(I)gent
☆38Oct 22, 2025Updated 5 months ago
Ahren09 / SciEvo
View on GitHub
A longitudinal dataset for academic literature, including papers, metadata, and citation graphs, Also available on 🤗 HuggingFace and Kag…
☆17Sep 6, 2025Updated 7 months ago
yec22 / Fine-Grained-Indoor-Recon
View on GitHub
☆17Aug 13, 2024Updated last year
ShaoShuai0605 / Misevolution
View on GitHub
Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents
☆71Oct 28, 2025Updated 5 months ago
shirlyliu64 / ConvBench
View on GitHub
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models
☆16Sep 27, 2024Updated last year
GasaiYU / ucas_computer_network_2022_spring
View on GitHub
The computer network lab in ucas spring 2022
☆10Nov 17, 2022Updated 3 years ago
THU-SI / Spatial-TTT
View on GitHub
Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
☆169Mar 13, 2026Updated 3 weeks ago
NordVPN Special Discount Offer • Ad
Save on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
Ephemeral182 / Empirical-Study-of-GPT-4o-Image-Gen
View on GitHub
An Empirical Study of GPT-4o Image Generation Capabilities
☆29Apr 16, 2025Updated 11 months ago
humansensinglab / dfcil-hgr
View on GitHub
[ICCV 2023] Data-Free Class-Incremental Hand Gesture Recognition
☆17Sep 21, 2023Updated 2 years ago
Kqp1227 / Sensitive-Channel-Pruning
View on GitHub
This is the official repository of the following paper: "Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis"…
☆10Jan 4, 2025Updated last year
FuNz-0 / One-for-More
View on GitHub
The official implementation of “One-for-More: Continual Diffusion Model for Anomaly Detection” （CVPR2025)
☆61May 7, 2025Updated 11 months ago
Sonne-Ding / RRESM
View on GitHub
☆16Sep 3, 2025Updated 7 months ago
TencentARC / CubeComposer
View on GitHub
[CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
☆107Mar 24, 2026Updated 2 weeks ago
caozidong / PanDA
View on GitHub
[CVPR 2025] Official code of "PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation"
☆46Mar 18, 2025Updated last year
Sphere-AI-Lab / fda
View on GitHub
Implementation of <Model Merging with Functional Dual Anchors>
☆47Nov 23, 2025Updated 4 months ago
XJTU-XGU / ARPM
View on GitHub
Code for ARPM ("Adversarial Reweighting with α-Power Maximization for Domain Adaptation"), IJCV, 2024.
☆13May 28, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
FrankYang-17 / Mavors
View on GitHub
☆15May 30, 2025Updated 10 months ago
LeCAR-Lab / flow-diffusion-1d-tutorial
View on GitHub
A simple 1-d diffusion/flow model tutorial for LeCAR group meeting
☆16Sep 27, 2025Updated 6 months ago
cubeyoung / Noise2Score
View on GitHub
[NeurIPS'21] Noise2Score: Tweedie's Approach to Self-Supervised Image Denoising without Clean Images
☆14Mar 26, 2024Updated 2 years ago
btma48 / AutoLA
View on GitHub
Code of our Neurips2020 paper "Auto Learning Attention", coming soon
☆22Apr 14, 2021Updated 4 years ago
OpenDCAI / Open-NotebookLM
View on GitHub
An Open Source implementation of Notebook LM.
☆57Apr 3, 2026Updated last week
InternRobotics / MMSI-Video-Bench
View on GitHub
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
☆57Mar 11, 2026Updated last month
AshleyLuo001 / UTANet
View on GitHub
[AAAI 2025] Open-source, End-to-end, Medical Image Segmentation model by Task allociation.
☆34May 22, 2025Updated 10 months ago
Show-han / Zeroshot_REC
View on GitHub
Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)
☆28Jun 21, 2024Updated last year
starreeze / efuf
View on GitHub
the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…
☆21Apr 9, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
wei-mao-2019 / SDFFlow
View on GitHub
official implementation for our paper SDFFlow
☆37May 9, 2024Updated last year
TaoWangzj / PromptRR
View on GitHub
[arXiv 2024] PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal
☆17Feb 8, 2024Updated 2 years ago
cqylunlun / CoPS
View on GitHub
[ArXiv 2025] Official Implementation for "CoPS: Conditional Prompt Synthesis for Zero-Shot Anomaly Detection"
☆27Aug 11, 2025Updated 8 months ago
Nomination-NRB / RL-snack
View on GitHub
强化学习贪吃蛇
☆15Oct 19, 2023Updated 2 years ago
kaist-cvml / 3d-vlm-gd
View on GitHub
[EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation
☆32Jun 12, 2025Updated 10 months ago
Franklin-Zhang0 / ReasonGen-R1
View on GitHub
Official respository for ReasonGen-R1
☆75Jun 23, 2025Updated 9 months ago
mm-vl / ULM-R1
View on GitHub
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
☆43Jul 22, 2025Updated 8 months ago