John-Ge/Awesome-Native-Multimodal-Models

☆35

Alternatives and similar repositories for Awesome-Native-Multimodal-Models

Users interested in Awesome-Native-Multimodal-Models are comparing it to the repositories listed below.

LeapLabTHU / UniTTA
☆21 · Mar 5, 2025 · Updated last year
LeapLabTHU / DAT-Jittor
Jittor implementation of Vision Transformer with Deformable Attention
☆32 · Mar 1, 2022 · Updated 4 years ago
PinJui / FDRL
Unofficial implementation of "Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition - CVPR'21"
☆19 · Mar 3, 2024 · Updated 2 years ago
hitachinsk / NeRF-Inpainting
Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"
☆11 · Apr 30, 2023 · Updated 3 years ago
zal0302 / CII
The official PyTorch implementation of IEEE Transactions on Image Processing 2021 paper "Rethinking the U-shape Structure for Salient Obj…
☆20 · Dec 1, 2022 · Updated 3 years ago
THUNLP-MT / ActiView
☆11 · Dec 20, 2024 · Updated last year
LINs-lab / cluster_tutorial
☆17 · Mar 19, 2026 · Updated 4 months ago
VITA-MLLM / Sparrow
Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation
☆32 · Mar 28, 2025 · Updated last year
MonoFormer / MonoFormer
The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"
☆92 · Oct 12, 2024 · Updated last year
ai-kunkun / PASA
[ICML 2026] PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks
☆23 · May 13, 2026 · Updated 2 months ago
alibaba / conv-llava
☆128 · Jul 29, 2024 · Updated last year
NoisyWinds / MachineLearning
A collection of simple machine learning demos. Explains the underlying principles in the simplest language possible, using TensorFlow on Python 3.6.
☆10 · Apr 7, 2018 · Updated 8 years ago
lhaof / Adversarial-Attack-Papers
☆13 · Sep 21, 2019 · Updated 6 years ago
zhenyuanlu / awesome-pain-intensity-classification-papers
A comprehensive list of pain intensity classification papers mainly based on deep learning algorithms
☆12 · Oct 20, 2024 · Updated last year
blanclist / ICNet
ICNet: Intra-saliency Correlation Network for Co-Saliency Detection, NeurIPS(2020)
☆30 · Apr 18, 2021 · Updated 5 years ago
showlab / Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
☆828 · Oct 10, 2025 · Updated 9 months ago
ChenyuHeidiZhang / VL-commonsense
☆14 · May 23, 2022 · Updated 4 years ago
grignarder / high-quality-blendshape-generation
☆19 · Jul 8, 2024 · Updated 2 years ago
ZurichRain / HMCGR
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10 · Oct 20, 2022 · Updated 3 years ago
trandangtrungduc / llama-paper-summary
Code, Resources - Personal project - Llama Paper Summary - October 14, 2024.
☆11 · Oct 15, 2024 · Updated last year
huangyuxiang03 / Locret
☆14 · Oct 3, 2024 · Updated last year
StevenGrove / LearnableTreeFilterV2
☆92 · Jan 22, 2021 · Updated 5 years ago
thuiar / Robust-MSA
☆11 · May 12, 2023 · Updated 3 years ago
marcdemers / FIGR-8-SVG
FIGR-8, but images in .SVG vector graphics format
☆15 · Feb 16, 2019 · Updated 7 years ago
SHI-Labs / Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025
☆46 · Mar 1, 2025 · Updated last year
oranshayer / BRRF
Boundaries and Region Representation Fusion
☆12 · Mar 24, 2023 · Updated 3 years ago
yoqim / PR-HFR
☆13 · Nov 23, 2022 · Updated 3 years ago
thuiar / CTMWA
Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis
☆15 · May 16, 2024 · Updated 2 years ago
KejiaZhang-Robust / AI-Agent-papers
Collection of recent works on AI Agents.
☆17 · Jun 5, 2025 · Updated last year
syjmelody / RankE
Implementation of RankE: End-to-End Discrete Text-to-Image Post-Training via Rank-Consistent Alignment
☆20 · May 27, 2026 · Updated last month
facebookresearch / metamorph
Code for MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
☆235 · Jan 22, 2026 · Updated 5 months ago
JinXins / MergeMix
[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding
☆21 · Feb 27, 2026 · Updated 4 months ago
KD-TAO / OmniAgent
OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding
☆22 · Apr 9, 2026 · Updated 3 months ago
JiJingYu / tensorflow-exercise
☆16 · Apr 2, 2017 · Updated 9 years ago
Teng-Sun / CLUE_model
CLUE code
☆15 · May 1, 2025 · Updated last year
thuiar / AWESOME-Dialogue
Paper List for Dialogue and Interactive Systems
☆15 · Jun 5, 2020 · Updated 6 years ago
pureexe / rf-inversion-sd3
[Unofficial] RF Inversion implemented for SD3 / SD3.5
☆13 · Nov 4, 2024 · Updated last year
jinhong-ni / UniPano
[ICCV 2025] Official implementation of "What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?"
☆21 · Aug 7, 2025 · Updated 11 months ago
geotle77 / UCAS_AICS
☆10 · Dec 26, 2023 · Updated 2 years ago