☆35Apr 9, 2025Updated last year
Alternatives and similar repositories for Awesome-Native-Multimodal-Models
Users that are interested in Awesome-Native-Multimodal-Models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Mar 5, 2025Updated last year
- TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction☆302Jun 1, 2026Updated last week
- A curated list of domain adaptation papers, datasets and other resources.☆50Jul 17, 2020Updated 5 years ago
- Jittor implementation of Vision Transformer with Deformable Attention☆32Mar 1, 2022Updated 4 years ago
- Unofficial implementation of "Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition - CVPR'21"☆19Mar 3, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The official PyTorch implementation of IEEE Transactions on Image Processing 2021 paper "Rethinking the U-shape Structure for Salient Obj…☆20Dec 1, 2022Updated 3 years ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 3 years ago
- [NeurIPS 2025] FastVID: Dynamic Density Pruning for Fast Video Large Language Models☆36Nov 10, 2025Updated 7 months ago
- ACM MM '22: Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning☆18Dec 20, 2022Updated 3 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆33Oct 16, 2025Updated 7 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆42Apr 10, 2025Updated last year
- ☆128Jul 29, 2024Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆92Oct 12, 2024Updated last year
- Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation☆32Mar 28, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [TMM 2023] Official Implementation of "Bidirectional Translation Between UHD-HDR and HD-SDR Videos"☆10Aug 8, 2024Updated last year
- 这里会收集一些简单的机器学习 demo。使用尽量简单的语言剖析原理,使用 Python3.6 下的 Tensorflow。☆10Apr 7, 2018Updated 8 years ago
- HGFM : A Hierarchical Grained and Feature Model for Acoustic Emotion Recgnition☆11Oct 30, 2020Updated 5 years ago
- ICNet: Intra-saliency Correlation Network for Co-Saliency Detection, NeurIPS(2020)☆30Apr 18, 2021Updated 5 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation☆14Sep 14, 2023Updated 2 years ago
- ☆19Jul 8, 2024Updated last year
- ☆92Jan 22, 2021Updated 5 years ago
- ☆14Oct 3, 2024Updated last year
- The official repo for "SurFhead: Affine Rig Blending for Geometrically Accurate 2D Gaussian Surfel Head Avatars (ICLR 2025)"☆24Apr 21, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Boundaries and Region Representation Fusion☆12Mar 24, 2023Updated 3 years ago
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated 2 years ago
- ☆11Jun 22, 2024Updated last year
- FIGR-8, but images in .SVG vector graphics format☆15Feb 16, 2019Updated 7 years ago
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆235Jan 22, 2026Updated 4 months ago
- Enhancing Ultrahigh Resolution Remote Sensing Imagery Analysis With ImageRAG [GRSM]☆31May 16, 2026Updated 3 weeks ago
- implementation of aided LLM codeplan algorithm in java☆10Jan 13, 2024Updated 2 years ago
- ☆10Dec 26, 2023Updated 2 years ago
- ☆31Jul 21, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆39Sep 30, 2020Updated 5 years ago
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO☆81Apr 30, 2026Updated last month
- Click Ctrl+G to instantly jump to the open folder of the file you’re working with.☆12Nov 19, 2022Updated 3 years ago
- ☆77May 16, 2019Updated 7 years ago
- This repository open-sources CreatiPoster, an AI-driven graphic design generation system for multi-layer and editable compositions with s…☆92Jun 14, 2025Updated 11 months ago
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆15Mar 18, 2025Updated last year
- ☆16Apr 2, 2017Updated 9 years ago