Official implementation for "Diffusion Instruction Tuning"
☆34Apr 1, 2026Updated last week
Alternatives and similar repositories for vlm
Users that are interested in vlm are comparing it to the libraries listed below.
- ☆11Oct 2, 2024Updated last year
- [ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segment…☆19Jan 18, 2026Updated 2 months ago
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆37Jan 30, 2026Updated 2 months ago
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)☆27May 23, 2024Updated last year
- [EMNLP 2025] Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations☆44Jan 14, 2026Updated 3 months ago
- DyRAMO: Dynamic Reliability Adjustment for Multi-objective Optimization☆15Mar 17, 2025Updated last year
- The official PyTorch implementation of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".☆15Mar 26, 2025Updated last year
- Cross Visual Prompt Tuning [ICCV 2025]☆13Aug 3, 2025Updated 8 months ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆24Nov 17, 2025Updated 4 months ago
- ☆18Nov 15, 2024Updated last year
- Image Encryption/Decryption using Rubik's Cube Principle and AES☆10Jan 13, 2022Updated 4 years ago
- Code of "Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation"☆14Apr 26, 2024Updated last year
- ☆23Jan 24, 2024Updated 2 years ago
- [WACV 2025] Official Implementation of LIME: Localized Image Editing via Attention Regularization in Diffusion Models☆10Apr 7, 2025Updated last year
- ☆29Oct 13, 2025Updated 6 months ago
- 🔥 Medical Image Analysis 2025: Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Asses…☆30Jan 5, 2026Updated 3 months ago
- ModelB5 for Self Driving☆11Jan 2, 2023Updated 3 years ago
- ☆17Mar 17, 2020Updated 6 years ago
- [ICCV 2025 Highlight] Official code for UnZipLoRA: Separating Content and Style from a Single Image☆40Jul 30, 2025Updated 8 months ago
- [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.☆27Nov 18, 2025Updated 4 months ago
- This repo is created to keep track of my advancement in Problem Solving☆14Sep 16, 2020Updated 5 years ago
- Synthesizable 3D Molecule Generation via Joint Reaction and Coordinate Modeling☆25Mar 3, 2026Updated last month
- Counterfactual Generative Modeling with Variational Causal Inference (ICLR 2025)☆20Sep 30, 2025Updated 6 months ago
- Reliable Wrist PPG Monitoring by Mitigating Poor Skin Sensor Contact (Scientific Reports)☆20Updated this week
- Official Implementation of CODE☆17Sep 26, 2024Updated last year
- Repository for "Graph2Pix: A Graph-Based Image to Image Translation Framework", AIM ICCV 2021☆24Nov 29, 2021Updated 4 years ago
- MCPL: MULTI-CONCEPT PROMPT LEARNING☆20May 27, 2024Updated last year
- ☆25Jan 30, 2025Updated last year
- [NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models☆31Nov 12, 2024Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆65Jan 28, 2026Updated 2 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆73Jul 13, 2025Updated 9 months ago
- Official Implementation of Object-aware Monocular Depth Prediction with Instance Convolutions☆22May 1, 2023Updated 2 years ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆28Nov 9, 2025Updated 5 months ago
- Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation, CVPR24'☆22Nov 4, 2024Updated last year
- Discriminator for Model Docking☆11Dec 20, 2024Updated last year
- [ICLR2025] Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data☆28Mar 3, 2025Updated last year
- [CVPR 2026] Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout☆59Mar 21, 2026Updated 3 weeks ago
- [CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space☆38Jul 18, 2025Updated 8 months ago
- Simple implementation of Retrieval-Augmented Generation System☆29Oct 24, 2024Updated last year