Official implementation for "Diffusion Instruction Tuning"
☆34 · Apr 1, 2026 · Updated last month
Alternatives and similar repositories for vlm
Users that are interested in vlm are comparing it to the libraries listed below.
- ☆11 · Oct 2, 2024 · Updated last year
- From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving ☆11 · Mar 16, 2025 · Updated last year
- This repository contains the **official implementation** of the paper: "VL2Lite: Task-Specific Knowledge Distillation from Large Vision-… ☆19 · Mar 23, 2025 · Updated last year
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation) ☆27 · May 23, 2024 · Updated 2 years ago
- DyRAMO: Dynamic Reliability Adjustment for Multi-objective Optimization ☆15 · Mar 17, 2025 · Updated last year
- Cross Visual Prompt Tuning [ICCV 2025] ☆13 · Aug 3, 2025 · Updated 9 months ago
- Code for Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning in IEEE TPAMI ☆15 · Apr 18, 2025 · Updated last year
- ☆18 · Nov 15, 2024 · Updated last year
- The official implementation of "Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition" … ☆73 · Apr 4, 2026 · Updated last month
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning ☆35 · May 6, 2026 · Updated 2 weeks ago
- [WACV 2025] Official Implementation of LIME: Localized Image Editing via Attention Regularization in Diffusion Models ☆10 · Apr 7, 2025 · Updated last year
- ☆29 · Oct 13, 2025 · Updated 7 months ago
- ☆17 · Mar 17, 2020 · Updated 6 years ago
- [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models. ☆27 · Nov 18, 2025 · Updated 6 months ago
- [EMNLP 2024 poster] Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning ☆16 · Dec 17, 2024 · Updated last year
- [ICCV 2025 Highlight] Official code for UnZipLoRA: Separating Content and Style from a Single Image ☆40 · Jul 30, 2025 · Updated 9 months ago
- Notes taken while learning Prometheus, with alerting via SMS, Telegram, Slack, and Gmail ☆13 · Sep 17, 2022 · Updated 3 years ago
- This repo includes all of the solutions to the Algorithmic Toolbox course from Coursera ☆10 · Oct 10, 2022 · Updated 3 years ago
- Counterfactual Generative Modeling with Variational Causal Inference (ICLR 2025) ☆20 · Sep 30, 2025 · Updated 7 months ago
- this is post-prune tree code for scikit-learn 0.18.0 ☆15 · Jul 25, 2022 · Updated 3 years ago
- Reliable Wrist PPG Monitoring by Mitigating Poor Skin Sensor Contact (Scientific Reports) ☆21 · Apr 10, 2026 · Updated last month
- Official Implementation of CODE ☆17 · Sep 26, 2024 · Updated last year
- Repository for "Graph2Pix: A Graph-Based Image to Image Translation Framework", AIM ICCV 2021 ☆24 · Nov 29, 2021 · Updated 4 years ago
- ☆25 · Jan 30, 2025 · Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation". ☆27 · Oct 13, 2024 · Updated last year
- [NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models ☆31 · Nov 12, 2024 · Updated last year
- My notes life-full-stack ☆20 · Apr 8, 2021 · Updated 5 years ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆73 · Jul 13, 2025 · Updated 10 months ago
- [CVPR '25] Official implementation of the paper "Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages", CVPR 2025. ☆31 · Mar 30, 2025 · Updated last year
- Official Implementation of Object-aware Monocular Depth Prediction with Instance Convolutions ☆21 · May 1, 2023 · Updated 3 years ago
- [CVPR 2025] Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts ☆23 · Jun 22, 2025 · Updated 11 months ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a… ☆29 · Nov 9, 2025 · Updated 6 months ago
- Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation, CVPR24' ☆21 · Nov 4, 2024 · Updated last year
- Discriminator for Model Docking ☆11 · Dec 20, 2024 · Updated last year
- [CVPR 2025] Attention Distillation: A Unified Approach to Visual Characteristics Transfer ☆234 · Mar 8, 2025 · Updated last year
- Simple implementation of Retrieval-Augmented Generation System ☆28 · Oct 24, 2024 · Updated last year
- [CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space ☆41 · Jul 18, 2025 · Updated 10 months ago
- ☆22 · May 13, 2019 · Updated 7 years ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding. ☆48 · Sep 15, 2025 · Updated 8 months ago