Official implementation for "Diffusion Instruction Tuning"
☆35Apr 1, 2026Updated 2 months ago
Alternatives and similar repositories for vlm
Users that are interested in vlm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models [CVPR 2024]☆27Oct 7, 2024Updated last year
- This repository contains the **official implementation** of the paper: "VL2Lite: Task-Specific Knowledge Distillation from Large Vision-…☆19Mar 23, 2025Updated last year
- [ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segment…☆23Jan 18, 2026Updated 4 months ago
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆41Jan 30, 2026Updated 4 months ago
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)☆27May 23, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [EMNLP 2025] Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations☆44Jan 14, 2026Updated 5 months ago
- DyRAMO: Dynamic Reliability Adjustment for Multi-objective Optimization☆15Mar 17, 2025Updated last year
- The official pytorch implemention of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".☆15Mar 26, 2025Updated last year
- OpenSUN3D Workshop Challenge - CVPR '24☆16May 31, 2024Updated 2 years ago
- Code for Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning in IEEE TPAMI☆15Apr 18, 2025Updated last year
- ☆18Nov 15, 2024Updated last year
- Image Encryption/Decryption using Rubik's Cube Principle and AES☆10Jan 13, 2022Updated 4 years ago
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning☆35May 6, 2026Updated last month
- Code of ["Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation"]☆14Apr 26, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [WACV 2025] Official Implementation of LIME: Localized Image Editing via Attention Regularization in Diffusion Models☆10Apr 7, 2025Updated last year
- ☆29Oct 13, 2025Updated 8 months ago
- This repo is about implementing pose estimation with HRNet and also, is a sub-task of the smart hospital bed project☆12Jan 21, 2022Updated 4 years ago
- ☆17Mar 17, 2020Updated 6 years ago
- Synthesizable 3D Molecule Generation via Joint Reaction and Coordinate Modeling☆30Jun 3, 2026Updated last week
- Ghi chép trong quá trình tìm hiểu Prometheus, cảnh báo qua sms, telegram, slack, gmail☆13Sep 17, 2022Updated 3 years ago
- Counterfactual Generative Modeling with Variational Causal Inference (ICLR 2025)☆20Sep 30, 2025Updated 8 months ago
- An ECG Foundation Model: Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners (ICML 2025)☆33Mar 7, 2026Updated 3 months ago
- Reliable Wrist PPG Monitoring by Mitigating Poor Skin Sensor Contact (Scientific Reports)☆21Apr 10, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICML 2026] Elastic Diffusion Transformer: Accelerating SOTA generation models (e.g., Qwen-Image, Hunyuan3d ) through adaptive computatio…☆44May 1, 2026Updated last month
- MCPL: MULTI-CONCEPT PROMPT LEARNING☆20May 27, 2024Updated 2 years ago
- A browser extension that removes undesired websites from Google results to make easy to access high quality and clean information using t…☆23May 4, 2020Updated 6 years ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆69Jan 28, 2026Updated 4 months ago
- [NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models☆32Nov 12, 2024Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆72Jul 13, 2025Updated 11 months ago
- Official Implementation of Object-aware Monocular Depth Prediction with Instance Convolutions☆21May 1, 2023Updated 3 years ago
- [CVPR 2025] Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts☆23Jun 22, 2025Updated 11 months ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆32Nov 9, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Convolutional Neural Network for Text Classification in Keras☆14Jul 22, 2017Updated 8 years ago
- Discriminator for Model Docking☆11Dec 20, 2024Updated last year
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 11 months ago
- ☆22May 13, 2019Updated 7 years ago
- Awesome-GenAITech: a curated list of Generative AI Techniques☆11Jul 11, 2023Updated 2 years ago
- We used a web scraper to obtain all the papers from ECCV that have not yet been officially announced, making them available for those who…☆24Sep 2, 2024Updated last year
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆28Aug 7, 2025Updated 10 months ago