Official implementation for "Diffusion Instruction Tuning"
☆35Apr 1, 2026Updated 3 months ago
Alternatives and similar repositories for vlm
Users that are interested in vlm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Oct 2, 2024Updated last year
- RFTT: Reasoning with Reinforced Functional Token Tuning☆29Feb 12, 2026Updated 4 months ago
- This repository contains the **official implementation** of the paper: "VL2Lite: Task-Specific Knowledge Distillation from Large Vision-…☆20Mar 23, 2025Updated last year
- [ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segment…☆23Jan 18, 2026Updated 5 months ago
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆42Jan 30, 2026Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [EMNLP 2025] Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations☆44Jan 14, 2026Updated 5 months ago
- DyRAMO: Dynamic Reliability Adjustment for Multi-objective Optimization☆15Mar 17, 2025Updated last year
- Cross Visual Prompt Tuning [ICCV 2025]☆13Aug 3, 2025Updated 11 months ago
- The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…☆32Jun 14, 2026Updated 2 weeks ago
- OpenSUN3D Workshop Challenge - CVPR '24☆16May 31, 2024Updated 2 years ago
- ☆18Nov 15, 2024Updated last year
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning☆35May 6, 2026Updated last month
- Code of ["Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation"]☆14Apr 26, 2024Updated 2 years ago
- ☆23Jan 24, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [WACV 2025] Official Implementation of LIME: Localized Image Editing via Attention Regularization in Diffusion Models☆10Apr 7, 2025Updated last year
- A telegram bot that checks current grades from registration.boun.edu.tr☆10Jan 4, 2018Updated 8 years ago
- This repo is about implementing pose estimation with HRNet and also, is a sub-task of the smart hospital bed project☆12Jan 21, 2022Updated 4 years ago
- [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.☆28Nov 18, 2025Updated 7 months ago
- [EMNLP 2024 poster] Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning☆16Dec 17, 2024Updated last year
- [ICCV 2025 Highlight] Official code for UnZipLoRA: Separating Content and Style from a Single Image☆41Updated this week
- This repo includes all of the solutions to the Algorithmic Toolbox course from Coursera☆10Oct 10, 2022Updated 3 years ago
- Reliable Wrist PPG Monitoring by Mitigating Poor Skin Sensor Contact (Scientific Reports)☆21Apr 10, 2026Updated 2 months ago
- Official Implementation of CODE☆17Sep 26, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICML 2026] Elastic Diffusion Transformer: Accelerating SOTA generation models (e.g., Qwen-Image, Hunyuan3d ) through adaptive computatio…☆45May 1, 2026Updated 2 months ago
- ☆25Jan 30, 2025Updated last year
- Search3D: Hierarchical Open-Vocabulary 3D Segmentation☆24May 20, 2025Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆70Jan 28, 2026Updated 5 months ago
- [NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models☆32Nov 12, 2024Updated last year
- This is my project to solve the Lunar Lander environment using the Deep Q-Learning Algorithm with Experience Replay☆12Jan 3, 2023Updated 3 years ago
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆31Mar 12, 2026Updated 3 months ago
- Official Implementation of Object-aware Monocular Depth Prediction with Instance Convolutions☆21May 1, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆32Nov 9, 2025Updated 7 months ago
- Discriminator for Model Docking☆11Dec 20, 2024Updated last year
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 11 months ago
- Simple implementation of Retrieval-Augmented Generation System☆28Oct 24, 2024Updated last year
- [CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space☆42Jul 18, 2025Updated 11 months ago
- Awesome-GenAITech: a curated list of Generative AI Techniques☆11Jul 11, 2023Updated 2 years ago
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆28Aug 7, 2025Updated 10 months ago