aim-uofa/VLModel

Repo of HawkLlama.

☆16

Alternatives and similar repositories for VLModel

Users who are interested in VLModel are comparing it to the repositories listed below.

aim-uofa / PerturboLLaVA
☆17 · Apr 20, 2025 · Updated last year
YukunLi99 / AdaptSAM
☆22 · Jun 30, 2023 · Updated 3 years ago
aim-uofa / dLLM-MidTruth
[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".
☆66 · Mar 5, 2026 · Updated 4 months ago
aim-uofa / VFN
[ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".
☆31 · Aug 23, 2024 · Updated last year
aim-uofa / COSINE
[ICCV'25] Unified Open-World Segmentation with Multi-Modal Prompts
☆16 · Jun 16, 2026 · Updated last month
aim-uofa / GSI-Bench
[CVPR2026] Exploring Spatial Intelligence from a Generative Perspective
☆30 · Jun 3, 2026 · Updated last month
aim-uofa / DiffewS
[NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)
☆51 · Apr 14, 2025 · Updated last year
aim-uofa / TVRBench
TVRBench: Target Viewpoint Reproduction Benchmark for Active Spatial Intelligence
☆25 · Jun 2, 2026 · Updated last month
aim-uofa / FreeCompose
☆51 · Oct 6, 2024 · Updated last year
aim-uofa / DiverGen
DiverGen (CVPR 2024) & BSGAL (ICML 2024)
☆53 · Jul 6, 2025 · Updated last year
aim-uofa / ReasonMatch
[CVPR2026] Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
☆19 · Jun 4, 2026 · Updated last month
aim-uofa / AGILE
☆46 · May 6, 2026 · Updated 2 months ago
aim-uofa / EvoTokenDLM
[ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)
☆48 · Apr 7, 2026 · Updated 3 months ago
EnVision-Research / StreamMA
Official implementation of "Streaming Communication in Multi-Agent Reasoning"
☆34 · Jun 6, 2026 · Updated last month
Eurekashen / R2Seg
Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection
☆24 · Jun 29, 2026 · Updated 3 weeks ago
mhh0318 / OneMoreStep
☆25 · Nov 30, 2023 · Updated 2 years ago
QingZhong1996 / Awesome-Video-Instance-Segmentation-Papers
☆36 · Oct 21, 2022 · Updated 3 years ago
aim-uofa / RGM
☆70 · Oct 19, 2023 · Updated 2 years ago
aim-uofa / SegAgent
[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
☆106 · Aug 8, 2025 · Updated 11 months ago
aim-uofa / OmniJigsaw
☆34 · Apr 10, 2026 · Updated 3 months ago
baaivision / URSA
[ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation
☆123 · May 20, 2026 · Updated 2 months ago
aim-uofa / AutoStory
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
☆149 · Mar 5, 2026 · Updated 4 months ago
ayatough / vscode-image-tile-viewer
VS Code extension for showing images in a tile view
☆11 · Mar 6, 2023 · Updated 3 years ago
EnVision-Research / MTI
[ACL 2026] Official implementation of "Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention"
☆41 · Apr 18, 2026 · Updated 3 months ago
alibaba-damo-academy / K-Forcing
Official implementation for "K-Forcing: Joint Next-K-Token Decoding via Push-Forward Language Modeling"
☆16 · Jun 14, 2026 · Updated last month
jamessealesmith / ConStruct-VL
PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"
☆13 · Feb 5, 2024 · Updated 2 years ago
showlab / VisorGPT
[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT
☆138 · May 4, 2024 · Updated 2 years ago
zju3dv / pats
Code for "PATS: Patch Area Transportation with Subdivision for Local Feature Matching", CVPR 2023
☆99 · Aug 28, 2023 · Updated 2 years ago
aim-uofa / Framer
[ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".
☆498 · Jan 9, 2025 · Updated last year
aim-uofa / Tinker
One-shot and Few-shot 3D Editing without Per-Scene Optimization
☆175 · Aug 21, 2025 · Updated 11 months ago
aim-uofa / StaMo
Unsupervised Learning of Generalizable Robot Motion from Compact State Representation
☆40 · Jun 10, 2026 · Updated last month
aim-uofa / Omni-R1
[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
☆126 · Dec 3, 2025 · Updated 7 months ago
OpenGVLab / PonderV2
[T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
☆376 · Sep 30, 2025 · Updated 9 months ago
aim-uofa / Matcher
[ICLR'24 & IJCV'25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
☆569 · Dec 3, 2025 · Updated 7 months ago
yoctta / XPaste
☆54 · Aug 3, 2023 · Updated 2 years ago
aim-uofa / GenPercept
[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
☆229 · Jan 24, 2025 · Updated last year
jingchenchen / ReasoningConsistency-VQA
☆13 · Aug 14, 2022 · Updated 3 years ago
harrytea / TGDoc
"Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023
☆16 · Nov 28, 2024 · Updated last year
see-say-segment / sesame
🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"
☆47 · Jun 16, 2024 · Updated 2 years ago