thaoshibe/relsim

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thaoshibe/relsim)

thaoshibe / relsim

🍑 relsim: Relational Visual Similarity | pip install relsim 🌍 (CVPR 2026)

☆87

Alternatives and similar repositories for relsim

Users that are interested in relsim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

adobe-research / GroupDiff
View on GitHub
☆19Dec 22, 2025Updated 7 months ago
WisconsinAIVision / YoLLaVA
View on GitHub
🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant (NeurIPS 2024)
☆123Mar 26, 2025Updated last year
shjo-april / TRACE
View on GitHub
[ICLR 2026 Oral] TRACE: Your Diffusion Model Is Secretly an Instance Edge Detector
☆18Mar 2, 2026Updated 4 months ago
thaoshibe / awesome-personalized-lmms
View on GitHub
A curated list of Awesome Personalized Large Multimodal Models resources
☆59Jun 18, 2026Updated last month
intchous / Text2SVG
View on GitHub
☆16Oct 24, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sterzhang / PVIT
View on GitHub
Official Repository of Personalized Visual Instruct Tuning
☆34Mar 6, 2025Updated last year
plnguyen2908 / LASER_ASD
View on GitHub
[WACV 2026 Oral] LASER: Lip Landmark Assisted Speaker Detection for Robustness official implemntation
☆30Feb 26, 2026Updated 5 months ago
plnguyen2908 / UniTalk-ASD-code
View on GitHub
[Interspeech 2026] Revisiting Active Speaker Detection: An In-the-Wild Benchmark for Generalization and Robustness
☆22Jun 25, 2026Updated last month
AIFrontierLab / UniGame
View on GitHub
[CVPR'26] UniGame code implementation
☆20Apr 21, 2026Updated 3 months ago
Zhengjun-Du / ImageVectorViaLayerDecomposition
View on GitHub
The source code of the paper: Image vectorization and editing via linear gradient layer decomposition.
☆37Dec 14, 2023Updated 2 years ago
KlingAIResearch / VANS
View on GitHub
[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
☆119Feb 28, 2026Updated 5 months ago
SalesforceAIResearch / strefer
View on GitHub
Strefer: Empowering Video LLMs with Space-Time Referring and Reasoning via Synthetic Instruction Data
☆19Jun 2, 2026Updated last month
WisconsinAIVision / edit-one-for-all
View on GitHub
✏️ Edit One for All: Interactive Batch Image Editing (CVPR 2024)
☆69Aug 8, 2024Updated last year
Eliot-Shen / Awesome-Multi-User-Agents
View on GitHub
A curated list of Awesome Multi-User Agents resources
☆17Jul 11, 2026Updated 2 weeks ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
THU-KEG / LongWriter-V
View on GitHub
[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆24Mar 29, 2025Updated last year
mayuelala / FollowYourShape
View on GitHub
[ICLR 2026] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-…
☆71Apr 10, 2026Updated 3 months ago
UCSC-VLAA / Complex-Edit
View on GitHub
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
☆29Apr 22, 2025Updated last year
refkxh / BiCo
View on GitHub
[CVPR 2026 Highlight] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
☆86May 31, 2026Updated last month
pPetrichor / WorldCanvas
View on GitHub
☆147Dec 19, 2025Updated 7 months ago
VinAIResearch / SwiftTry
View on GitHub
☆15Jun 9, 2025Updated last year
ZYM-PKU / UTDesign
View on GitHub
A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images
☆15Jan 6, 2026Updated 6 months ago
WisconsinAIVision / YoChameleon
View on GitHub
🦎 Yo'Chameleon: Your Personalized Chameleon (CVPR 2025)
☆151May 13, 2025Updated last year
daniel3303 / StoryReasoning
View on GitHub
Code for the paper: "StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation"
☆40May 16, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TIGER-AI-Lab / ImagenWorld
View on GitHub
Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]
☆32Apr 2, 2026Updated 3 months ago
umm-emma / emma
View on GitHub
Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."
☆62Dec 16, 2025Updated 7 months ago
zhiyupan42 / VC-STaR
View on GitHub
[ICLR 2026 Oral] Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs
☆19Apr 29, 2026Updated 2 months ago
gemlab-vt / clora
View on GitHub
☆18Oct 15, 2025Updated 9 months ago
little-misfit / GRAG-Image-Editing
View on GitHub
https://little-misfit.github.io/GRAG-Image-Editing/
☆119Nov 27, 2025Updated 8 months ago
lijun2005 / CVPR26-DreamPRVR
View on GitHub
[CVPR 2026 main] Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval
☆37Jun 1, 2026Updated last month
qirui-chen / RGA3-release
View on GitHub
[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring
☆24Aug 8, 2025Updated 11 months ago
ByteDance-BandAI / CodeVision
View on GitHub
[CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images
☆71Jan 23, 2026Updated 6 months ago
Bharath-K3 / MMFace-DiT
View on GitHub
Accepted to IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026.
☆20Jun 2, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
billpsomas / icir
View on GitHub
[NeurIPS 2025] Official implementation of "Instance-Level Composed Image Retrieval".
☆53Dec 22, 2025Updated 7 months ago
NJU-PCALab / DiP
View on GitHub
[CVPR 2026] DiP: Taming Diffusion Models in Pixel Space
☆72Jun 15, 2026Updated last month
libaolu312 / VFXMaster
View on GitHub
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
☆65Apr 7, 2026Updated 3 months ago
MikeWangWZHL / VDLM
View on GitHub
Repo for paper: https://arxiv.org/abs/2404.06479
☆30Oct 3, 2024Updated last year
gitcat-404 / SVGen
View on GitHub
☆27Apr 28, 2026Updated 3 months ago
hzphzp / WeGen
View on GitHub
☆27Apr 25, 2025Updated last year
yunlong10 / Video-R4
View on GitHub
Reinforcing Text-Rich Video Reasoning with Visual Rumination
☆28Jun 5, 2026Updated last month