π relsim: Relational Visual Similarity | pip install relsim π (CVPR 2026)
β63Feb 21, 2026Updated last week
Alternatives and similar repositories for relsim
Users that are interested in relsim are comparing it to the libraries listed below
Sorting:
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generationβ30Dec 22, 2025Updated 2 months ago
- β39Oct 29, 2025Updated 4 months ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".β15Feb 23, 2026Updated last week
- β75Dec 8, 2025Updated 2 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"β22Jun 5, 2025Updated 9 months ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"β40Jan 29, 2026Updated last month
- Block-Recurrent Dynamics in ViTs π¦β31Dec 24, 2025Updated 2 months ago
- VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorizationβ18Jan 17, 2025Updated last year
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Modelsβ154Sep 24, 2025Updated 5 months ago
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Modelsβ26Mar 26, 2025Updated 11 months ago
- β29May 7, 2025Updated 9 months ago
- β24Jun 4, 2024Updated last year
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT modelβ13Sep 25, 2024Updated last year
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representationsβ12Sep 4, 2024Updated last year
- Transform AI image generation from random exploration into deliberate artistic navigation. This advanced KSampler replacement blends tradβ¦β62Feb 14, 2026Updated 2 weeks ago
- β21Dec 14, 2025Updated 2 months ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidanceβ86Sep 18, 2025Updated 5 months ago
- A ComfyUI custom node that converts videos into iPhone-compatible Live Photos, with features for key frame selection, duration control, aβ¦β37Jan 6, 2025Updated last year
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evolβ¦β22Jan 24, 2026Updated last month
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion.β20Feb 5, 2026Updated last month
- InstantUnify: Integrates Multimodal LLM into Diffusion Models π₯β40Aug 8, 2024Updated last year
- ComfyUI-AutoSplitGridImage: A custom node for ComfyUI that intelligently splits images into grids, combining edge detection for columns aβ¦β43Jan 6, 2025Updated last year
- Q-HEART: ECG Question Answering via Knowledge-Informed Multimodal LLMs (ECAI 2025)β14Jan 23, 2026Updated last month
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Modelsβ74Jan 29, 2026Updated last month
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV β¦β24Dec 4, 2025Updated 3 months ago
- β15Mar 11, 2025Updated 11 months ago
- β16Nov 7, 2025Updated 3 months ago
- ComfyUI custom node to automate batch generation with randomize prompts from text files. It mimics Forge's functionality, allowing you toβ¦β13Aug 23, 2025Updated 6 months ago
- β43Dec 1, 2025Updated 3 months ago
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completionβ12Jan 14, 2026Updated last month
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,β¦β31Feb 10, 2026Updated 3 weeks ago
- [ISBI 2024] Official PyTorch implementation of Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segβ¦β11Aug 12, 2024Updated last year
- This repository extends the mask editor in Comfyui and supports lasso method for applying masksβ14Jul 23, 2025Updated 7 months ago
- β123Jan 28, 2026Updated last month
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPOβ92Dec 1, 2025Updated 3 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"β48Aug 19, 2024Updated last year
- Home Made Diffusion Modelsβ192Dec 9, 2025Updated 2 months ago
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.β23Dec 15, 2025Updated 2 months ago
- Symbolic Graphics Programming with Large Language Modelsβ37Sep 14, 2025Updated 5 months ago