m-arda-aydn / ITACLIP
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
☆18Updated last month
Alternatives and similar repositories for ITACLIP:
Users that are interested in ITACLIP are comparing it to the libraries listed below
- ☆37Updated this week
- The official implementation of "[MASK] is All You Need"☆104Updated last month
- ☆35Updated 6 months ago
- Diffusion Models as Data Mining Tools☆53Updated 3 months ago
- Official code for "DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut", NeurIPS 202…☆34Updated last month
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆34Updated 9 months ago
- More dimensions = More fun☆21Updated 5 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆66Updated last month
- Towards training VQ-VAE models robustly!☆43Updated last week
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆54Updated 6 months ago
- DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence (WACV 2025)☆16Updated last month
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆97Updated 8 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆118Updated 4 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆31Updated last month
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆40Updated 3 months ago
- ☆65Updated 2 months ago
- [ECCV'24] Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance☆118Updated 4 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated 2 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆46Updated 6 months ago
- Code release for "SegLLM: Multi-round Reasoning Segmentation"☆56Updated last week
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆21Updated 3 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆33Updated 6 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆34Updated 6 months ago
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.☆36Updated last month
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆30Updated 2 months ago
- ☆54Updated 6 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆99Updated last month
- ☆32Updated 3 weeks ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆38Updated 2 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆38Updated 2 weeks ago