This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)
☆93 · Mar 19, 2024 · Updated 2 years ago
Alternatives and similar repositories for ICTC
Users that are interested in ICTC are comparing it to the libraries listed below.
- Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback ☆26 · Aug 27, 2023 · Updated 2 years ago
- Rare-to-Frequent (R2F), ICLR'25, Spotlight ☆53 · Apr 23, 2025 · Updated 11 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs. ☆19 · Jun 27, 2024 · Updated last year
- Code for the paper "Image Clustering with External Guidance" (ICML 2024) ☆18 · Oct 26, 2024 · Updated last year
- Project for SNARE benchmark ☆11 · Jun 5, 2024 · Updated last year
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248 ☆59 · Jun 18, 2024 · Updated last year
- ☆13 · Aug 7, 2025 · Updated 7 months ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des… ☆55 · Aug 27, 2025 · Updated 6 months ago
- Official code of MoSA (Mixture of Sparse Adapters). ☆13 · Dec 14, 2023 · Updated 2 years ago
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient ☆32 · Dec 8, 2023 · Updated 2 years ago
- ☆25 · Sep 19, 2023 · Updated 2 years ago
- ☆82 · Oct 13, 2025 · Updated 5 months ago
- Code for the paper "Image Clustering with External Guidance" (ICML 2024) ☆67 · Oct 26, 2024 · Updated last year
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis ☆104 · Jan 18, 2024 · Updated 2 years ago
- Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning. ☆60 · Mar 20, 2024 · Updated 2 years ago
- ☆13 · Aug 14, 2022 · Updated 3 years ago
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models ☆21 · Jan 11, 2024 · Updated 2 years ago
- Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation ☆32 · May 21, 2023 · Updated 2 years ago
- Official implementation of TINC: Tree-structured Implicit Neural Compression ☆22 · Sep 26, 2023 · Updated 2 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges ☆30 · Sep 24, 2023 · Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ☆13 · Sep 30, 2023 · Updated 2 years ago
- A Simple Image Clustering Script using CLIP and Hierarchical Clustering ☆37 · Apr 27, 2023 · Updated 2 years ago
- Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork" ☆33 · Nov 29, 2023 · Updated 2 years ago
- Convert datasets from Hugging Face to FiftyOne for Visualization ☆11 · Mar 15, 2024 · Updated 2 years ago
- ☆16 · May 24, 2024 · Updated last year
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models" ☆15 · Oct 12, 2023 · Updated 2 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models. ☆27 · Oct 27, 2023 · Updated 2 years ago
- Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation" ☆14 · Nov 13, 2023 · Updated 2 years ago
- ☆78 · May 8, 2025 · Updated 10 months ago
- ☆35 · Jan 23, 2024 · Updated 2 years ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation ☆128 · Oct 8, 2024 · Updated last year
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆154 · Jun 25, 2024 · Updated last year
- source code for NeurIPS'24 paper "Towards Calibrated Robust Fine-Tuning of Vision-Language Models" ☆14 · Oct 31, 2025 · Updated 4 months ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment." ☆18 · Oct 7, 2024 · Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning" ☆33 · Mar 26, 2025 · Updated 11 months ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? ☆18 · Jun 3, 2025 · Updated 9 months ago
- ☆44 · Sep 19, 2024 · Updated last year
- CLAIR: A (surprisingly) simple semantic text metric with large language models. ☆22 · Jan 28, 2024 · Updated 2 years ago
- ☆23 · Sep 3, 2020 · Updated 5 years ago