☆29Jun 10, 2024Updated 2 years ago
Alternatives and similar repositories for CountCLIP
Users that are interested in CountCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆96Mar 12, 2025Updated last year
- [ECCV 2024] Teach CLIP to Develop a Number Sense for Ordinal Regression☆22Apr 1, 2025Updated last year
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated 2 years ago
- Ranking-Consistent Language-Image Pretraining☆13Oct 24, 2025Updated 7 months ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆29Feb 27, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- CVPR 2024 Official Repository☆13Mar 27, 2024Updated 2 years ago
- Implementation of the research paper of Cam which is the alternative to the current SOTA☆30Aug 4, 2022Updated 3 years ago
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year
- [NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"☆17Dec 4, 2024Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated 2 years ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Jul 31, 2025Updated 10 months ago
- ☆17Aug 8, 2024Updated last year
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆29May 27, 2025Updated last year
- ☆17Jul 25, 2021Updated 4 years ago
- Code for the CCE algorithm proposed in "Towards Compositionality in Concept Learning" at ICML 2024.☆16Jun 2, 2024Updated 2 years ago
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting☆121Mar 20, 2024Updated 2 years ago
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 6 months ago
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts☆26Feb 28, 2025Updated last year
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆56Apr 7, 2025Updated last year
- [CVPR 2026 Highlight] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering☆101Apr 9, 2026Updated 2 months ago
- Official repo for [CVPR 2026] "SARMAE: Masked Autoencoder for SAR Representation Learning"☆47Apr 15, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆24May 24, 2026Updated 2 weeks ago
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆18Oct 18, 2024Updated last year
- The code for paper entitled "Data-Driven Modulation Optimization with LMMSE Equalization for Reliability Enhancement in Underwater Acoust…☆19Apr 9, 2026Updated 2 months ago
- This is the official repository of the paper "SAR-TEXT: A Large-Scale SAR Image-Text Dataset Built with SAR-Narrator and Progressive Tran…☆35Oct 22, 2025Updated 7 months ago
- PyTorch implementation of ``Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation'' [The Visual Computer…☆24Jan 7, 2025Updated last year
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆20Jan 26, 2025Updated last year
- ☆39May 20, 2025Updated last year
- Repository to perform multi animal pose detection. In particular this code is used for bee pose estimation.☆10Jan 10, 2022Updated 4 years ago
- ☆20May 3, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆47Feb 26, 2026Updated 3 months ago
- [ICML 2025] Official Implementation of "Hessian Geometry of Latent Space in Generative Models"☆19Aug 16, 2025Updated 9 months ago
- Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).☆11Jul 19, 2023Updated 2 years ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- This repo consists of my implementation of DocFormerV2☆12Mar 31, 2024Updated 2 years ago
- Text-to-Drive: Diverse Driving Behaviors Synthesis via Large Language Models☆11Mar 17, 2024Updated 2 years ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆38Nov 12, 2025Updated 6 months ago