☆29Jun 10, 2024Updated last year
Alternatives and similar repositories for CountCLIP
Users that are interested in CountCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆96Mar 12, 2025Updated last year
- [ECCV 2024] Teach CLIP to Develop a Number Sense for Ordinal Regression☆22Apr 1, 2025Updated last year
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated 2 years ago
- Ranking-Consistent Language-Image Pretraining☆13Oct 24, 2025Updated 6 months ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆29Feb 27, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- CVPR 2024 Official Repository☆13Mar 27, 2024Updated 2 years ago
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year
- [NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"☆17Dec 4, 2024Updated last year
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Jul 31, 2025Updated 9 months ago
- ☆17Aug 8, 2024Updated last year
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- ☆11Sep 15, 2023Updated 2 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆23Nov 8, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆29May 27, 2025Updated 11 months ago
- Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes".☆27Jan 11, 2024Updated 2 years ago
- Code for the CCE algorithm proposed in "Towards Compositionality in Concept Learning" at ICML 2024.☆16Jun 2, 2024Updated last year
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting☆121Mar 20, 2024Updated 2 years ago
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 5 months ago
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts☆26Feb 28, 2025Updated last year
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆56Apr 7, 2025Updated last year
- [CVPR 2026 Highlight] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering☆100Apr 9, 2026Updated last month
- Official repo for [CVPR 2026] "SARMAE: Masked Autoencoder for SAR Representation Learning"☆40Apr 15, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)☆58Mar 26, 2025Updated last year
- ☆24Sep 12, 2023Updated 2 years ago
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆18Oct 18, 2024Updated last year
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆39Aug 29, 2023Updated 2 years ago
- This is the official repository of the paper "SAR-TEXT: A Large-Scale SAR Image-Text Dataset Built with SAR-Narrator and Progressive Tran…☆33Oct 22, 2025Updated 7 months ago
- PyTorch implementation of ``Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation'' [The Visual Computer…☆24Jan 7, 2025Updated last year
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆20Jan 26, 2025Updated last year
- ☆39May 20, 2025Updated last year
- Repository to perform multi animal pose detection. In particular this code is used for bee pose estimation.☆10Jan 10, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆20May 3, 2025Updated last year
- [ICML 2025] Official Implementation of "Hessian Geometry of Latent Space in Generative Models"☆18Aug 16, 2025Updated 9 months ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- This repo consists of my implementation of DocFormerV2☆12Mar 31, 2024Updated 2 years ago
- Text-to-Drive: Diverse Driving Behaviors Synthesis via Large Language Models☆11Mar 17, 2024Updated 2 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- ☆21Apr 10, 2023Updated 3 years ago