opendatalab / CLIP-Parrot-Bias
ECCV 2024: Parrot Captions Teach CLIP to Spot Text
☆66 · Sep 6, 2024 · Updated last year
Alternatives and similar repositories for CLIP-Parrot-Bias
Users who are interested in CLIP-Parrot-Bias are comparing it to the repositories listed below
- ☆73 · May 10, 2024 · Updated last year
- This is an official implementation of GRIT-VLP ☆20 · Aug 8, 2022 · Updated 3 years ago
- AAAI 2024: Visual Instruction Generation and Correction ☆96 · Feb 4, 2024 · Updated 2 years ago
- This is the project page of ShowRoom3D ☆25 · Dec 22, 2023 · Updated 2 years ago
- V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in MLLMs ☆24 · Jul 31, 2025 · Updated 6 months ago
- ☆25 · Nov 7, 2022 · Updated 3 years ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'. ☆15 · Feb 12, 2024 · Updated 2 years ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024] ☆34 · Dec 12, 2023 · Updated 2 years ago
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff… ☆234 · Feb 14, 2025 · Updated last year
- ☆94 · Apr 21, 2025 · Updated 9 months ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024) ☆141 · Apr 16, 2025 · Updated 9 months ago
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection" ☆74 · Oct 22, 2025 · Updated 3 months ago
- ImageNet-12k subset of ImageNet-21k (fall11) ☆21 · Jun 13, 2023 · Updated 2 years ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆86 · Dec 26, 2023 · Updated 2 years ago
- lightweight LAMA inference wrapper ☆26 · Sep 28, 2023 · Updated 2 years ago
- Paper Today I Read ☆27 · Jan 27, 2026 · Updated 2 weeks ago
- Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning ☆28 · Oct 30, 2024 · Updated last year
- ☆27 · Nov 29, 2023 · Updated 2 years ago
- Implementation of layer diffuse inference using refiners ☆25 · Apr 25, 2024 · Updated last year
- HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video ☆68 · Dec 12, 2023 · Updated 2 years ago
- Weakly opinionated library for implementing ML models. Less boilerplate, More rigor ☆21 · Jul 1, 2022 · Updated 3 years ago
- Plan, Posture and Go: Towards Open-World Text-to-Motion Generation ☆42 · Nov 19, 2024 · Updated last year
- ☆13 · Jul 25, 2023 · Updated 2 years ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆51 · Oct 14, 2024 · Updated last year
- Codebase for the paper HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View ☆13 · Jun 5, 2024 · Updated last year
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu… ☆13 · Sep 8, 2023 · Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script ☆10 · Jan 20, 2026 · Updated 3 weeks ago
- ☆18 · Nov 20, 2024 · Updated last year
- 【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting ☆16 · Jul 1, 2025 · Updated 7 months ago
- Official repository for Fourier model that can generate periodic signals ☆10 · Mar 10, 2022 · Updated 3 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆455 · Sep 28, 2025 · Updated 4 months ago
- Unofficial implementation of Layer Diffuse in diffusers ☆27 · Apr 3, 2024 · Updated last year
- Understand Human Behavior to Align True Needs ☆25 · Jul 11, 2024 · Updated last year
- VimTS: A Unified Video and Image Text Spotter ☆78 · Nov 10, 2024 · Updated last year
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM ☆48 · May 24, 2024 · Updated last year
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》 ☆151 · Jun 7, 2023 · Updated 2 years ago
- [ICLR 2025] HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models ☆353 · Mar 14, 2024 · Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆109 · Nov 24, 2025 · Updated 2 months ago
- Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey. ☆48 · Sep 2, 2025 · Updated 5 months ago