☆34Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for disentangling_spelling_in_clip
Users that are interested in disentangling_spelling_in_clip are comparing it to the libraries listed below
Sorting:
- CogView2 for GPUs with 12/16/24GB vRAM☆16Jun 24, 2022Updated 3 years ago
- Online BaseHangul Encoder And Decoder☆12Jan 30, 2023Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆11May 27, 2025Updated 9 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- Text-writing denoising diffusion (and much more)☆30May 14, 2023Updated 2 years ago
- ☆28Dec 16, 2021Updated 4 years ago
- ☆20Aug 19, 2021Updated 4 years ago
- ☆30Nov 25, 2021Updated 4 years ago
- Pytorch Implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pr…☆28Jul 31, 2022Updated 3 years ago
- A tool for benchmarking image generation models.☆33Jan 13, 2023Updated 3 years ago
- ☆14Jul 30, 2022Updated 3 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- ☆14May 26, 2023Updated 2 years ago
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Jan 16, 2023Updated 3 years ago
- Hopefully, a compact and general-purpose Python package for Multiperturbation Shapley value Analysis (MSA).☆20Jul 14, 2025Updated 7 months ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Dec 7, 2023Updated 2 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated last year
- ☆22Dec 8, 2022Updated 3 years ago
- The official repository of "Encode-in-Style: Latent-based Video Encoding using StyleGAN2"☆47Feb 15, 2023Updated 3 years ago
- ☆21Nov 7, 2022Updated 3 years ago
- AdaCat☆49Aug 4, 2022Updated 3 years ago
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- ☆21Mar 15, 2022Updated 3 years ago
- Implémentation du papier Colorization Transformer (ICLR 2021) - Version Expérimentale☆17Feb 24, 2021Updated 5 years ago
- 한국어 노이즈 생성을 위한 라이브러리입니다.☆23May 18, 2023Updated 2 years ago
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Mar 29, 2023Updated 2 years ago
- ☆56Sep 9, 2022Updated 3 years ago
- This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…☆26Dec 23, 2023Updated 2 years ago
- ☆24Aug 18, 2023Updated 2 years ago
- A flexible gateway for running ML inference jobs through cloud providers or your own GPU. Powered by Replicate and Cloudflare Workers.☆27Jul 17, 2022Updated 3 years ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆62Jul 8, 2023Updated 2 years ago