☆34Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for disentangling_spelling_in_clip
Users that are interested in disentangling_spelling_in_clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for PLoP☆19Mar 6, 2026Updated last month
- CogView2 for GPUs with 12/16/24GB vRAM☆16Jun 24, 2022Updated 3 years ago
- Pytorch Implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pr…☆28Jul 31, 2022Updated 3 years ago
- ☆30Nov 25, 2021Updated 4 years ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 4 years ago
- Text-writing denoising diffusion (and much more)☆30May 14, 2023Updated 2 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 10 months ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆12May 27, 2025Updated 10 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- A tool for benchmarking image generation models.☆33Jan 13, 2023Updated 3 years ago
- [NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models☆29Feb 25, 2024Updated 2 years ago
- ☆21Nov 7, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Dec 7, 2023Updated 2 years ago
- ☆14May 26, 2023Updated 2 years ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆61Jul 8, 2023Updated 2 years ago
- ☆14Jul 30, 2022Updated 3 years ago
- Python Reader for the Ultrasound File Format☆13Aug 7, 2023Updated 2 years ago
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)☆55Nov 19, 2021Updated 4 years ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆29Apr 27, 2024Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Repository for Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs☆28Mar 17, 2022Updated 4 years ago
- ☆14Aug 30, 2022Updated 3 years ago
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆24Jan 27, 2026Updated 2 months ago
- This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…☆26Dec 23, 2023Updated 2 years ago
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated last year
- ☆201May 10, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- kdexd/coco-caption@de6f385☆26Apr 21, 2020Updated 5 years ago
- A flexible gateway for running ML inference jobs through cloud providers or your own GPU. Powered by Replicate and Cloudflare Workers.☆27Jul 17, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆44Jul 23, 2024Updated last year
- ☆20Jul 12, 2023Updated 2 years ago
- Automatic Cardiac MRI Segmentation via Context Aware Recurrent Generative Adversarial Neural Network☆12Feb 6, 2018Updated 8 years ago
- The official repository of "Encode-in-Style: Latent-based Video Encoding using StyleGAN2"☆47Feb 15, 2023Updated 3 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- Code for the papers: "Stop Throwing Away Discriminators! Re-using Adversaries for Test-Time Training", Valvano et al., DART 2021; and "Re…☆10Jan 20, 2022Updated 4 years ago
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Mar 29, 2023Updated 3 years ago