joaanna/disentangling_spelling_in_clip

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/joaanna/disentangling_spelling_in_clip)

joaanna / disentangling_spelling_in_clip

☆36

Alternatives and similar repositories for disentangling_spelling_in_clip

Users that are interested in disentangling_spelling_in_clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dzryk / cliptalk
View on GitHub
☆19Aug 19, 2021Updated 4 years ago
lkwq007 / CogView2-low-vram
View on GitHub
CogView2 for GPUs with 12/16/24GB vRAM
☆16Jun 24, 2022Updated 4 years ago
dzryk / clip-grams
View on GitHub
☆30Nov 25, 2021Updated 4 years ago
zipengxuc / PPE
View on GitHub
Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…
☆37Apr 13, 2022Updated 4 years ago
BatsResearch / fudd
View on GitHub
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification
☆11Nov 15, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
facebookresearch / task_bench
View on GitHub
The TaskBench500 dataset and code for generating tasks.
☆16Jul 16, 2022Updated 4 years ago
nostalgebraist / improved-diffusion
View on GitHub
Text-writing denoising diffusion (and much more)
☆30May 14, 2023Updated 3 years ago
hotchpotch / yasem
View on GitHub
YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings
☆13May 22, 2025Updated last year
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28Nov 11, 2022Updated 3 years ago
james-oldfield / PoS-subspaces
View on GitHub
[NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models
☆29Feb 25, 2024Updated 2 years ago
ayaka14732 / basehangul-online
View on GitHub
Online BaseHangul Encoder And Decoder
☆13Jan 30, 2023Updated 3 years ago
nousr / dream-bench
View on GitHub
A tool for benchmarking image generation models.
☆32Jan 13, 2023Updated 3 years ago
CyberAgentAILab / webcolor
View on GitHub
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
☆22Dec 7, 2023Updated 2 years ago
zhangce01 / DualAdapter
View on GitHub
Code for Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models
☆25Oct 29, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ml-jku / cloob
View on GitHub
☆161Jun 13, 2022Updated 4 years ago
EleutherAI / magiCARP
View on GitHub
One stop shop for all things carp
☆58Sep 9, 2022Updated 3 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
View on GitHub
☆14May 26, 2023Updated 3 years ago
KaliYuga-ai / Lithography-Diffusion
View on GitHub
☆14Jul 30, 2022Updated 3 years ago
naver-ai / hmix-gmix
View on GitHub
☆21Nov 7, 2022Updated 3 years ago
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 4 years ago
Holmes-Alan / SR-VAE
View on GitHub
SR-VAE
☆10Jul 26, 2021Updated 5 years ago
UCSB-AI / Discffusion
View on GitHub
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29Apr 27, 2024Updated 2 years ago
NightmareAI / cogflare
View on GitHub
A flexible gateway for running ML inference jobs through cloud providers or your own GPU. Powered by Replicate and Cloudflare Workers.
☆27Jul 17, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
joanrod / ocr-vqgan
View on GitHub
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…
☆85Jan 30, 2023Updated 3 years ago
facebookresearch / SIMAT
View on GitHub
codebase for the SIMAT dataset and evaluation
☆39Feb 16, 2022Updated 4 years ago
alexandonian / contrastive-feature-loss
View on GitHub
PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)
☆55Nov 19, 2021Updated 4 years ago
catlab-team / fantasticstyles
View on GitHub
Repository for Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs
☆28Mar 17, 2022Updated 4 years ago
adobe-research / llava-score
View on GitHub
☆11Oct 2, 2024Updated last year
Miruzuki / Stable-Diffusion-Prompt-Dictionary
View on GitHub
☆14Aug 30, 2022Updated 3 years ago
asgaardlab / CLIPxGamePhysics
View on GitHub
This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…
☆25Dec 23, 2023Updated 2 years ago
crowsonkb / dice-mc
View on GitHub
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
☆33Jul 28, 2023Updated 3 years ago
huggingface / movie-shot-categorizer
View on GitHub
Fine-tune of Florence-2 for shot categorization.
☆26Mar 6, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cat-state / tinypar
View on GitHub
☆20Jul 12, 2023Updated 3 years ago
sarahpratt / CuPL
View on GitHub
☆203May 10, 2023Updated 3 years ago
LuoweiZhou / coco-caption
View on GitHub
kdexd/coco-caption@de6f385
☆26Apr 21, 2020Updated 6 years ago
Data-Intelligence-Lab / DEFT-korean-alpaca
View on GitHub
☆23Oct 30, 2023Updated 2 years ago
wzk1015 / CNMT
View on GitHub
[AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
☆24Mar 29, 2023Updated 3 years ago
trevineoorloff / ExpressiveFaceVideoEncoding
View on GitHub
The official repository of "Encode-in-Style: Latent-based Video Encoding using StyleGAN2"
☆47Feb 15, 2023Updated 3 years ago
facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆791Feb 9, 2023Updated 3 years ago