joanrod / ocr-vqgan
View external linksLinks

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers
83Jan 30, 2023Updated 3 years ago

Alternatives and similar repositories for ocr-vqgan

Users that are interested in ocr-vqgan are comparing it to the libraries listed below

Sorting:

Are these results useful?