fidelity / sim2real-docs
Synthesize image datasets of documents in natural scenes with Python+Blender3D
☆59 · Updated 3 years ago
Alternatives and similar repositories for sim2real-docs
Users who are interested in sim2real-docs are comparing it to the repositories listed below
- An open source implementation of CLIP. ☆33 · Updated 3 years ago
- ☆27 · Updated 4 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆59 · Updated 4 years ago
- clip retrieval benchmark ☆17 · Updated 3 years ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch ☆34 · Updated 4 years ago
- Pytorch augmentation ☆119 · Updated last year
- ☆15 · Updated 3 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021 ☆89 · Updated 4 years ago
- ☆15 · Updated 3 years ago
- Explore the image embeddings of Unsplash using CLIP's image similarity ☆48 · Updated 4 years ago
- A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries. ☆72 · Updated 3 years ago
- A U-Net based deep learning model to remove line/box/spurious artifacts from text images. Unsupervised training. ☆59 · Updated 6 years ago
- Pytorch based library to rank predicted bounding boxes using the user's text/image prompts. ☆51 · Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 · Updated 3 years ago
- Load any clip model with a standardized interface ☆21 · Updated last month
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 · Updated 3 years ago
- AdaCat ☆49 · Updated 3 years ago
- ☆24 · Updated 4 years ago
- Code and data for the paper "One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction" ☆61 · Updated 5 years ago
- A simple wrapper library for binding timm models as detectron2 backbones ☆44 · Updated 2 years ago
- Conversions between kornia and other computer vision libraries formats ☆37 · Updated 2 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR ☆80 · Updated 2 years ago
- ☆44 · Updated 4 years ago
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows … ☆133 · Updated 3 years ago
- When Dall E was a baby trained on a bit of data ☆26 · Updated 4 years ago
- (partial) replication of results from https://arxiv.org/abs/1912.07768 ☆26 · Updated 5 years ago
- Optimized library for large-scale extraction of frames and audio from video. ☆205 · Updated 2 years ago
- Imitating someone's handwriting by converting it to the temporal domain and back again ☆77 · Updated 2 years ago
- The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mob… ☆109 · Updated 5 months ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆88 · Updated 3 years ago