fidelity / sim2real-docs
Synthesize image datasets of documents in natural scenes with Python+Blender3D
☆58Updated 2 years ago
Alternatives and similar repositories for sim2real-docs:
Users that are interested in sim2real-docs are comparing it to the libraries listed below
- An open source implementation of CLIP.☆32Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆59Updated 4 years ago
- ☆27Updated 4 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆77Updated 3 years ago
- Explore the image embeddings of Unsplash using CLIP's image similarity☆50Updated 4 years ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 4 years ago
- Official PyTorch implementation of RIO☆18Updated 3 years ago
- Load any clip model with a standardized interface☆21Updated 11 months ago
- Another attempt at a long-context / efficient transformer by me☆37Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆23Updated last year
- A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.☆72Updated 2 years ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.☆85Updated 3 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Updated 3 years ago
- ☆17Updated 5 years ago
- A Unet based deeplearning model to line/box/spurious artifacts from text images. Unsupervised training.☆58Updated 5 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆78Updated 2 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆71Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆17Updated 4 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆203Updated last year
- Script and models for clustering LAION-400m CLIP embeddings.☆25Updated 3 years ago
- ☆12Updated 3 years ago
- ☆22Updated 3 years ago
- ☆26Updated 3 years ago
- ☆24Updated 3 years ago
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 4 months ago
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago