Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
Alternatives and similar repositories for object-bias
Users who are interested in object-bias are comparing it to the libraries listed below.
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- Handwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 3 years ago
- Optical Character Recognition (OCR / HTR) using Transformers☆11Aug 20, 2022Updated 3 years ago
- ICDAR 2019☆25Aug 2, 2019Updated 6 years ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆23Oct 11, 2025Updated 6 months ago
- ☆16Jun 14, 2024Updated last year
- Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval☆13Dec 15, 2021Updated 4 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆12Jul 18, 2022Updated 3 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆92Jul 16, 2021Updated 4 years ago
- Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval☆65Dec 1, 2022Updated 3 years ago
- This repo contains the code of "Contrastive Supervised Distillation for Continual Representation Learning", Tommaso Barletti, Niccolò Bio…☆20Jul 5, 2022Updated 3 years ago
- ☆32May 28, 2025Updated 10 months ago
- ☆12Mar 8, 2021Updated 5 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆137Oct 18, 2025Updated 5 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆30Jul 12, 2023Updated 2 years ago
- Document Image Enhancement with GANs - TPAMI journal☆218Mar 24, 2023Updated 3 years ago
- Codebase for the paper HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- [ECCV'24] NamedCurves: Learned Image Enhancement via Color Naming☆33Sep 8, 2025Updated 7 months ago
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019☆21Feb 18, 2020Updated 6 years ago
- ☆38Feb 4, 2023Updated 3 years ago
- CLIP-based simple image-text matching baseline for COCO and F30K☆14Sep 16, 2021Updated 4 years ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆186Jan 17, 2025Updated last year
- Code for Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models☆26Oct 29, 2024Updated last year
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆25Apr 14, 2025Updated last year
- This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a…☆13May 29, 2021Updated 4 years ago
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Mar 29, 2023Updated 3 years ago
- ☆11Jan 25, 2019Updated 7 years ago
- Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)☆37May 16, 2022Updated 3 years ago
- Ranking-Consistent Language-Image Pretraining☆12Oct 24, 2025Updated 5 months ago
- Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answer…☆56Oct 30, 2024Updated last year
- ☆11Dec 18, 2024Updated last year
- ☆24Dec 22, 2016Updated 9 years ago
- Fill the boring catsalud covid vaccine form with a console command☆16Jul 17, 2021Updated 4 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- Code for the Human-related Object Detection based on Natural Language Parsing of Image Query Expressions article☆13Aug 8, 2017Updated 8 years ago
- Watermarking LLM papers up-to-date☆11Dec 17, 2023Updated 2 years ago