kevalmorabia97 / CoVA-Web-Object-DetectionLinks
A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!
☆93Updated 4 months ago
Alternatives and similar repositories for CoVA-Web-Object-Detection
Users that are interested in CoVA-Web-Object-Detection are comparing it to the libraries listed below
Sorting:
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆38Updated 9 months ago
- Semantic search with embeddings: index anything☆139Updated 3 years ago
- ☆247Updated 2 years ago
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages☆20Updated 3 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- ☆652Updated last month
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆44Updated 3 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆79Updated 8 months ago
- Simply, faster, sentence-transformers☆143Updated 10 months ago
- ☆32Updated last year
- Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?☆127Updated last year
- H&M Fashion Image similarity search with Weaviate and DocArray☆43Updated last year
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆50Updated 2 years ago
- The largest multilingual image-text classification dataset. It contains fashion products.☆72Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆24Updated 2 years ago
- multimodal document analysis☆166Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆106Updated 10 months ago
- A python utility for downloading Common Crawl data☆242Updated 2 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆281Updated 2 years ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆126Updated last year
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆72Updated 3 months ago
- Article extraction benchmark: dataset and evaluation scripts☆318Updated last year
- Label data using HuggingFace's transformers and automatically get a prediction service☆190Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆67Updated 2 months ago
- ☆58Updated 3 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆58Updated 3 years ago
- Huggingface inference with GPU Docker on AWS☆41Updated 3 years ago