kevalmorabia97 / CoVA-Web-Object-Detection
A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!
☆93Updated 9 months ago
Alternatives and similar repositories for CoVA-Web-Object-Detection
Users who are interested in CoVA-Web-Object-Detection are comparing it to the libraries listed below.
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆40Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- ☆250Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆25Updated 2 years ago
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆46Updated 4 years ago
- multimodal document analysis☆166Updated last month
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray☆43Updated last year
- ☆58Updated 4 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- Unofficial PyTorch implementation of the DOM-LM paper.☆33Updated 2 years ago
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages☆22Updated 3 years ago
- Build Semantic Search with S-BERT and fine-tune your model in an unsupervised way☆59Updated 3 years ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Updated last month
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆164Updated 2 years ago
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆49Updated 3 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model☆158Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆115Updated last year
- Label data using HuggingFace's transformers and automatically get a prediction service☆193Updated 2 years ago
- Comprehensive NLP Evaluation System☆189Updated last year
- ☆32Updated last year
- A fork of Dragnet that also extracts author, headline, date, keywords from context, as well as built-in metadata extraction all in one pac…☆297Updated 6 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆216Updated 2 years ago
- The largest multilingual image-text classification dataset. It contains fashion products.☆75Updated 2 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆286Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- ☆125Updated 2 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆244Updated 2 years ago