kevalmorabia97 / CoVA-Web-Object-DetectionLinks
A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!
☆93Updated 10 months ago
Alternatives and similar repositories for CoVA-Web-Object-Detection
Users that are interested in CoVA-Web-Object-Detection are comparing it to the libraries listed below
Sorting:
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆46Updated 4 years ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆40Updated last year
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages☆22Updated 3 years ago
- multimodal document analysis☆166Updated last month
- Index of URLs to pdf files all over the internet and scripts☆25Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- ☆249Updated 2 years ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆115Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model☆158Updated 2 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- ☆672Updated 6 months ago
- Simply, faster, sentence-transformers☆143Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆131Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- Semantic search with embeddings: index anything☆140Updated 3 years ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆297Updated 7 months ago
- Comprehensive NLP Evaluation System☆189Updated last year
- Unofficial Pytorch implementation of Dom-LM paper.☆33Updated 2 years ago
- The largest multilingual image-text classification dataset. It contains fashion products.☆75Updated 2 years ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Updated last month
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆244Updated 2 years ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆207Updated last year
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval☆50Updated 3 years ago
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆164Updated 2 years ago
- Article extraction benchmark: dataset and evaluation scripts☆341Updated 3 months ago
- ☆125Updated 2 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Updated 3 years ago
- The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and desc…☆80Updated last year