webis-de / cikm20-web-page-segmentation-revisited-evaluation-framework-and-dataset
Code for "Web Page Segmentation Revisited: Evaluation Framework and Dataset", accepted as a resources paper at CIKM 2020
☆14Updated 2 years ago
Alternatives and similar repositories for cikm20-web-page-segmentation-revisited-evaluation-framework-and-dataset
Users who are interested in cikm20-web-page-segmentation-revisited-evaluation-framework-and-dataset are comparing it to the libraries listed below
- ☆26Updated 9 months ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆38Updated 7 months ago
- Implementation of Microsoft's VIPS algorithm in Python☆18Updated 5 years ago
- Implementation of Vision Based Page Segmentation algorithm in Java☆101Updated 5 years ago
- The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describe the semantic type of a UI object on Android ap…☆52Updated 3 years ago
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆42Updated 3 years ago
- Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?☆127Updated last year
- Training/test data for Dragnet☆41Updated 10 years ago
- Web page segmentation and noise removal☆55Updated last year
- Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18☆169Updated 3 years ago
- Semantic Code Search☆35Updated 2 years ago
- Web content extraction using machine learning☆33Updated 4 years ago
- Code and data used to build a training dataset for dragnet models☆10Updated 4 years ago
- A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!☆93Updated 2 months ago
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval☆47Updated 2 years ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Recognize graphical user interface layout by grouping GUI elements according to their visual attributes☆41Updated 2 years ago
- Simple rule-based named entity recognition☆43Updated 3 years ago
- Boilerplate Removal using Deep Learning☆82Updated 3 years ago
- Extraction code used to create the Dresden Web Table Corpus☆14Updated 10 years ago
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆75Updated 11 months ago
- Text similarity using BERT sentence embeddings☆20Updated 5 years ago
- Indri search implementation on top of Lucene search engine☆34Updated last year
- An extension of the Transformers library that adds a T5ForSequenceClassification class.☆38Updated 2 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆41Updated 3 years ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated last year
- Unofficial PyTorch implementation of the Dom-LM paper.☆33Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.☆31Updated 3 years ago
- The dataset includes widget captions that describe UI elements' functionalities. It is used for training and evaluation of the widget ca…☆21Updated 3 years ago