webis-de / ecir21-an-empirical-comparison-of-web-page-segmentation-algorithms
☆26Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for ecir21-an-empirical-comparison-of-web-page-segmentation-algorithms
- Code for "Web Page Segmentation Revisited: Evaluation Framework and Dataset", accepted as resources paper to CIKM 2020☆14Updated last year
- ☆56Updated 3 months ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 6 months ago
- Maximum entropy named-entity recognition (NER)☆13Updated last year
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages☆19Updated 2 years ago
- A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!☆91Updated last year
- A Neural Model for Joint Topic Segmentation and Classification☆34Updated 4 years ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆37Updated last month
- Web content extraction using machine learning☆32Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 3 months ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…☆53Updated 2 years ago
- init☆12Updated 3 years ago
- ☆34Updated 3 years ago
- ☆51Updated 3 years ago
- ☆85Updated 2 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆23Updated 2 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆67Updated last year
- source code of bison☆26Updated 4 years ago
- An easy to use framework for large-scale fact-checking and question answering☆69Updated last year
- ☆37Updated last week
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆146Updated last year
- Extract templated Open Information Extraction☆13Updated 7 years ago
- Sequence tagger based on BERT☆21Updated 2 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia☆30Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)☆11Updated last year
- ☆101Updated 3 years ago
- CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata☆33Updated 2 years ago
- ☆16Updated 4 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆153Updated 2 years ago