epfl-dlab / homepage2vec
Language-Agnostic Website Embedding and Classification
☆43 Updated last year
Alternatives and similar repositories for homepage2vec:
Users who are interested in homepage2vec are comparing it to the libraries listed below.
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆52 Updated last year
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model. ☆37 Updated 3 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆41 Updated 3 years ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03. ☆24 Updated 9 months ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ☆156 Updated 2 years ago
- The unified platform for data-related resources. ☆135 Updated 2 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022) ☆78 Updated last year
- ☆34 Updated 6 months ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan… ☆15 Updated 2 years ago
- Datasets used for iSarcasmEval shared-task (Task 6 at SemEval 2022) ☆24 Updated 2 years ago
- ☆87 Updated 3 years ago
- GLUCOSE: GeneraLized and COntextualized Story Explanations https://arxiv.org/abs/2009.07758 ☆92 Updated 4 years ago
- StAtutory Reasoning Assessment ☆13 Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 Updated 3 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS ☆63 Updated last year
- Dataset and code for directed sentiment analysis in news text. ☆16 Updated 3 years ago
- MultiCite code and data. Models are available on Huggingface. ☆31 Updated 2 years ago
- ☆75 Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆92 Updated last year
- Learned string similarity for entity names using optimal transport. ☆35 Updated 4 years ago
- ☆54 Updated 3 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021). ☆75 Updated 3 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ☆19 Updated 2 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆27 Updated last year
- A template for starting a new allennlp project using config files and `allennlp train` ☆38 Updated last year
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization ☆72 Updated 3 years ago
- ☆18 Updated 2 years ago
- A Dataset for Direct Quotation Extraction and Attribution in News Articles. ☆13 Updated 3 years ago
- ☆58 Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization. ☆44 Updated 11 months ago