epfl-dlab / homepage2vec
Language-Agnostic Website Embedding and Classification
☆41 Updated 11 months ago
Alternatives and similar repositories for homepage2vec:
Users who are interested in homepage2vec are comparing it to the libraries listed below.
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆50 Updated last year
- A spaCy wrapper for DBpedia Spotlight ☆107 Updated last year
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model. ☆36 Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 Updated 3 years ago
- An easy to use framework for large-scale fact-checking and question answering ☆69 Updated last year
- MultiCite code and data. Models are available on Huggingface. ☆29 Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. ☆96 Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated 9 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 Updated 2 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora. ☆29 Updated 6 years ago
- Sentence transformers models for SpaCy ☆107 Updated last year
- SciWING is a modern toolkit for scientific document processing from WING-NUS ☆62 Updated last year
- ☆74 Updated 3 years ago
- ☆52 Updated 10 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆86 Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆93 Updated last year
- ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence (NAACL 2022) ☆25 Updated this week
- ☘️ Code for Convex Aggregation for Opinion Summarization (Iso et al; Findings of EMNLP 2021) ☆34 Updated 2 years ago
- The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics. ☆35 Updated 8 months ago
- ☆29 Updated 3 months ago
- ☆85 Updated 3 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge. ☆68 Updated 3 years ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python ☆101 Updated 3 weeks ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia ☆30 Updated 2 years ago
- ☆17 Updated 2 years ago
- Repro is a library for easily running code from published papers via Docker. ☆40 Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 10 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆117 Updated 9 months ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between… ☆32 Updated last year
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop ☆43 Updated 2 years ago