epfl-dlab / homepage2vec
Language-Agnostic Website Embedding and Classification
☆40Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for homepage2vec
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.☆29Updated 2 years ago
- Python tools for interacting with Wikidata☆141Updated last year
- Reimplementation of a BERT based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model implements also pred…☆69Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 6 months ago
- A spaCy wrapper for DBpedia Spotlight☆105Updated last year
- Entity Disambiguation as text extraction (ACL 2022)☆177Updated 2 years ago
- Repro is a library for easily running code from published papers via Docker.☆40Updated last year
- Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora☆31Updated last month
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆19Updated 4 months ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.☆140Updated 5 months ago
- Research framework for low resource text classification that allows the user to experiment with classification models and active learning…☆97Updated 2 years ago
- A Python Commonsense Knowledge Inference Toolkit☆63Updated 11 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆192Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆153Updated 2 years ago
- Mapping Wikipedia pages to Wikidata IDs and vice versa.☆153Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆58Updated last year
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆36Updated 2 years ago
- An easy to use framework for large-scale fact-checking and question answering☆69Updated last year
- This is a simple Python package for calculating a variety of lexical diversity indices☆65Updated last year
- CoCo-Ex extracts meaningful concepts from natural language texts and maps them to conjunct concept nodes in ConceptNet, utilizing the max…☆58Updated last year
- The AI Knowledge Editor☆182Updated 2 years ago
- Learned string similarity for entity names using optimal transport.☆34Updated 4 years ago
- Testing and training detection models for emoji-based hate speech.☆23Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆153Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆88Updated last year
- This is a repository of the study performed under the Adversarial Paraphrasing Task (APT).☆21Updated 3 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆28Updated 6 years ago
- A simple library for training named entity recognition model from partially annotated data☆21Updated last year