NorskRegnesentral / NeuralTextSanitizer
Neural models for detecting and masking personal information from texts
☆16 Updated 3 years ago
Alternatives and similar repositories for NeuralTextSanitizer
Users who are interested in NeuralTextSanitizer are comparing it to the libraries listed below.
- Fine-grained sentiment annotations of NoReC ☆20 Updated 3 years ago
- Semantically Structured Sentence Embeddings ☆69 Updated last year
- ParaNames: A multilingual resource for parallel names ☆37 Updated last year
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge. ☆69 Updated 4 years ago
- Annotated corpus + evaluation metrics for text anonymisation ☆70 Updated 4 months ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆87 Updated 6 months ago
- ☆17 Updated 2 years ago
- A survey of corpora for Germanic low-resource languages and dialects ☆26 Updated 11 months ago
- Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award) ☆36 Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 Updated 4 years ago
- Multilingual Entity Linking model by BELA model ☆12 Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆41 Updated 3 years ago
- SQuARE: Software for question answering research. ☆75 Updated last year
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset" ☆55 Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆30 Updated 3 years ago
- ☆75 Updated 4 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆54 Updated 2 years ago
- ☆10 Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆24 Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ☆157 Updated 3 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆28 Updated 2 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks ☆23 Updated 8 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2… ☆69 Updated 2 years ago
- ☆37 Updated last month
- Collection of NLP model explanations and accompanying analysis tools ☆144 Updated 2 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆38 Updated 3 years ago
- ☆13 Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆81 Updated last year
- ☆22 Updated 3 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio… ☆45 Updated last year