NorskRegnesentral / NeuralTextSanitizerLinks
Neural models for detecting and masking personal information from texts
☆16Updated 2 years ago
Alternatives and similar repositories for NeuralTextSanitizer
Users that are interested in NeuralTextSanitizer are comparing it to the libraries listed below
Sorting:
- ParaNames: A multilingual resource for parallel names☆34Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 8 months ago
- Fine-grained sentiment annotations of NoReC☆21Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆87Updated 2 months ago
- ☆17Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 4 years ago
- Annotated corpus + evaluation metrics for text anonymisation☆60Updated 2 weeks ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- Multilingual Open Text☆25Updated 3 months ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Updated 4 months ago
- Semantically Structured Sentence Embeddings☆66Updated 9 months ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…☆19Updated 2 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆18Updated last year
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆71Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆54Updated last year
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆69Updated 2 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆23Updated last year
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆58Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated last year
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- Structured Prediction for Entity Linking☆36Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…☆47Updated 8 months ago
- Multilingual Entity Linking model by BELA model☆12Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
- ☆13Updated 3 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 8 months ago
- ☆23Updated 4 years ago