langtech-bsc / AnonymizationPipelineLinks
Anonymization Pipeline for injesting data from outside of BSC that contains GDPR protected data.
☆14Updated last year
Alternatives and similar repositories for AnonymizationPipeline
Users that are interested in AnonymizationPipeline are comparing it to the libraries listed below
Sorting:
- Framework for working with brat-annotated .ann files☆10Updated 2 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python.☆24Updated last year
- Code for experiments done for EMNLP2020.☆11Updated 2 years ago
- Pre-production releases for Spacy in Catalan☆14Updated 3 years ago
- ☆23Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆34Updated 4 years ago
- ☆43Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆67Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆137Updated 2 years ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆44Updated last year
- Personal information identification standard☆21Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- NLP @ TU Wien☆18Updated 6 months ago
- German Alpaca Dataset (Cleaned + Translated)☆25Updated 2 years ago
- ☆13Updated 9 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆12Updated last year
- Generalist and Lightweight Model for Text Classification☆134Updated 2 weeks ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆106Updated last year
- Explainable Zero-Shot Topic Extraction☆62Updated 10 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- ☆27Updated 4 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆59Updated 10 months ago
- A pre-trained language model for social media text in Spanish☆35Updated 2 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆16Updated last year
- Easy PDF to text to spaCy text extraction in Python.☆39Updated 8 months ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated 11 months ago