openredact / anonymizer
A Python module that provides multiple anonymization techniques for text (This is only a prototype)
β22Updated 9 months ago
Alternatives and similar repositories for anonymizer:
Users that are interested in anonymizer are comparing it to the libraries listed below
- This is a prototype of a semi-automatic data anonymization app for German documents.β20Updated last year
- This is a prototype of a Python module for simple modification of document files.β17Updated 3 years ago
- π Dehyphenation of broken text (mainly German), i.e., extracted from a PDFβ38Updated 2 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.β21Updated 9 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacingβ68Updated 2 weeks ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iβ¦β46Updated 10 months ago
- Rust-based Python wrapper for duckling library in Haskellβ25Updated 4 years ago
- Plan and train German transformer models.β23Updated 3 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations betweenβ¦β32Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.β151Updated 8 months ago
- BERT and ELECTRA models trained on Europeana Newspapersβ37Updated 3 years ago
- Align the token outputs from Spacy and Huggingface to help understand what language structures transformers seeβ44Updated 2 years ago
- German GPT-2 modelβ32Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β79Updated 7 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheniβ¦β12Updated last year
- β22Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.β117Updated 10 months ago
- Generate reports for spaCy models.β29Updated 2 years ago
- Language detection using Spacy and Fasttextβ55Updated last year
- Python package for deduplication/entity resolution using active learningβ76Updated 5 months ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learnβ¦β12Updated 2 years ago
- Language Model and Text Classification for German Language using Deep Learningβ18Updated 6 years ago
- Confection: the sweetest config system for Pythonβ182Updated 8 months ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- π§ͺ Cutting-edge experimental spaCy components and featuresβ96Updated 9 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.β68Updated 3 years ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognitionβ15Updated last year
- GC4LM: A Colossal (Biased) language model for Germanβ13Updated 3 years ago
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastTextβ10Updated 5 years ago