openredact / expose-text
This is a prototype of a Python module for simple modification of document files.
☆17Updated 3 years ago
Alternatives and similar repositories for expose-text:
Users that are interested in expose-text are comparing it to the libraries listed below
- This is a prototype of a semi-automatic data anonymization app for German documents.☆20Updated last year
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- Legal Reference Extraction☆29Updated 5 months ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆30Updated 6 years ago
- Neural models for detecting and masking personal information from texts☆15Updated 2 years ago
- Language Model and Text Classification for German Language using Deep Learning☆18Updated 6 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated 8 months ago
- Coreference resolution for German☆16Updated 7 years ago
- Finds linguistic patterns effortlessly☆34Updated last year
- SEM, a free NLP tool relying on machine learning technologies, especially CRFs.☆24Updated 3 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated 10 months ago
- A web application tagging and retrieval of arguments in text☆29Updated last year
- Python tools for interacting with Wikidata☆148Updated last year
- ☆25Updated 4 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- PyPremise - Python tool for the Premise algorithm to identify patterns or explanations of where a machine learning classifier performs we…☆17Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- ☆70Updated 2 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆68Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆93Updated last year
- Plan and train German transformer models.☆23Updated 3 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆16Updated 5 years ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E…☆41Updated 2 years ago
- Training data for the NLPContributionGraph Shared Task 11 at SemEval-2021☆14Updated 4 years ago