openredact / expose-text
This is a prototype of a Python module for simple modification of document files.
β18Updated 3 years ago
Alternatives and similar repositories for expose-text:
Users that are interested in expose-text are comparing it to the libraries listed below
- This is a prototype of a semi-automatic data anonymization app for German documents.β20Updated 2 years ago
- π Dehyphenation of broken text (mainly German), i.e., extracted from a PDFβ38Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapersβ37Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCyβ24Updated last year
- Python port for IWNLP.Lemmatizerβ17Updated last year
- Legal Reference Extractionβ29Updated 7 months ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissionsβ19Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for Germanβ13Updated 3 years ago
- Language Model and Text Classification for German Language using Deep Learningβ18Updated 6 years ago
- Use spaCy for NLP and output to the FoLiA XML format.β12Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"β18Updated 4 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.β30Updated 6 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsβ54Updated 2 years ago
- Tool for parsing and converting various span encoding schemes.β23Updated last year
- A thin wrapper around the DBpedia Spotlight HTTP APIβ25Updated 7 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidataβ94Updated last year
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers atβ¦β22Updated 7 months ago
- KenLM extension for spaCy 2.0.β16Updated 7 years ago
- Wikidata embeddingβ50Updated 4 months ago
- German sentiment scores with SentiWS as extension for spaCyβ37Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.β19Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and softwareβ66Updated last year
- Named entity recognition for the legal domainβ42Updated 3 years ago
- π§ͺ Cutting-edge experimental spaCy components and featuresβ97Updated 11 months ago
- Plan and train German transformer models.β23Updated 4 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] Germanβ26Updated 3 years ago
- A Named-Entity Recogniser based on Grobid.β51Updated 6 months ago
- β70Updated 2 years ago
- β64Updated 2 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.β22Updated 2 years ago